Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Browse Through:

Default Task - Undo

Classification (29)
Regression (15)
Clustering (13)
Other (1)

Attribute Type

Categorical (0)
Numerical (8)
Mixed (0)

Data Type - Undo

Multivariate (13)
Univariate (3)
Sequential (4)
Time-Series (6)
Text (3)
Domain-Theory (2)
Other (0)

Area - Undo

Life Sciences (6)
Physical Sciences (2)
CS / Engineering (13)
Social Sciences (1)
Business (6)
Game (0)
Other (4)

# Attributes - Undo

Less than 10 (13)
10 to 100 (13)
Greater than 100 (11)

# Instances

Less than 100 (2)
100 to 1000 (6)
Greater than 1000 (4)

Format Type

Matrix (7)
Non-Matrix (6)

13 Data Sets

Table View  List View


1. 3W dataset: The first realistic and public dataset with rare undesirable real events in oil wells.

2. AAAI 2013 Accepted Papers: This data set compromises the metadata for the 2013 AAAI conference's accepted papers (main track only), including paper titles, abstracts, and keywords of varying granularity.

3. AAAI 2014 Accepted Papers: This data set compromises the metadata for the 2014 AAAI conference's accepted papers, including paper titles, authors, abstracts, and keywords of varying granularity.

4. Alcohol QCM Sensor Dataset: Five different QCM gas sensors are used, and five different gas measurements (1-octanol, 1-propanol, 2-butanol, 2-propanol and 1-isobutanol) are conducted in each of these sensors.

5. Dataset for ADL Recognition with Wrist-worn Accelerometer: Recordings of 16 volunteers performing 14 Activities of Daily Living (ADL) while carrying a single wrist-worn tri-axial accelerometer.

6. Dishonest Internet users Dataset: The dataset was used to test an architecture based on a trust model capable to cope with the evaluation of the trustworthiness of users interacting in pervasive environments.

7. Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease: Handwriting database consists of 25 PWP(People with Parkinson) and 15 healthy individuals.Three types of recordings (Static Spiral Test, Dynamic Spiral Test and Stability Test) are taken.

8. Kain Tradisional Sambas: This data set consist of 5 patterns of Kain Tradisional Sambas's features from CFS (Correlation-Based Feature Selection) method which are Angular Second Moment, Contrast, and Correlation

9. Parking Birmingham: Data collected from car parks in Birmingham that are operated by NCP from Birmingham City Council. UK Open Government Licence (OGL). https://data.birmingham.gov.uk/dataset/birmingham-parking

10. Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet: Handwriting database consists of 62 PWP(People with Parkinson) and 15 healthy individuals. Three types of recordings (Static Spiral Test, Dynamic Spiral Test and Stability Test) are taken.

11. Query Analytics Workloads Dataset: The data-set contains three (3) sets of range/radius query workloads from Gaussian distributions over a real dataset; Each query is associated with aggregate scalar values (count/sum/average).

12. Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015: An accurate dataset describing trajectories performed by all the 442 taxis running in the city of Porto, in Portugal.

13. User Knowledge Modeling: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D. Thesis.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML