Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (12)
Regression (6)
Clustering (8)
Other (1)

Attribute Type - Undo

Categorical (3)
Numerical (18)
Mixed (3)

Data Type

Multivariate (11)
Univariate (3)
Sequential (9)
Time-Series (9)
Text (2)
Domain-Theory (2)
Other (0)

Area

Life Sciences (3)
Physical Sciences (1)
CS / Engineering (12)
Social Sciences (0)
Business (1)
Game (0)
Other (1)

# Attributes - Undo

Less than 10 (18)
10 to 100 (33)
Greater than 100 (16)

# Instances

Less than 100 (2)
100 to 1000 (4)
Greater than 1000 (12)

Format Type - Undo

Matrix (53)
Non-Matrix (18)

18 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat

Domain-Theory 

Classification 

Integer 

306 

2013 

 

ser Knowledge Modeling Data (Students' Knowledge Levels on DC Electrical Machines)

Multivariate 

Classification 

Real 

403 

2013 

 

Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease

Multivariate 

Classification, Regression, Clustering 

Real 

40 

2016 

 

Query Analytics Workloads Dataset

Multivariate 

Regression, Clustering 

Real 

260000 

2019 

 

Alcohol QCM Sensor Dataset

Multivariate 

Classification, Regression, Clustering 

Real 

125 

2019 

 

Indoor User Movement Prediction from RSS data

Multivariate, Sequential, Time-Series 

Classification 

Real 

13197 

2016 

 

Activity Recognition system based on Multisensor data fusion (AReM)

Multivariate, Sequential, Time-Series 

Classification 

Real 

42240 

2016 

 

Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015

Multivariate, Sequential, Time-Series, Domain-Theory 

Clustering, Causal-Discovery 

Real 

1710671 

2015 

 

EMG dataset in Lower Limb

Multivariate, Time-Series 

 

Real 

132 

2014 

 

Individual household electric power consumption

Multivariate, Time-Series 

Regression, Clustering 

Real 

2075259 

2012 

 

WISDM Smartphone and Smartwatch Activity and Biometrics Dataset

Multivariate, Time-Series 

Classification 

Real 

15630426 

2019 

 

Parking Birmingham

Multivariate, Univariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Real 

35717 

2019 

 

Activity recognition with healthy older people using a batteryless wearable sensor

Sequential 

Classification 

Real 

75128 

2016 

 

DSRC Vehicle Communications

Sequential, Text 

Clustering 

Real 

10000 

2017 

 

3D Road Network (North Jutland, Denmark)

Sequential, Text 

Regression, Clustering 

Real 

434874 

2013 

 

Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014

Sequential, Time-Series 

Classification 

Real 

314080 

2015 

 

Immunotherapy Dataset

Univariate 

Classification 

Integer, Real 

90 

2018 

 

Localization Data for Person Activity

Univariate, Sequential, Time-Series 

Classification 

Real 

164860 

2010 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML