Center for Machine Learning and Intelligent Systems

Browse Through:

Default Task

Classification (47)
Regression (8)
Clustering (12)
Other (15)

Attribute Type

Categorical (5)
Numerical (25)
Mixed (7)

Data Type

Multivariate (33)
Univariate (3)
Sequential (6)
Time-Series (8)
Text (6)
Domain-Theory (1)
Other (2)

Area

Life Sciences (86)
Physical Sciences (31)
CS / Engineering (115)
Social Sciences (13)
Business (21)
Game (7)
Other (47)

# Attributes

Less than 10 (11)
10 to 100 (25)
Greater than 100 (3)

# Instances

Less than 100 (3)
100 to 1000 (19)
Greater than 1000 (20)

Format Type

Matrix (31)
Non-Matrix (16)

47 Data Sets

| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| Badges | Univariate, Text | Classification | | 294 | | 1994 |
| User Identification From Walking Activity | Univariate, Sequential, Time-Series | Classification, Clustering | Real | | | 2014 |
| Activity Recognition from Single Chest-Mounted Accelerometer | Univariate, Sequential, Time-Series | Classification, Clustering | Real | | | 2014 |
| Synthetic Control Chart Time Series | Time-Series | Classification, Clustering | Real | 600 | | 1999 |
| Sentence Classification | Text | Classification | Integer | | | 2014 |
| Sentiment Labelled Sentences | Text | Classification | | 3000 | | 2015 |
| Reuters-21578 Text Categorization Collection | Text | Classification | Categorical | 21578 | | 1997 |
| University of Tehran Question Dataset 2016 (UTQD.2016) | Text | Classification | | 1175 | | 2017 |
| Legal Case Reports | Text | Classification | | | | 2012 |
| Bach Choral Harmony | Sequential | Classification | | 5665 | 17 | 2014 |
| Hill-Valley | Sequential | Classification | Real | 606 | 101 | 2008 |
| Australian Sign Language signs | Multivariate, Time-Series | Classification | Categorical, Real | 6650 | 15 | 1999 |
| Australian Sign Language signs (High Quality) | Multivariate, Time-Series | Classification | Real | 2565 | 22 | 2002 |
| Japanese Vowels | Multivariate, Time-Series | Classification | Real | 640 | 12 | |
| Spoken Arabic Digit | Multivariate, Time-Series | Classification | Real | 8800 | 13 | 2010 |
| Gesture Phase Segmentation | Multivariate, Sequential, Time-Series | Classification, Clustering | Real | 9900 | 50 | 2014 |
| Libras Movement | Multivariate, Sequential | Classification, Clustering | Real | 360 | 91 | 2009 |
| Turkiye Student Evaluation | Multivariate | Classification, Clustering | | 5820 | 33 | 2013 |
| seismic-bumps | Multivariate | Classification | Real | 2584 | 19 | 2013 |
| Pittsburgh Bridges | Multivariate | Classification | Categorical, Integer | 108 | 13 | 1990 |
| Car Evaluation | Multivariate | Classification | Categorical | 1728 | | 1997 |
| Flags | Multivariate | Classification | Categorical, Integer | 194 | 30 | 1990 |
| StoneFlakes | Multivariate | Classification, Clustering, Causal-Discovery | Real | 79 | | 2014 |
| Tennis Major Tournament Match Statistics | Multivariate | Classification, Regression, Clustering | Integer, Real | 127 | 42 | 2014 |
| Image Segmentation | Multivariate | Classification | Real | 2310 | 19 | 1990 |
| Lenses | Multivariate | Classification | Categorical | 24 | | 1990 |
| Geographical Original of Music | Multivariate | Classification, Regression | Real | 1059 | 68 | 2014 |
| Meta-data | Multivariate | Classification | Categorical, Integer, Real | 528 | 22 | 1996 |
| Firm-Teacher_Clave-Direction_Classification | Multivariate | Classification | | 10800 | 20 | 2015 |
| MONK's Problems | Multivariate | Classification | Categorical | 432 | | 1992 |
| Chronic_Kidney_Disease | Multivariate | Classification | Real | 400 | 25 | 2015 |
| Folio | Multivariate | Classification, Clustering | | 637 | 20 | 2015 |
| Teaching Assistant Evaluation | Multivariate | Classification | Categorical, Integer | 151 | | 1997 |
| Trains | Multivariate | Classification | Categorical | 10 | 32 | 1994 |
| News Aggregator | Multivariate | Classification, Clustering | | 422937 | | 2016 |
| University | Multivariate | Classification | Categorical, Integer | 285 | 17 | 1988 |
| Statlog (Image Segmentation) | Multivariate | Classification | Real | 2310 | 19 | 1990 |
| Statlog (Vehicle Silhouettes) | Multivariate | Classification | Integer | 946 | 18 | |
| Dexter | Multivariate | Classification | Integer | 2600 | 20000 | 2008 |
| Madelon | Multivariate | Classification | Real | 4400 | 500 | 2008 |
| ICMLA 2014 Accepted Papers Data Set | Multivariate | Classification, Clustering | | 105 | | 2018 |
| AutoUniv | Multivariate | Classification | Categorical, Integer, Real | | | 2010 |
| Record Linkage Comparison Patterns | Multivariate | Classification | Real | 5749132 | 12 | 2011 |
| QSAR biodegradation | Multivariate | Classification | Integer, Real | 1055 | 41 | 2013 |
| CMU Face Images | Image | Classification | Integer | 640 | | 1999 |
| USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat | Domain-Theory | Classification | Integer | 306 | | 2013 |
| Connectionist Bench (Vowel Recognition - Deterding Data) | | Classification | Real | 528 | 10 | |
