Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (31)
Regression (6)
Clustering (10)
Other (5)

Attribute Type

Categorical (5)
Numerical (20)
Mixed (11)

Data Type

Multivariate (37)
Univariate (2)
Sequential (4)
Time-Series (8)
Text (2)
Domain-Theory (0)
Other (1)

Area - Undo

Life Sciences (75)
Physical Sciences (36)
CS / Engineering (107)
Social Sciences (18)
Business (22)
Game (8)
Other (42)

# Attributes

Less than 10 (12)
10 to 100 (22)
Greater than 100 (4)

# Instances

Less than 100 (3)
100 to 1000 (14)
Greater than 1000 (23)

Format Type - Undo

Matrix (42)
Non-Matrix (27)

42 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Turkiye Student Evaluation

Multivariate 

Classification, Clustering 

 

5820 

33 

2013 

 

Auto MPG

Multivariate 

Regression 

Categorical, Real 

398 

1993 

 

Automobile

Multivariate 

Regression 

Categorical, Integer, Real 

205 

26 

1987 

 

seismic-bumps

Multivariate 

Classification 

Real 

2584 

19 

2013 

 

Pittsburgh Bridges

Multivariate 

Classification 

Categorical, Integer 

108 

13 

1990 

 

Car Evaluation

Multivariate 

Classification 

Categorical 

1728 

1997 

 

Flags

Multivariate 

Classification 

Categorical, Integer 

194 

30 

1990 

 

StoneFlakes

Multivariate 

Classification, Clustering, Causal-Discovery 

Real 

79 

2014 

 

Tennis Major Tournament Match Statistics

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

127 

42 

2014 

 

Image Segmentation

Multivariate 

Classification 

Real 

2310 

19 

1990 

 

Lenses

Multivariate 

Classification 

Categorical 

24 

1990 

 

Meta-data

Multivariate 

Classification 

Categorical, Integer, Real 

528 

22 

1996 

 

MONK's Problems

Multivariate 

Classification 

Categorical 

432 

1992 

 

Teaching Assistant Evaluation

Multivariate 

Classification 

Categorical, Integer 

151 

1997 

 

Trains

Multivariate 

Classification 

Categorical 

10 

32 

1994 

 

News Aggregator

Multivariate 

Classification, Clustering 

 

422937 

2016 

 

Facebook Comment Volume Dataset

Multivariate 

Regression 

Integer, Real 

40949 

54 

2016 

 

Corel Image Features

Multivariate 

 

Real 

68040 

89 

1999 

 

KDD Cup 1998 Data

Multivariate 

Regression 

Categorical, Integer 

191779 

481 

1998 

 

Statlog (Image Segmentation)

Multivariate 

Classification 

Real 

2310 

19 

1990 

 

Statlog (Vehicle Silhouettes)

Multivariate 

Classification 

Integer 

946 

18 

 

 

Connectionist Bench (Nettalk Corpus)

Multivariate 

 

Categorical 

20008 

 

 

Dexter

Multivariate 

Classification 

Integer 

2600 

20000 

2008 

 

Madelon

Multivariate 

Classification 

Real 

4400 

500 

2008 

 

ICMLA 2014 Accepted Papers Data Set

Multivariate 

Classification, Clustering 

 

105 

2018 

 

AutoUniv

Multivariate 

Classification 

Categorical, Integer, Real 

 

 

2010 

 

YearPredictionMSD

Multivariate 

Regression 

Real 

515345 

90 

2011 

 

Record Linkage Comparison Patterns

Multivariate 

Classification 

Real 

5749132 

12 

2011 

 

QSAR biodegradation

Multivariate 

Classification 

Integer, Real 

1055 

41 

2013 

 

Movie

Multivariate, Relational 

 

 

10000 

 

1999 

 

Libras Movement

Multivariate, Sequential 

Classification, Clustering 

Real 

360 

91 

2009 

 

Gesture Phase Segmentation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Real 

9900 

50 

2014 

 

Australian Sign Language signs

Multivariate, Time-Series 

Classification 

Categorical, Real 

6650 

15 

1999 

 

Australian Sign Language signs (High Quality)

Multivariate, Time-Series 

Classification 

Real 

2565 

22 

2002 

 

Japanese Vowels

Multivariate, Time-Series 

Classification 

Real 

640 

12 

 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

Bach Choral Harmony

Sequential 

Classification 

 

5665 

17 

2014 

 

Bag of Words

Text 

Clustering 

Integer 

8000000 

100000 

2008 

 

Synthetic Control Chart Time Series

Time-Series 

Classification, Clustering 

Real 

600 

 

1999 

 

Activity Recognition from Single Chest-Mounted Accelerometer

Univariate, Sequential, Time-Series 

Classification, Clustering 

Real 

 

 

2014 

 

Badges

Univariate, Text 

Classification 

 

294 

1994 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML