Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (12)
Regression (2)
Clustering (2)
Other (5)

Attribute Type

Categorical (2)
Numerical (11)
Mixed (4)

Data Type - Undo

Multivariate (19)
Univariate (0)
Sequential (2)
Time-Series (5)
Text (1)
Domain-Theory (0)
Other (1)

Area - Undo

Life Sciences (18)
Physical Sciences (13)
CS / Engineering (31)
Social Sciences (10)
Business (3)
Game (5)
Other (19)

# Attributes

Less than 10 (4)
10 to 100 (11)
Greater than 100 (3)

# Instances - Undo

Less than 100 (3)
100 to 1000 (12)
Greater than 1000 (19)

Format Type - Undo

Matrix (19)
Non-Matrix (2)

19 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Corel Image Features

Multivariate 

 

Real 

68040 

89 

1999 

 

Movie

Multivariate, Relational 

 

 

10000 

 

1999 

 

Connectionist Bench (Nettalk Corpus)

Multivariate 

 

Categorical 

20008 

 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

seismic-bumps

Multivariate 

Classification 

Real 

2584 

19 

2013 

 

Car Evaluation

Multivariate 

Classification 

Categorical 

1728 

1997 

 

Image Segmentation

Multivariate 

Classification 

Real 

2310 

19 

1990 

 

Australian Sign Language signs

Multivariate, Time-Series 

Classification 

Categorical, Real 

6650 

15 

1999 

 

Australian Sign Language signs (High Quality)

Multivariate, Time-Series 

Classification 

Real 

2565 

22 

2002 

 

Statlog (Image Segmentation)

Multivariate 

Classification 

Real 

2310 

19 

1990 

 

Dexter

Multivariate 

Classification 

Integer 

2600 

20000 

2008 

 

Madelon

Multivariate 

Classification 

Real 

4400 

500 

2008 

 

Record Linkage Comparison Patterns

Multivariate 

Classification 

Real 

5749132 

12 

2011 

 

QSAR biodegradation

Multivariate 

Classification 

Integer, Real 

1055 

41 

2013 

 

Turkiye Student Evaluation

Multivariate 

Classification, Clustering 

 

5820 

33 

2013 

 

Gesture Phase Segmentation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Real 

9900 

50 

2014 

 

KDD Cup 1998 Data

Multivariate 

Regression 

Categorical, Integer 

191779 

481 

1998 

 

YearPredictionMSD

Multivariate 

Regression 

Real 

515345 

90 

2011 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML