Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (47)
Regression (8)
Clustering (12)
Other (15)

Attribute Type

Categorical (2)
Numerical (2)
Mixed (3)

Data Type

Multivariate (5)
Univariate (2)
Sequential (1)
Time-Series (4)
Text (2)
Domain-Theory (1)
Other (5)

Area - Undo

Life Sciences (10)
Physical Sciences (5)
CS / Engineering (16)
Social Sciences (4)
Business (2)
Game (2)
Other (15)

# Attributes

Less than 10 (4)
10 to 100 (1)
Greater than 100 (0)

# Instances

Less than 100 (0)
100 to 1000 (1)
Greater than 1000 (9)

Format Type

Matrix (5)
Non-Matrix (10)

15 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Bach Chorales

Univariate, Time-Series 

 

Categorical, Integer 

100 

 

 

Pseudo Periodic Synthetic Time Series

Univariate, Time-Series 

 

 

100000 

 

1999 

 

Entree Chicago Recommendation Data

Transactional, Sequential 

Recommender-Systems 

Categorical 

50672 

 

2000 

 

Twenty Newsgroups

Text 

 

 

20000 

 

1999 

 

NSF Research Award Abstracts 1990-2003

Text 

 

 

129000 

 

2003 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

Movie

Multivariate, Relational 

 

 

10000 

 

1999 

 

Corel Image Features

Multivariate 

 

Real 

68040 

89 

1999 

 

Connectionist Bench (Nettalk Corpus)

Multivariate 

 

Categorical 

20008 

 

 

Prodigy

Domain-Theory 

 

 

 

 

 

 

DGP2 - The Second Data Generation Program

Data-Generator 

 

Real 

 

 

 

 

Document Understanding

 

 

 

 

 

1994 

 

Statlog Project

 

 

 

 

 

1992 

 

Undocumented

 

 

 

 

 

 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML