Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (45)
Regression (5)
Clustering (7)
Other (35)

Attribute Type

Categorical (3)
Numerical (7)
Mixed (4)

Data Type

Multivariate (6)
Univariate (2)
Sequential (4)
Time-Series (6)
Text (5)
Domain-Theory (9)
Other (11)

Area

Life Sciences (6)
Physical Sciences (3)
CS / Engineering (10)
Social Sciences (3)
Business (1)
Game (2)
Other (10)

# Attributes

Less than 10 (2)
10 to 100 (6)
Greater than 100 (0)

# Instances

Less than 100 (3)
100 to 1000 (7)
Greater than 1000 (6)

Format Type - Undo

Matrix (15)
Non-Matrix (35)

35 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Chess (Domain Theories)

Domain-Theory 

 

 

 

 

 

 

Bach Chorales

Univariate, Time-Series 

 

Categorical, Integer 

100 

 

 

Diabetes

Multivariate, Time-Series 

 

Categorical, Integer 

 

20 

 

 

DGP2 - The Second Data Generation Program

Data-Generator 

 

Real 

 

 

 

 

Document Understanding

 

 

 

 

 

1994 

 

EBL Domain Theories

 

 

 

 

 

 

 

ICU

Multivariate, Time-Series 

 

Real 

 

 

 

 

Labor Relations

Multivariate 

 

Categorical, Integer, Real 

57 

16 

1988 

 

Logic Theorist

Domain-Theory 

 

 

 

 

 

 

Mobile Robots

Domain-Theory 

 

Categorical, Integer, Real 

 

 

1995 

 

Moral Reasoner

Domain-Theory 

 

 

202 

 

1994 

 

Othello Domain Theory

Domain-Theory 

 

 

 

 

1991 

 

Prodigy

Domain-Theory 

 

 

 

 

 

 

Qualitative Structure Activity Relationships

Domain-Theory 

 

 

 

 

 

 

Statlog Project

 

 

 

 

 

1992 

 

Student Loan Relational

Domain-Theory 

 

 

1000 

 

1993 

 

Undocumented

 

 

 

 

 

 

 

Twenty Newsgroups

Text 

 

 

20000 

 

1999 

 

E. Coli Genes

Relational 

 

 

 

 

2001 

 

El Nino

Spatio-temporal 

 

Integer, Real 

178080 

12 

1999 

 

M. Tuberculosis Genes

Relational 

 

 

 

 

2001 

 

MSNBC.com Anonymous Web Data

Sequential 

 

Categorical 

989818 

 

 

 

NSF Research Award Abstracts 1990-2003

Text 

 

 

129000 

 

2003 

 

Pseudo Periodic Synthetic Time Series

Univariate, Time-Series 

 

 

100000 

 

1999 

 

UNIX User Data

Text, Sequential 

 

 

 

 

 

 

Economic Sanctions

Domain-Theory 

 

 

 

 

 

 

Protein Data

 

 

 

 

 

 

 

Predict keywords activities in a online social media

Multivariate, Sequential, Time-Series 

 

Integer, Real 

51 

35 

2013 

 

EMG dataset in Lower Limb

Multivariate, Time-Series 

 

Real 

132 

2014 

 

Opinosis Opinion ⁄ Review

Text 

 

 

51 

 

2010 

 

OpinRank Review Dataset

Text 

 

 

 

 

2011 

 

Abscisic Acid Signaling Network

Multivariate 

Causal-Discovery 

Integer 

300 

43 

2008 

 

Function Finding

 

Function-Learning 

Real 

352 

 

1990 

 

Entree Chicago Recommendation Data

Transactional, Sequential 

Recommender-Systems 

Categorical 

50672 

 

2000 

 

Kinship

Relational 

Relational-Learning 

Categorical 

104 

12 

1990 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML