Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (308)
Regression (78)
Clustering (69)
Other (54)

Attribute Type

Categorical (5)
Numerical (12)
Mixed (12)

Data Type

Multivariate (22)
Univariate (2)
Sequential (5)
Time-Series (10)
Text (6)
Domain-Theory (9)
Other (13)

Area

Life Sciences (9)
Physical Sciences (5)
CS / Engineering (16)
Social Sciences (4)
Business (2)
Game (2)
Other (15)

# Attributes

Less than 10 (9)
10 to 100 (12)
Greater than 100 (3)

# Instances

Less than 100 (3)
100 to 1000 (12)
Greater than 1000 (19)

Format Type

Matrix (17)
Non-Matrix (37)

54 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

EMG dataset in Lower Limb

Multivariate, Time-Series 

 

Real 

132 

2014 

 

DGP2 - The Second Data Generation Program

Data-Generator 

 

Real 

 

 

 

 

Function Finding

 

Function-Learning 

Real 

352 

 

1990 

 

ICU

Multivariate, Time-Series 

 

Real 

 

 

 

 

Corel Image Features

Multivariate 

 

Real 

68040 

89 

1999 

 

Cloud

Multivariate 

 

Real 

1024 

10 

1989 

 

Predict keywords activities in a online social media

Multivariate, Sequential, Time-Series 

 

Integer, Real 

51 

35 

2013 

 

El Nino

Spatio-temporal 

 

Integer, Real 

178080 

12 

1999 

 

SIFT10M

Multivariate 

Causal-Discovery 

Integer 

11164866 

128 

2016 

 

KASANDR

Multivariate 

Causal-Discovery 

Integer 

17764280 

2158859 

2017 

 

Abscisic Acid Signaling Network

Multivariate 

Causal-Discovery 

Integer 

300 

43 

2008 

 

QtyT40I10D100K

Sequential 

 

Integer 

3960456 

2012 

 

Coil 1999 Competition Data

Multivariate 

 

Categorical, Real 

340 

17 

1999 

 

Pioneer-1 Mobile Robot Data

Multivariate, Time-Series 

 

Categorical, Real 

 

 

1999 

 

Labor Relations

Multivariate 

 

Categorical, Integer, Real 

57 

16 

1988 

 

Liver Disorders

Multivariate 

 

Categorical, Integer, Real 

345 

1990 

 

Mobile Robots

Domain-Theory 

 

Categorical, Integer, Real 

 

 

1995 

 

EEG Database

Multivariate, Time-Series 

 

Categorical, Integer, Real 

122 

1999 

 

Bach Chorales

Univariate, Time-Series 

 

Categorical, Integer 

100 

 

 

Diabetes

Multivariate, Time-Series 

 

Categorical, Integer 

 

20 

 

 

Internet Usage Data

Multivariate 

 

Categorical, Integer 

10104 

72 

1999 

 

IPUMS Census Database

Multivariate 

 

Categorical, Integer 

256932 

61 

1999 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

Anonymous Microsoft Web Data

 

Recommender-Systems 

Categorical 

37711 

294 

1998 

 

Kinship

Relational 

Relational-Learning 

Categorical 

104 

12 

1990 

 

Entree Chicago Recommendation Data

Transactional, Sequential 

Recommender-Systems 

Categorical 

50672 

 

2000 

 

MSNBC.com Anonymous Web Data

Sequential 

 

Categorical 

989818 

 

 

 

Connectionist Bench (Nettalk Corpus)

Multivariate 

 

Categorical 

20008 

 

 

UbiqLog (smartphone lifelogging)

Multivariate 

Causal-Discovery 

 

9782222 

 

2016 

 

Eco-hotel

Text 

 

 

401 

2017 

 

Opinosis Opinion ⁄ Review

Text 

 

 

51 

 

2010 

 

OpinRank Review Dataset

Text 

 

 

 

 

2011 

 

Restaurant & consumer data

Multivariate 

 

 

138 

47 

2012 

 

Chess (Domain Theories)

Domain-Theory 

 

 

 

 

 

 

Document Understanding

 

 

 

 

 

1994 

 

EBL Domain Theories

 

 

 

 

 

 

 

Logic Theorist

Domain-Theory 

 

 

 

 

 

 

Moral Reasoner

Domain-Theory 

 

 

202 

 

1994 

 

Othello Domain Theory

Domain-Theory 

 

 

 

 

1991 

 

Prodigy

Domain-Theory 

 

 

 

 

 

 

Qualitative Structure Activity Relationships

Domain-Theory 

 

 

 

 

 

 

Statlog Project

 

 

 

 

 

1992 

 

Student Loan Relational

Domain-Theory 

 

 

1000 

 

1993 

 

Undocumented

 

 

 

 

 

 

 

Twenty Newsgroups

Text 

 

 

20000 

 

1999 

 

E. Coli Genes

Relational 

 

 

 

 

2001 

 

M. Tuberculosis Genes

Relational 

 

 

 

 

2001 

 

Movie

Multivariate, Relational 

 

 

10000 

 

1999 

 

NSF Research Award Abstracts 1990-2003

Text 

 

 

129000 

 

2003 

 

Pseudo Periodic Synthetic Time Series

Univariate, Time-Series 

 

 

100000 

 

1999 

 

UNIX User Data

Text, Sequential 

 

 

 

 

 

 

Economic Sanctions

Domain-Theory 

 

 

 

 

 

 

Protein Data

 

 

 

 

 

 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML