Center for Machine Learning and Intelligent Systems


56 Data Sets


| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| Chess (Domain Theories) | Domain-Theory | | | | | |
| Document Understanding | | | | | | 1994 |
| EBL Domain Theories | | | | | | |
| Logic Theorist | Domain-Theory | | | | | |
| Moral Reasoner | Domain-Theory | | | 202 | | 1994 |
| Othello Domain Theory | Domain-Theory | | | | | 1991 |
| Prodigy | Domain-Theory | | | | | |
| Qualitative Structure Activity Relationships | Domain-Theory | | | | | |
| Statlog Project | | | | | | 1992 |
| Student Loan Relational | Domain-Theory | | | 1000 | | 1993 |
| Undocumented | | | | | | |
| Twenty Newsgroups | Text | | | 20000 | | 1999 |
| E. Coli Genes | Relational | | | | | 2001 |
| M. Tuberculosis Genes | Relational | | | | | 2001 |
| Movie | Multivariate, Relational | | | 10000 | | 1999 |
| NSF Research Award Abstracts 1990-2003 | Text | | | 129000 | | 2003 |
| Pseudo Periodic Synthetic Time Series | Univariate, Time-Series | | | 100000 | | 1999 |
| UNIX User Data | Text, Sequential | | | | | |
| Economic Sanctions | Domain-Theory | | | | | |
| Protein Data | | | | | | |
| UbiqLog (smartphone lifelogging) | Multivariate | Causal-Discovery | | 9782222 | | 2016 |
| Eco-hotel | Text | | | 401 | | 2017 |
| Opinosis Opinion / Review | Text | | | 51 | | 2010 |
| OpinRank Review Dataset | Text | | | | | 2011 |
| Restaurant & consumer data | Multivariate | | | 138 | 47 | 2012 |
| Anonymous Microsoft Web Data | | Recommender-Systems | Categorical | 37711 | 294 | 1998 |
| Kinship | Relational | Relational-Learning | Categorical | 104 | 12 | 1990 |
| Entree Chicago Recommendation Data | Transactional, Sequential | Recommender-Systems | Categorical | 50672 | | 2000 |
| MSNBC.com Anonymous Web Data | Sequential | | Categorical | 989818 | | |
| Connectionist Bench (Nettalk Corpus) | Multivariate | | Categorical | 20008 | | |
| PANDOR | Multivariate | Recommendation | Categorical | | | 2018 |
| Bach Chorales | Univariate, Time-Series | | Categorical, Integer | 100 | | |
| Diabetes | Multivariate, Time-Series | | Categorical, Integer | | 20 | |
| Internet Usage Data | Multivariate | | Categorical, Integer | 10104 | 72 | 1999 |
| IPUMS Census Database | Multivariate | | Categorical, Integer | 256932 | 61 | 1999 |
| CalIt2 Building People Counts | Multivariate, Time-Series | | Categorical, Integer | 10080 | | 2006 |
| Dodgers Loop Sensor | Multivariate, Time-Series | | Categorical, Integer | 50400 | | 2006 |
| Labor Relations | Multivariate | | Categorical, Integer, Real | 57 | 16 | 1988 |
| Liver Disorders | Multivariate | | Categorical, Integer, Real | 345 | | 1990 |
| Mobile Robots | Domain-Theory | | Categorical, Integer, Real | | | 1995 |
| EEG Database | Multivariate, Time-Series | | Categorical, Integer, Real | 122 | | 1999 |
| Coil 1999 Competition Data | Multivariate | | Categorical, Real | 340 | 17 | 1999 |
| Pioneer-1 Mobile Robot Data | Multivariate, Time-Series | | Categorical, Real | | | 1999 |
| Horton General Hospital | Multivariate, Time-Series | Causal-Discovery | Integer | 139 | | 2019 |
| SIFT10M | Multivariate | Causal-Discovery | Integer | 11164866 | 128 | 2016 |
| KASANDR | Multivariate | Causal-Discovery | Integer | 17764280 | 2158859 | 2017 |
| Abscisic Acid Signaling Network | Multivariate | Causal-Discovery | Integer | 300 | 43 | 2008 |
| QtyT40I10D100K | Sequential | | Integer | 3960456 | | 2012 |
| Predict keywords activities in a online social media | Multivariate, Sequential, Time-Series | | Integer, Real | 51 | 35 | 2013 |
| El Nino | Spatio-temporal | | Integer, Real | 178080 | 12 | 1999 |
| EMG dataset in Lower Limb | Multivariate, Time-Series | | Real | 132 | | 2014 |
| DGP2 - The Second Data Generation Program | Data-Generator | | Real | | | |
| Function Finding | | Function-Learning | Real | 352 | | 1990 |
| ICU | Multivariate, Time-Series | | Real | | | |
| Corel Image Features | Multivariate | | Real | 68040 | 89 | 1999 |
| Cloud | Multivariate | | Real | 1024 | 10 | 1989 |
