Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Browse Through:

Default Task - Undo

Classification (253)
Regression (94)
Clustering (71)
Other (19)

Attribute Type

Categorical (4)
Numerical (6)
Mixed (4)

Data Type

Multivariate (11)
Univariate (1)
Sequential (3)
Time-Series (3)
Text (2)
Domain-Theory (0)
Other (3)

Area

Life Sciences (1)
Physical Sciences (2)
CS / Engineering (5)
Social Sciences (1)
Business (0)
Game (0)
Other (9)

# Attributes

Less than 10 (4)
10 to 100 (5)
Greater than 100 (3)

# Instances - Undo

Less than 100 (3)
100 to 1000 (13)
Greater than 1000 (19)

Format Type

Matrix (11)
Non-Matrix (8)

19 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Twenty Newsgroups

Text 

 

 

20000 

 

1999 

 

Movie

Multivariate, Relational 

 

 

10000 

 

1999 

 

NSF Research Award Abstracts 1990-2003

Text 

 

 

129000 

 

2003 

 

Pseudo Periodic Synthetic Time Series

Univariate, Time-Series 

 

 

100000 

 

1999 

 

UbiqLog (smartphone lifelogging)

Multivariate 

Causal-Discovery 

 

9782222 

 

2016 

 

Anonymous Microsoft Web Data

 

Recommender-Systems 

Categorical 

37711 

294 

1998 

 

Entree Chicago Recommendation Data

Transactional, Sequential 

Recommender-Systems 

Categorical 

50672 

 

2000 

 

MSNBC.com Anonymous Web Data

Sequential 

 

Categorical 

989818 

 

 

 

Connectionist Bench (Nettalk Corpus)

Multivariate 

 

Categorical 

20008 

 

 

Internet Usage Data

Multivariate 

 

Categorical, Integer 

10104 

72 

1999 

 

IPUMS Census Database

Multivariate 

 

Categorical, Integer 

256932 

61 

1999 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

SIFT10M

Multivariate 

Causal-Discovery 

Integer 

11164866 

128 

2016 

 

KASANDR

Multivariate 

Causal-Discovery 

Integer 

17764280 

2158859 

2017 

 

QtyT40I10D100K

Sequential 

 

Integer 

3960456 

2012 

 

El Nino

Spatio-temporal 

 

Integer, Real 

178080 

12 

1999 

 

Corel Image Features

Multivariate 

 

Real 

68040 

89 

1999 

 

Cloud

Multivariate 

 

Real 

1024 

10 

1989 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML