Center for Machine Learning and Intelligent Systems


31 Data Sets

| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| Balloons | Multivariate | Classification | Categorical | 16 | | |
| US Census Data (1990) | Multivariate | Clustering | Categorical | 2458285 | 68 | |
| Congressional Voting Records | Multivariate | Classification | Categorical | 435 | 16 | 1987 |
| Labor Relations | Multivariate | | Categorical, Integer, Real | 57 | 16 | 1988 |
| Hayes-Roth | Multivariate | Classification | Categorical | 160 | | 1989 |
| Balance Scale | Multivariate | Classification | Categorical | 625 | | 1994 |
| Adult | Multivariate | Classification | Categorical, Integer | 48842 | 14 | 1996 |
| Census Income | Multivariate | Classification | Categorical, Integer | 48842 | 14 | 1996 |
| Nursery | Multivariate | Classification | Categorical | 12960 | | 1997 |
| IPUMS Census Database | Multivariate | | Categorical, Integer | 256932 | 61 | 1999 |
| Census-Income (KDD) | Multivariate | Classification | Categorical, Integer | 299285 | 40 | 2000 |
| Insurance Company Benchmark (COIL 2000) | Multivariate | Regression, Description | Categorical, Integer | 9000 | 86 | 2000 |
| Communities and Crime | Multivariate | Regression | Real | 1994 | 128 | 2009 |
| Communities and Crime Unnormalized | Multivariate | Regression | Real | 2215 | 147 | 2011 |
| NYSK | Multivariate, Sequential, Text | Clustering | | 10421 | | 2013 |
| BlogFeedback | Multivariate | Regression | Integer, Real | 60021 | 281 | 2014 |
| Student Performance | Multivariate | Classification, Regression | Integer | 649 | 33 | 2014 |
| wiki4HE | Multivariate | Regression, Clustering, Causal-Discovery | | 913 | 53 | 2015 |
| Drug consumption (quantified) | Multivariate | Classification | Real | 1885 | 32 | 2016 |
| Sports articles for objectivity analysis | Multivariate, Text | Classification | Integer | 1000 | 59 | 2018 |
| Multimodal Damage Identification for Humanitarian Computing | Multivariate, Text | Classification | Integer | 5879 | | 2018 |
| GitHub MUSAE | Multivariate | Classification | | 37700 | 4006 | 2019 |
| Real-time Election Results: Portugal 2019 | Multivariate, Time-Series, Text | Regression | Integer, Real | 21643 | 29 | 2019 |
| A study of Asian Religious and Biblical Texts | Multivariate, Text | Classification, Clustering | Integer | 590 | 8265 | 2019 |
| Speaker Accent Recognition | Multivariate | Classification | Real | 329 | 12 | 2020 |
| Facebook Large Page-Page Network | Multivariate | Classification | | 22470 | 4714 | 2020 |
| LastFM Asia Social Network | Multivariate | Classification | | 7624 | 7842 | 2020 |
| LastFM Asia Social Network | Multivariate | Classification | | 7624 | 7842 | 2020 |
| Wisesight Sentiment Corpus | Multivariate, Text | Classification | | 26737 | | 2020 |
| Higher Education Students Performance Evaluation Dataset | Multivariate | Classification | Integer | 145 | 33 | 2021 |
| Gender Gap in Spanish WP | Multivariate | Classification | Integer, Real | 4746 | 21 | 2021 |
