Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (13)
Regression (2)
Clustering (0)
Other (4)

Attribute Type - Undo

Categorical (15)
Numerical (158)
Mixed (19)

Data Type

Multivariate (19)
Univariate (0)
Sequential (0)
Time-Series (3)
Text (0)
Domain-Theory (1)
Other (0)

Area

Life Sciences (4)
Physical Sciences (0)
CS / Engineering (4)
Social Sciences (5)
Business (0)
Game (2)
Other (4)

# Attributes

Less than 10 (6)
10 to 100 (11)
Greater than 100 (2)

# Instances - Undo

Less than 100 (3)
100 to 1000 (28)
Greater than 1000 (19)

Format Type

Matrix (17)
Non-Matrix (2)

19 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Abalone

Multivariate 

Classification 

Categorical, Integer, Real 

4177 

1995 

 

Adult

Multivariate 

Classification 

Categorical, Integer 

48842 

14 

1996 

 

Artificial Characters

Multivariate 

Classification 

Categorical, Integer, Real 

6000 

1992 

 

Australian Sign Language signs

Multivariate, Time-Series 

Classification 

Categorical, Real 

6650 

15 

1999 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Census Income

Multivariate 

Classification 

Categorical, Integer 

48842 

14 

1996 

 

Census-Income (KDD)

Multivariate 

Classification 

Categorical, Integer 

299285 

40 

2000 

 

Chess (King-Rook vs. King)

Multivariate 

Classification 

Categorical, Integer 

28056 

1994 

 

Contraceptive Method Choice

Multivariate 

Classification 

Categorical, Integer 

1473 

1997 

 

Covertype

Multivariate 

Classification 

Categorical, Integer 

581012 

54 

1998 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

Insurance Company Benchmark (COIL 2000)

Multivariate 

Regression, Description 

Categorical, Integer 

9000 

86 

2000 

 

Internet Advertisements

Multivariate 

Classification 

Categorical, Integer, Real 

3279 

1558 

1998 

 

Internet Usage Data

Multivariate 

 

Categorical, Integer 

10104 

72 

1999 

 

IPUMS Census Database

Multivariate 

 

Categorical, Integer 

256932 

61 

1999 

 

KDD Cup 1998 Data

Multivariate 

Regression 

Categorical, Integer 

191779 

481 

1998 

 

KDD Cup 1999 Data

Multivariate 

Classification 

Categorical, Integer 

4000000 

42 

1999 

 

Poker Hand

Multivariate 

Classification 

Categorical, Integer 

1025010 

11 

2007 

 

Thyroid Disease

Multivariate, Domain-Theory 

Classification 

Categorical, Real 

7200 

21 

1987 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML