Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (10)
Regression (3)
Clustering (5)
Other (9)

Attribute Type

Categorical (2)
Numerical (8)
Mixed (3)

Data Type - Undo

Multivariate (299)
Univariate (16)
Sequential (39)
Time-Series (74)
Text (34)
Domain-Theory (22)
Other (21)

Area

Life Sciences (3)
Physical Sciences (2)
CS / Engineering (9)
Social Sciences (1)
Business (3)
Game (2)
Other (2)

# Attributes

Less than 10 (3)
10 to 100 (3)
Greater than 100 (5)

# Instances

Less than 100 (0)
100 to 1000 (7)
Greater than 1000 (8)

Format Type

Matrix (6)
Non-Matrix (16)

22 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Amazon Access Samples

Time-Series, Domain-Theory 

Regression, Clustering, Causal-Discovery 

 

30000 

20000 

2011 

 

Amazon Commerce reviews set

Multivariate, Text, Domain-Theory 

Classification 

Real 

1500 

10000 

2011 

 

Chess (Domain Theories)

Domain-Theory 

 

 

 

 

 

 

Economic Sanctions

Domain-Theory 

 

 

 

 

 

 

Japanese Credit Screening

Multivariate, Domain-Theory 

Classification 

Categorical, Real, Integer 

125 

 

1992 

 

Logic Theorist

Domain-Theory 

 

 

 

 

 

 

Mobile Robots

Domain-Theory 

 

Categorical, Integer, Real 

 

 

1995 

 

Molecular Biology (Promoter Gene Sequences)

Sequential, Domain-Theory 

Classification 

Categorical 

106 

58 

1990 

 

Molecular Biology (Splice-junction Gene Sequences)

Sequential, Domain-Theory 

Classification 

Categorical 

3190 

61 

1992 

 

Moral Reasoner

Domain-Theory 

 

 

202 

 

1994 

 

Othello Domain Theory

Domain-Theory 

 

 

 

 

1991 

 

Perfume Data

Univariate, Domain-Theory 

Classification, Clustering 

Integer 

560 

2014 

 

Prodigy

Domain-Theory 

 

 

 

 

 

 

Qualitative Structure Activity Relationships

Domain-Theory 

 

 

 

 

 

 

Relative location of CT slices on axial axis

Domain-Theory 

Regression 

Real 

53500 

386 

2011 

 

Reuter_50_50

Multivariate, Text, Domain-Theory 

Classification, Clustering 

Real 

2500 

10000 

2011 

 

SMS Spam Collection

Multivariate, Text, Domain-Theory 

Classification, Clustering 

Real 

5574 

 

2012 

 

Student Loan Relational

Domain-Theory 

 

 

1000 

 

1993 

 

Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015

Multivariate, Sequential, Time-Series, Domain-Theory 

Clustering, Causal-Discovery 

Real 

1710671 

2015 

 

Thyroid Disease

Multivariate, Domain-Theory 

Classification 

Categorical, Real 

7200 

21 

1987 

 

Twin gas sensor arrays

Multivariate, Time-Series, Domain-Theory 

Classification, Regression 

Real 

640 

480000 

2016 

 

USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat

Domain-Theory 

Classification 

Integer 

306 

2013 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML