Center for Machine Learning and Intelligent Systems


9 Data Sets



1. Auto MPG: Revised from the CMU StatLib library; the data concern city-cycle fuel consumption

2. Bach Chorales: Time-series data based on chorales; challenge is to learn generative grammar; data in Lisp

3. Badges: Badges labeled with a "+" or "-" as a function of a person's name

4. BuddyMove Data Set: User interest information extracted from user reviews published on holidayiq.com about various types of points of interest in South India

5. ICMLA 2014 Accepted Papers Data Set: This data set comprises the metadata for the 2014 ICMLA conference's accepted papers, including ID, paper titles, authors' keywords, abstracts, and the sessions in which they were presented.

6. MONK's Problems: A set of three artificial domains over the same attribute space; used to test a wide range of induction algorithms

7. Russian Corpus of Biographical Texts: Sentence classification (Russian). The corpus contains Wikipedia texts split into sentences. Each sentence has a topic label.

8. Teaching Assistant Evaluation: The data consist of evaluations of teaching performance; scores are "low", "medium", or "high"

9. USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat: Data used for USPTO Algorithm Competition. Contains drawing pages from US patents with manually labeled figure and part labels.

