Center for Machine Learning and Intelligent Systems


11 Data Sets


| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| University of Tehran Question Dataset 2016 (UTQD.2016) | Text | Classification | | 1175 | | 2017 |
| Twenty Newsgroups | Text | | | 20000 | | 1999 |
| Spoken Arabic Digit | Multivariate, Time-Series | Classification | Real | 8800 | 13 | 2010 |
| Sentiment Labelled Sentences | Text | Classification | | 3000 | | 2015 |
| Reuters-21578 Text Categorization Collection | Text | Classification | Categorical | 21578 | | 1997 |
| Pseudo Periodic Synthetic Time Series | Univariate, Time-Series | | | 100000 | | 1999 |
| NSF Research Award Abstracts 1990-2003 | Text | | | 129000 | | 2003 |
| Geographical Original of Music | Multivariate | Classification, Regression | Real | 1059 | 68 | 2014 |
| Firm-Teacher_Clave-Direction_Classification | Multivariate | Classification | | 10800 | 20 | 2015 |
| Entree Chicago Recommendation Data | Transactional, Sequential | Recommender-Systems | Categorical | 50672 | | 2000 |
| Air quality | Multivariate, Time-Series | Regression | Real | 9358 | 15 | 2016 |
