Center for Machine Learning and Intelligent Systems

Browse Through:

Default Task (filter applied): Classification (56), Regression (16), Clustering (16), Other (2)
Attribute Type (filter applied): Categorical (0), Numerical (16), Mixed (0)
Data Type: Multivariate (11), Univariate (0), Sequential (1), Time-Series (5), Text (7), Domain-Theory (1), Other (0)
Area: Life Sciences (2), Physical Sciences (0), CS / Engineering (13), Social Sciences (0), Business (0), Game (0), Other (1)
# Attributes (filter applied): Less than 10 (15), 10 to 100 (22), Greater than 100 (16)
# Instances: Less than 100 (0), 100 to 1000 (4), Greater than 1000 (12)
Format Type: Matrix (14), Non-Matrix (2)

16 Data Sets

| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| NIPS Conference Papers 1987-2015 | Text | Clustering | Integer | 11463 | 5812 | 2016 |
| TTC-3600: Benchmark dataset for Turkish text categorization | Text | Classification, Clustering | Integer | 3600 | 4814 | 2017 |
| Mturk User-Perceived Clusters over Images | Multivariate, Text | Clustering | Integer | 180 | 500 | 2016 |
| Bag of Words | Text | Clustering | Integer | 8000000 | 100000 | 2008 |
| YouTube Multiview Video Games Dataset | Multivariate, Text | Classification, Clustering | Integer, Real | 120000 | 1000000 | 2013 |
| Epileptic Seizure Recognition | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 11500 | 179 | 2017 |
| Daily and Sports Activities | Multivariate, Time-Series | Classification, Clustering | Real | 9120 | 5625 | 2013 |
| Gas Sensor Array Drift Dataset at Different Concentrations | Multivariate, Time-Series | Classification, Regression, Clustering, Causa | Real | 13910 | 129 | 2013 |
| ElectricityLoadDiagrams20112014 | Time-Series | Regression, Clustering | Real | 370 | 140256 | 2015 |
| DrivFace | Multivariate | Classification, Regression, Clustering | Real | 606 | 6400 | 2016 |
| FMA: A Dataset For Music Analysis | Multivariate, Time-Series | Classification, Clustering | Real | 106574 | 518 | 2017 |
| gene expression cancer RNA-Seq | Multivariate | Classification, Clustering | Real | 801 | 20531 | 2016 |
| Health News in Twitter | Text | Clustering | Real | 58000 | 25000 | 2018 |
| Repeat Consumption Matrices | Multivariate | Clustering | Real | 130000 | 21000 | 2018 |
| detection_of_IoT_botnet_attacks_N_BaIoT | Multivariate, Sequential | Classification, Clustering | Real | 1000000 | 115 | 2018 |
| Reuter_50_50 | Multivariate, Text, Domain-Theory | Classification, Clustering | Real | 2500 | 10000 | 2011 |
