Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (113)
Regression (43)
Clustering (22)
Other (5)

Attribute Type - Undo

Categorical (1)
Numerical (43)
Mixed (2)

Data Type

Multivariate (39)
Univariate (3)
Sequential (5)
Time-Series (14)
Text (5)
Domain-Theory (0)
Other (0)

Area

Life Sciences (8)
Physical Sciences (3)
CS / Engineering (17)
Social Sciences (2)
Business (7)
Game (1)
Other (5)

# Attributes - Undo

Less than 10 (18)
10 to 100 (43)
Greater than 100 (16)

# Instances

Less than 100 (1)
100 to 1000 (14)
Greater than 1000 (28)

Format Type

Matrix (32)
Non-Matrix (11)

43 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Air Quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Air quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Appliances energy prediction

Multivariate, Time-Series 

Regression 

Real 

19735 

29 

2017 

 

Beijing PM2.5 Data

Multivariate, Time-Series 

Regression 

Integer, Real 

43824 

13 

2017 

 

Bike Sharing Dataset

Univariate 

Regression 

Integer, Real 

17389 

16 

2013 

 

Breast Cancer Wisconsin (Prognostic)

Multivariate 

Classification, Regression 

Real 

198 

34 

1995 

 

Buzz in social media

Time-Series, Multivariate 

Regression, Classification 

Integer, Real 

140000 

77 

2013 

 

Cargo 2000 Freight Tracking and Tracing

Multivariate, Sequential 

Classification, Regression 

Integer 

3942 

98 

2016 

 

Concrete Slump Test

Multivariate 

Regression 

Real 

103 

10 

2009 

 

Condition Based Maintenance of Naval Propulsion Plants

Multivariate 

Regression 

Real 

11934 

16 

2014 

 

CSM (Conventional and Social Media Movies) Dataset 2014 and 2015

Multivariate 

Classification, Regression 

Integer 

217 

12 

2017 

 

Daily Demand Forecasting Orders

Time-Series 

Regression 

Integer 

60 

13 

2017 

 

Early biomarkers of Parkinson’s disease based on natural connected speech

Multivariate 

Classification, Regression 

Integer, Real 

130 

65 

2017 

 

Educational Process Mining (EPM): A Learning Analytics Data Set

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

230318 

13 

2015 

 

EEG Steady-State Visual Evoked Potential Signals

Multivariate, Time-Series 

Classification, Regression 

Integer 

9200 

16 

2018 

 

Facebook Comment Volume Dataset

Multivariate 

Regression 

Integer, Real 

40949 

54 

2016 

 

Facebook metrics

Multivariate 

Regression 

Integer 

500 

19 

2016 

 

Fertility

Multivariate 

Classification, Regression 

Real 

100 

10 

2013 

 

Forest Fires

Multivariate 

Regression 

Real 

517 

13 

2008 

 

Gas sensor array under dynamic gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

4178504 

19 

2015 

 

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

153540 

25 

2017 

 

Geographical Original of Music

Multivariate 

Classification, Regression 

Real 

1059 

68 

2014 

 

GPS Trajectories

Multivariate 

Classification, Regression 

Real 

163 

15 

2016 

 

KEGG Metabolic Reaction Network (Undirected)

Multivariate, Univariate, Text 

Classification, Regression, Clustering 

Integer, Real 

65554 

29 

2011 

 

KEGG Metabolic Relation Network (Directed)

Multivariate, Univariate, Text 

Classification, Regression, Clustering 

Integer, Real 

53414 

24 

2011 

 

Las Vegas Strip

 

Classification, Regression 

Integer 

504 

20 

2017 

 

News Popularity in Multiple Social Media Platforms

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

93239 

11 

2018 

 

Online News Popularity

Multivariate 

Classification, Regression 

Integer, Real 

39797 

61 

2015 

 

Online Video Characteristics and Transcoding Time Dataset

Multivariate 

Regression 

Integer, Real 

168286 

11 

2015 

 

Optical Interconnection Network

Multivariate 

Classification, Regression 

Integer, Real 

640 

10 

2018 

 

Paper Reviews

Text 

Classification, Regression 

Integer 

405 

10 

2017 

 

Parkinson Speech Dataset with Multiple Types of Sound Recordings

Multivariate 

Classification, Regression 

Integer, Real 

1040 

26 

2014 

 

Parkinsons Telemonitoring

Multivariate 

Regression 

Integer, Real 

5875 

26 

2009 

 

PM2.5 Data of Five Chinese Cities

Multivariate, Time-Series 

Regression 

Integer, Real 

52854 

86 

2017 

 

SGEMM GPU kernel performance

Multivariate 

Regression 

Integer 

241600 

18 

2018 

 

SkillCraft1 Master Table Dataset

Multivariate 

Regression 

Integer, Real 

3395 

20 

2013 

 

SML2010

Multivariate, Sequential, Time-Series, Text 

Regression 

Real 

4137 

24 

2014 

 

Stock portfolio performance

Multivariate 

Regression 

Real 

315 

12 

2016 

 

Student Performance

Multivariate 

Classification, Regression 

Integer 

649 

33 

2014 

 

Tennis Major Tournament Match Statistics

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

127 

42 

2014 

 

UJIIndoorLoc-Mag

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

40000 

13 

2015 

 

Wine Quality

Multivariate 

Classification, Regression 

Real 

4898 

12 

2009 

 

YearPredictionMSD

Multivariate 

Regression 

Real 

515345 

90 

2011 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML