Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Browse Through:

Default Task

Classification (19)
Regression (16)
Clustering (6)
Other (1)

Attribute Type

Categorical (0)
Numerical (25)
Mixed (0)

Data Type - Undo

Multivariate (74)
Univariate (2)
Sequential (13)
Time-Series (25)
Text (8)
Domain-Theory (0)
Other (0)

Area - Undo

Life Sciences (5)
Physical Sciences (6)
CS / Engineering (25)
Social Sciences (2)
Business (4)
Game (0)
Other (7)

# Attributes - Undo

Less than 10 (15)
10 to 100 (25)
Greater than 100 (20)

# Instances

Less than 100 (1)
100 to 1000 (2)
Greater than 1000 (22)

Format Type

Matrix (18)
Non-Matrix (7)

25 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

PAMAP2 Physical Activity Monitoring

Multivariate, Time-Series 

Classification 

Real 

3850505 

52 

2012 

 

Buzz in social media

Time-Series, Multivariate 

Regression, Classification 

Integer, Real 

140000 

77 

2013 

 

Predict keywords activities in a online social media

Multivariate, Sequential, Time-Series 

 

Integer, Real 

51 

35 

2013 

 

SML2010

Multivariate, Sequential, Time-Series, Text 

Regression 

Real 

4137 

24 

2014 

 

MHEALTH Dataset

Multivariate, Time-Series 

Classification 

Real 

120 

23 

2014 

 

Gas sensor array under dynamic gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

4178504 

19 

2015 

 

UJIIndoorLoc-Mag

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

40000 

13 

2015 

 

Educational Process Mining (EPM): A Learning Analytics Data Set

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

230318 

13 

2015 

 

Heterogeneity Activity Recognition

Multivariate, Time-Series 

Classification, Clustering 

Real 

43930257 

16 

2015 

 

Air Quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Gas sensors for home activity monitoring

Multivariate, Time-Series 

Classification 

Real 

919438 

11 

2016 

 

Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer

Multivariate, Sequential, Time-Series 

Classification 

Real 

1540 

65 

2016 

 

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

153540 

25 

2017 

 

Appliances energy prediction

Multivariate, Time-Series 

Regression 

Real 

19735 

29 

2017 

 

BLE RSSI Dataset for Indoor localization and Navigation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Integer 

6611 

15 

2018 

 

News Popularity in Multiple Social Media Platforms

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

93239 

11 

2018 

 

WESAD (Wearable Stress and Affect Detection)

Multivariate, Time-Series 

Classification, Regression 

Real 

63000000 

12 

2018 

 

Behavior of the urban traffic of the city of Sao Paulo in Brazil

Multivariate, Time-Series 

Classification, Regression 

Integer, Real 

135 

18 

2018 

 

Gas sensor array temperature modulation

Multivariate, Time-Series 

Classification, Regression 

Real 

4095000 

20 

2019 

 

Pedestrian in Traffic Dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

4760 

14 

2019 

 

PPG-DaLiA

Multivariate, Time-Series 

Regression 

Real 

8300000 

11 

2019 

 

CNNpred: CNN-based stock market prediction using a diverse set of variables

Sequential, Time-Series 

Classification, Regression 

Real 

1985 

84 

2019 

 

BitcoinHeistRansomwareAddressDataset

Multivariate, Time-Series 

Classification, Clustering 

Integer, Real 

2916697 

10 

2020 

 

AI4I 2020 Predictive Maintenance Dataset

Multivariate, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

10000 

14 

2020 

 

Room Occupancy Estimation

Multivariate, Time-Series 

Classification 

Real 

10129 

16 

2021 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML