Center for Machine Learning and Intelligent Systems

Browse Through:

Default Task

Classification (66)
Regression (36)
Clustering (24)
Other (3)

Attribute Type

Categorical (0)
Numerical (77)
Mixed (3)

Data Type - Undo

Multivariate (251)
Univariate (12)
Sequential (47)
Time-Series (87)
Text (45)
Domain-Theory (8)
Other (5)

Area

Life Sciences (11)
Physical Sciences (8)
CS / Engineering (50)
Social Sciences (1)
Business (5)
Game (1)
Other (11)

# Attributes

Less than 10 (25)
10 to 100 (37)
Greater than 100 (21)

# Instances - Undo

Less than 100 (4)
100 to 1000 (23)
Greater than 1000 (87)

Format Type

Matrix (58)
Non-Matrix (29)

87 Data Sets


Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year
Kitsune Network Attack Dataset | Multivariate, Sequential, Time-Series | Classification, Clustering, Causal-Discovery | Real | 27170754 | 115 | 2019
EEG Eye State | Multivariate, Sequential, Time-Series | Classification | Integer, Real | 14980 | 15 | 2013
Activities of Daily Living (ADLs) Recognition Using Binary Sensors | Multivariate, Sequential, Time-Series | Classification, Clustering |  | 2747 |  | 2013
Pedestrian in Traffic Dataset | Multivariate, Sequential, Time-Series | Classification, Regression, Causal-Discovery | Real | 4760 | 14 | 2019
Gesture Phase Segmentation | Multivariate, Sequential, Time-Series | Classification, Clustering | Real | 9900 | 50 | 2014
UJIIndoorLoc-Mag | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer, Real | 40000 | 13 | 2015
Educational Process Mining (EPM): A Learning Analytics Data Set | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer | 230318 | 13 | 2015
Indoor User Movement Prediction from RSS data | Multivariate, Sequential, Time-Series | Classification | Real | 13197 |  | 2016
Online Retail | Multivariate, Sequential, Time-Series | Classification, Clustering | Integer, Real | 541909 |  | 2015
Activity Recognition system based on Multisensor data fusion (AReM) | Multivariate, Sequential, Time-Series | Classification | Real | 42240 |  | 2016
Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer, Real | 153540 | 25 | 2017
Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer | Multivariate, Sequential, Time-Series | Classification | Real | 1540 | 65 | 2016
Ozone Level Detection | Multivariate, Sequential, Time-Series | Classification | Real | 2536 | 73 | 2008
BLE RSSI Dataset for Indoor localization and Navigation | Multivariate, Sequential, Time-Series | Classification, Clustering | Integer | 6611 | 15 | 2018
GNFUV Unmanned Surface Vehicles Sensor Data Set 2 | Multivariate, Sequential, Time-Series | Regression | Real | 10190 |  | 2018
Metro Interstate Traffic Volume | Multivariate, Sequential, Time-Series | Regression | Integer, Real | 48204 |  | 2019
Human Activity Recognition from Continuous Ambient Sensor Data | Multivariate, Sequential, Time-Series | Classification | Integer, Real | 13956534 | 37 | 2019
Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015 | Multivariate, Sequential, Time-Series, Domain-Theory | Clustering, Causal-Discovery | Real | 1710671 |  | 2015
SML2010 | Multivariate, Sequential, Time-Series, Text | Regression | Real | 4137 | 24 | 2014
Online Retail II | Multivariate, Sequential, Time-Series, Text | Classification, Regression, Clustering | Integer, Real | 1067371 |  | 2019
Daily and Sports Activities | Multivariate, Time-Series | Classification, Clustering | Real | 9120 | 5625 | 2013
Bar Crawl: Detecting Heavy Drinking | Multivariate, Time-Series | Classification, Regression | Real | 14057567 |  | 2020
Crop mapping using fused optical-radar data set | Multivariate, Time-Series | Classification | Real | 325834 | 175 | 2020
Gas Sensor Array Drift Dataset at Different Concentrations | Multivariate, Time-Series | Classification, Regression, Clustering, Causal-Discovery | Real | 13910 | 129 | 2013
BitcoinHeistRansomwareAddressDataset | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 2916697 | 10 | 2020
3W dataset | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 1984 |  | 2019
Bar Crawl: Detecting Heavy Drinking | Multivariate, Time-Series | Classification, Regression | Real | 14057567 |  | 2020
REALDISP Activity Recognition Dataset | Multivariate, Time-Series | Classification | Real | 1419 | 120 | 2014
Gas sensor array under dynamic gas mixtures | Multivariate, Time-Series | Classification, Regression | Real | 4178504 | 19 | 2015
Simulated data for survival modelling | Multivariate, Time-Series | Regression | Integer, Real | 120000 | 25 | 2018
Greenhouse Gas Observing Network | Multivariate, Time-Series | Regression | Real | 2921 | 5232 | 2015
Smartphone-Based Recognition of Human Activities and Postural Transitions | Multivariate, Time-Series | Classification | Real | 10929 | 561 | 2015
Productivity Prediction of Garment Employees | Multivariate, Time-Series | Classification, Regression | Integer, Real | 1197 | 15 | 2020
Heterogeneity Activity Recognition | Multivariate, Time-Series | Classification, Clustering | Real | 43930257 | 16 | 2015
AI4I 2020 Predictive Maintenance Dataset | Multivariate, Time-Series | Classification, Regression, Causal-Discovery | Real | 10000 | 14 | 2020
Occupancy Detection | Multivariate, Time-Series | Classification | Real | 20560 |  | 2016
Air Quality | Multivariate, Time-Series | Regression | Real | 9358 | 15 | 2016
Gas sensors for home activity monitoring | Multivariate, Time-Series | Classification | Real | 919438 | 11 | 2016
Australian Sign Language signs | Multivariate, Time-Series | Classification | Categorical, Real | 6650 | 15 | 1999
Australian Sign Language signs (High Quality) | Multivariate, Time-Series | Classification | Real | 2565 | 22 | 2002
Appliances energy prediction | Multivariate, Time-Series | Regression | Real | 19735 | 29 | 2017
Beijing PM2.5 Data | Multivariate, Time-Series | Regression | Integer, Real | 43824 | 13 | 2017
FMA: A Dataset For Music Analysis | Multivariate, Time-Series | Classification, Clustering | Real | 106574 | 518 | 2017
Air quality | Multivariate, Time-Series | Regression | Real | 9358 | 15 | 2016
Epileptic Seizure Recognition | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 11500 | 179 | 2017
PM2.5 Data of Five Chinese Cities | Multivariate, Time-Series | Regression | Integer, Real | 52854 | 86 | 2017
CalIt2 Building People Counts | Multivariate, Time-Series |  | Categorical, Integer | 10080 |  | 2006
Dodgers Loop Sensor | Multivariate, Time-Series |  | Categorical, Integer | 50400 |  | 2006
Dynamic Features of VirusShare Executables | Multivariate, Time-Series | Classification, Regression | Integer | 107888 | 482 | 2017
URL Reputation | Multivariate, Time-Series | Classification | Integer, Real | 2396130 | 3231961 | 2009
Condition monitoring of hydraulic systems | Multivariate, Time-Series | Classification, Regression | Real | 2205 | 43680 | 2018
Spoken Arabic Digit | Multivariate, Time-Series | Classification | Real | 8800 | 13 | 2010
GNFUV Unmanned Surface Vehicles Sensor Data | Multivariate, Time-Series | Regression | Real | 1672 |  | 2018
EEG Steady-State Visual Evoked Potential Signals | Multivariate, Time-Series | Classification, Regression | Integer | 9200 | 16 | 2018
WESAD (Wearable Stress and Affect Detection) | Multivariate, Time-Series | Classification, Regression | Real | 63000000 | 12 | 2018
OPPORTUNITY Activity Recognition | Multivariate, Time-Series | Classification | Real | 2551 | 242 | 2012
PAMAP2 Physical Activity Monitoring | Multivariate, Time-Series | Classification | Real | 3850505 | 52 | 2012
Gas sensor array temperature modulation | Multivariate, Time-Series | Classification, Regression | Real | 4095000 | 20 | 2019
Individual household electric power consumption | Multivariate, Time-Series | Regression, Clustering | Real | 2075259 |  | 2012
PPG-DaLiA | Multivariate, Time-Series | Regression | Real | 8300000 | 11 | 2019
Human Activity Recognition Using Smartphones | Multivariate, Time-Series | Classification, Clustering |  | 10299 | 561 | 2012
Beijing Multi-Site Air-Quality Data | Multivariate, Time-Series | Regression | Integer, Real | 420768 | 18 | 2019
Gas sensor arrays in open sampling settings | Multivariate, Time-Series | Classification | Real | 18000 | 1950000 | 2013
WISDM Smartphone and Smartwatch Activity and Biometrics Dataset | Multivariate, Time-Series | Classification | Real | 15630426 |  | 2019
Real-time Election Results: Portugal 2019 | Multivariate, Time-Series, Text | Regression | Integer, Real | 21643 | 29 | 2019
Detect Malware Types | Multivariate, Time-Series, Text | Classification |  | 7107 | 280 | 2019
News Popularity in Multiple Social Media Platforms | Multivariate, Time-Series, Text | Regression | Integer, Real | 93239 | 11 | 2018
Parking Birmingham | Multivariate, Univariate, Sequential, Time-Series | Classification, Regression, Clustering | Real | 35717 |  | 2019
CNNpred: CNN-based stock market prediction using a diverse set of variables | Sequential, Time-Series | Classification, Regression | Real | 1985 | 84 | 2019
BLE RSSI dataset for Indoor localization | Sequential, Time-Series | Classification | Integer | 23570 |  | 2019
Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014 | Sequential, Time-Series | Classification | Real | 314080 |  | 2015
selfBACK | Time-Series | Classification, Clustering | Real | 26136 |  | 2020
sEMG for Basic Hand movements | Time-Series | Classification | Real | 3000 | 2500 | 2014
Basketball dataset | Time-Series | Classification | Integer | 10000 |  | 2019
Smartphone Dataset for Human Activity Recognition (HAR) in Ambient Assisted Living (AAL) | Time-Series | Classification | Real | 5744 | 561 | 2016
Character Trajectories | Time-Series | Classification, Clustering | Real | 2858 |  | 2008
Simulated Falls and Daily Living Activities Data Set | Time-Series | Classification | Integer | 3060 | 138 | 2018
EMG Physical Action Data Set | Time-Series | Classification | Real | 10000 |  | 2011
Vicon Physical Action Data Set | Time-Series | Classification | Real | 3000 | 27 | 2011
BAUM-1 | Time-Series | Classification |  | 1184 |  | 2018
BAUM-2 | Time-Series | Classification |  | 1047 |  | 2018
EMG data for gestures | Time-Series | Classification | Real | 30000 |  | 2019
MEx | Time-Series | Classification, Clustering | Real | 6262 | 710 | 2019
Amazon Access Samples | Time-Series, Domain-Theory | Regression, Clustering, Causal-Discovery |  | 30000 | 20000 | 2011
Buzz in social media | Time-Series, Multivariate | Regression, Classification | Integer, Real | 140000 | 77 | 2013
Localization Data for Person Activity | Univariate, Sequential, Time-Series | Classification | Real | 164860 |  | 2010
Pseudo Periodic Synthetic Time Series | Univariate, Time-Series |  |  | 100000 |  | 1999
