Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Browse Through:

Default Task

Classification (88)
Regression (50)
Clustering (34)
Other (11)

Attribute Type

Categorical (0)
Numerical (111)
Mixed (7)

Data Type - Undo

Multivariate (480)
Univariate (30)
Sequential (59)
Time-Series (126)
Text (69)
Domain-Theory (23)
Other (21)

Area

Life Sciences (18)
Physical Sciences (10)
CS / Engineering (65)
Social Sciences (4)
Business (9)
Game (2)
Other (17)

# Attributes

Less than 10 (35)
10 to 100 (50)
Greater than 100 (30)

# Instances

Less than 100 (5)
100 to 1000 (24)
Greater than 1000 (90)

Format Type

Matrix (85)
Non-Matrix (41)

126 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

Bach Chorales

Univariate, Time-Series 

 

Categorical, Integer 

100 

 

 

Diabetes

Multivariate, Time-Series 

 

Categorical, Integer 

 

20 

 

 

ICU

Multivariate, Time-Series 

 

Real 

 

 

 

 

Japanese Vowels

Multivariate, Time-Series 

Classification 

Real 

640 

12 

 

 

Pioneer-1 Mobile Robot Data

Multivariate, Time-Series 

 

Categorical, Real 

 

 

1999 

 

Pseudo Periodic Synthetic Time Series

Univariate, Time-Series 

 

 

100000 

 

1999 

 

Australian Sign Language signs

Multivariate, Time-Series 

Classification 

Categorical, Real 

6650 

15 

1999 

 

Robot Execution Failures

Multivariate, Time-Series 

Classification 

Integer 

463 

90 

1999 

 

Synthetic Control Chart Time Series

Time-Series 

Classification, Clustering 

Real 

600 

 

1999 

 

EEG Database

Multivariate, Time-Series 

 

Categorical, Integer, Real 

122 

1999 

 

Australian Sign Language signs (High Quality)

Multivariate, Time-Series 

Classification 

Real 

2565 

22 

2002 

 

CalIt2 Building People Counts

Multivariate, Time-Series 

 

Categorical, Integer 

10080 

2006 

 

Dodgers Loop Sensor

Multivariate, Time-Series 

 

Categorical, Integer 

50400 

2006 

 

Ozone Level Detection

Multivariate, Sequential, Time-Series 

Classification 

Real 

2536 

73 

2008 

 

Character Trajectories

Time-Series 

Classification, Clustering 

Real 

2858 

2008 

 

URL Reputation

Multivariate, Time-Series 

Classification 

Integer, Real 

2396130 

3231961 

2009 

 

Spoken Arabic Digit

Multivariate, Time-Series 

Classification 

Real 

8800 

13 

2010 

 

Localization Data for Person Activity

Univariate, Sequential, Time-Series 

Classification 

Real 

164860 

2010 

 

PEMS-SF

Multivariate, Time-Series 

Classification 

Real 

440 

138672 

2011 

 

EMG Physical Action Data Set

Time-Series 

Classification 

Real 

10000 

2011 

 

Vicon Physical Action Data Set

Time-Series 

Classification 

Real 

3000 

27 

2011 

 

Amazon Access Samples

Time-Series, Domain-Theory 

Regression, Clustering, Causal-Discovery 

 

30000 

20000 

2011 

 

OPPORTUNITY Activity Recognition

Multivariate, Time-Series 

Classification 

Real 

2551 

242 

2012 

 

PAMAP2 Physical Activity Monitoring

Multivariate, Time-Series 

Classification 

Real 

3850505 

52 

2012 

 

Individual household electric power consumption

Multivariate, Time-Series 

Regression, Clustering 

Real 

2075259 

2012 

 

Human Activity Recognition Using Smartphones

Multivariate, Time-Series 

Classification, Clustering 

 

10299 

561 

2012 

 

Daphnet Freezing of Gait

Multivariate, Time-Series 

Classification 

Real 

237 

2013 

 

Buzz in social media

Time-Series, Multivariate 

Regression, Classification 

Integer, Real 

140000 

77 

2013 

 

ISTANBUL STOCK EXCHANGE

Multivariate, Univariate, Time-Series 

Classification, Regression 

Real 

536 

2013 

 

Gas sensor arrays in open sampling settings

Multivariate, Time-Series 

Classification 

Real 

18000 

1950000 

2013 

 

EEG Eye State

Multivariate, Sequential, Time-Series 

Classification 

Integer, Real 

14980 

15 

2013 

 

Daily and Sports Activities

Multivariate, Time-Series 

Classification, Clustering 

Real 

9120 

5625 

2013 

 

Gas Sensor Array Drift Dataset at Different Concentrations

Multivariate, Time-Series 

Classification, Regression, Clustering, Causa 

Real 

13910 

129 

2013 

 

Activities of Daily Living (ADLs) Recognition Using Binary Sensors

Multivariate, Sequential, Time-Series 

Classification, Clustering 

 

2747 

 

2013 

 

Predict keywords activities in a online social media

Multivariate, Sequential, Time-Series 

 

Integer, Real 

51 

35 

2013 

 

SML2010

Multivariate, Sequential, Time-Series, Text 

Regression 

Real 

4137 

24 

2014 

 

EMG dataset in Lower Limb

Multivariate, Time-Series 

 

Real 

132 

2014 

 

Dataset for ADL Recognition with Wrist-worn Accelerometer

Multivariate, Time-Series 

Classification, Clustering 

 

 

2014 

 

User Identification From Walking Activity

Univariate, Sequential, Time-Series 

Classification, Clustering 

Real 

 

 

2014 

 

Activity Recognition from Single Chest-Mounted Accelerometer

Univariate, Sequential, Time-Series 

Classification, Clustering 

Real 

 

 

2014 

 

Gesture Phase Segmentation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Real 

9900 

50 

2014 

 

REALDISP Activity Recognition Dataset

Multivariate, Time-Series 

Classification 

Real 

1419 

120 

2014 

 

Gas sensor array under flow modulation

Multivariate, Time-Series 

Classification, Regression 

Real 

58 

120432 

2014 

 

Gas sensor array exposed to turbulent gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

180 

150000 

2014 

 

Dow Jones Index

Time-Series 

Classification, Clustering 

Integer, Real 

750 

16 

2014 

 

sEMG for Basic Hand movements

Time-Series 

Classification 

Real 

3000 

2500 

2014 

 

MHEALTH Dataset

Multivariate, Time-Series 

Classification 

Real 

120 

23 

2014 

 

ElectricityLoadDiagrams20112014

Time-Series 

Regression, Clustering 

Real 

370 

140256 

2015 

 

Gas sensor array under dynamic gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

4178504 

19 

2015 

 

Greenhouse Gas Observing Network

Multivariate, Time-Series 

Regression 

Real 

2921 

5232 

2015 

 

Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014

Sequential, Time-Series 

Classification 

Real 

314080 

2015 

 

Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015

Multivariate, Sequential, Time-Series, Domain-Theory 

Clustering, Causal-Discovery 

Real 

1710671 

2015 

 

Smartphone-Based Recognition of Human Activities and Postural Transitions

Multivariate, Time-Series 

Classification 

Real 

10929 

561 

2015 

 

UJIIndoorLoc-Mag

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

40000 

13 

2015 

 

Educational Process Mining (EPM): A Learning Analytics Data Set

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

230318 

13 

2015 

 

Heterogeneity Activity Recognition

Multivariate, Time-Series 

Classification, Clustering 

Real 

43930257 

16 

2015 

 

Online Retail

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Integer, Real 

541909 

2015 

 

Open University Learning Analytics dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

 

 

2015 

 

Indoor User Movement Prediction from RSS data

Multivariate, Sequential, Time-Series 

Classification 

Real 

13197 

2016 

 

Occupancy Detection

Multivariate, Time-Series 

Classification 

Real 

20560 

2016 

 

Smartphone Dataset for Human Activity Recognition (HAR) in Ambient Assisted Living (AAL)

Time-Series 

Classification 

Real 

5744 

561 

2016 

 

Air Quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Activity Recognition system based on Multisensor data fusion (AReM)

Multivariate, Sequential, Time-Series 

Classification 

Real 

42240 

2016 

 

Twin gas sensor arrays

Multivariate, Time-Series, Domain-Theory 

Classification, Regression 

Real 

640 

480000 

2016 

 

Gas sensors for home activity monitoring

Multivariate, Time-Series 

Classification 

Real 

919438 

11 

2016 

 

Air quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer

Multivariate, Sequential, Time-Series 

Classification 

Real 

1540 

65 

2016 

 

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

153540 

25 

2017 

 

Beijing PM2.5 Data

Multivariate, Time-Series 

Regression 

Integer, Real 

43824 

13 

2017 

 

Appliances energy prediction

Multivariate, Time-Series 

Regression 

Real 

19735 

29 

2017 

 

FMA: A Dataset For Music Analysis

Multivariate, Time-Series 

Classification, Clustering 

Real 

106574 

518 

2017 

 

Epileptic Seizure Recognition

Multivariate, Time-Series 

Classification, Clustering 

Integer, Real 

11500 

179 

2017 

 

Data for Software Engineering Teamwork Assessment in Education Setting

Sequential, Time-Series 

Classification 

Integer, Real 

74 

102 

2017 

 

Sales_Transactions_Dataset_Weekly

Multivariate, Time-Series 

Clustering 

Integer, Real 

811 

53 

2017 

 

PM2.5 Data of Five Chinese Cities

Multivariate, Time-Series 

Regression 

Integer, Real 

52854 

86 

2017 

 

Daily Demand Forecasting Orders

Time-Series 

Regression 

Integer 

60 

13 

2017 

 

Dynamic Features of VirusShare Executables

Multivariate, Time-Series 

Classification, Regression 

Integer 

107888 

482 

2017 

 

BLE RSSI Dataset for Indoor localization and Navigation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Integer 

6611 

15 

2018 

 

News Popularity in Multiple Social Media Platforms

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

93239 

11 

2018 

 

Absenteeism at work

Multivariate, Time-Series 

Classification, Clustering 

Integer, Real 

740 

21 

2018 

 

Condition monitoring of hydraulic systems

Multivariate, Time-Series 

Classification, Regression 

Real 

2205 

43680 

2018 

 

GNFUV Unmanned Surface Vehicles Sensor Data

Multivariate, Time-Series 

Regression 

Real 

1672 

2018 

 

Simulated Falls and Daily Living Activities Data Set

Time-Series 

Classification 

Integer 

3060 

138 

2018 

 

EEG Steady-State Visual Evoked Potential Signals

Multivariate, Time-Series 

Classification, Regression 

Integer 

9200 

16 

2018 

 

GNFUV Unmanned Surface Vehicles Sensor Data Set 2

Multivariate, Sequential, Time-Series 

Regression 

Real 

10190 

2018 

 

WESAD (Wearable Stress and Affect Detection)

Multivariate, Time-Series 

Classification, Regression 

Real 

63000000 

12 

2018 

 

BAUM-1

Time-Series 

Classification 

 

1184 

 

2018 

 

BAUM-2

Time-Series 

Classification 

 

1047 

 

2018 

 

Simulated data for survival modelling

Multivariate, Time-Series 

Regression 

Integer, Real 

120000 

25 

2018 

 

Behavior of the urban traffic of the city of Sao Paulo in Brazil

Multivariate, Time-Series 

Classification, Regression 

Integer, Real 

135 

18 

2018 

 

Parking Birmingham

Multivariate, Univariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Real 

35717 

2019 

 

EMG data for gestures

Time-Series 

Classification 

Real 

30000 

2019 

 

Gas sensor array temperature modulation

Multivariate, Time-Series 

Classification, Regression 

Real 

4095000 

20 

2019 

 

Metro Interstate Traffic Volume

Multivariate, Sequential, Time-Series 

Regression 

Integer, Real 

48204 

2019 

 

BLE RSSI dataset for Indoor localization

Sequential, Time-Series 

Classification 

Integer 

23570 

2019 

 

Detect Malware Types

Multivariate, Time-Series, Text 

Classification 

 

7107 

280 

2019 

 

Basketball dataset

Time-Series 

Classification 

Integer 

10000 

2019 

 

Pedestrian in Traffic Dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

4760 

14 

2019 

 

PPG-DaLiA

Multivariate, Time-Series 

Regression 

Real 

8300000 

11 

2019 

 

3W dataset

Multivariate, Time-Series 

Classification, Clustering 

Integer, Real 

1984 

2019 

 

MEx

Time-Series 

Classification, Clustering 

Real 

6262 

710 

2019 

 

Beijing Multi-Site Air-Quality Data

Multivariate, Time-Series 

Regression 

Integer, Real 

420768 

18 

2019 

 

Human Activity Recognition from Continuous Ambient Sensor Data

Multivariate, Sequential, Time-Series 

Classification 

Integer, Real 

13956534 

37 

2019 

 

Online Retail II

Multivariate, Sequential, Time-Series, Text 

Classification, Regression, Clustering 

Integer, Real 

1067371 

2019 

 

WISDM Smartphone and Smartwatch Activity and Biometrics Dataset

Multivariate, Time-Series 

Classification 

Real 

15630426 

2019 

 

Kitsune Network Attack Dataset

Multivariate, Sequential, Time-Series 

Classification, Clustering, Causal-Discovery 

Real 

27170754 

115 

2019 

 

Breath Metabolomics

Multivariate, Time-Series 

Classification, Clustering 

Real 

104 

1656 

2019 

 

Horton General Hospital

Multivariate, Time-Series 

Causal-Discovery 

Integer 

139 

2019 

 

Real-time Election Results: Portugal 2019

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

21643 

29 

2019 

 

CNNpred: CNN-based stock market prediction using a diverse set of variables

Sequential, Time-Series 

Classification, Regression 

Real 

1985 

84 

2019 

 

Bar Crawl: Detecting Heavy Drinking

Multivariate, Time-Series 

Classification, Regression 

Real 

14057567 

2020 

 

Bar Crawl: Detecting Heavy Drinking

Multivariate, Time-Series 

Classification, Regression 

Real 

14057567 

2020 

 

selfBACK

Time-Series 

Classification, Clustering 

Real 

26136 

2020 

 

Crop mapping using fused optical-radar data set

Multivariate, Time-Series 

Classification 

Real 

325834 

175 

2020 

 

BitcoinHeistRansomwareAddressDataset

Multivariate, Time-Series 

Classification, Clustering 

Integer, Real 

2916697 

10 

2020 

 

Productivity Prediction of Garment Employees

Multivariate, Time-Series 

Classification, Regression 

Integer, Real 

1197 

15 

2020 

 

Rocket League Skillshots Data Set

Multivariate, Time-Series 

Classification 

Real 

298 

 

2020 

 

AI4I 2020 Predictive Maintenance Dataset

Multivariate, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

10000 

14 

2020 

 

Intelligent Media Accelerometer and Gyroscope (IM-AccGyro) Dataset

Time-Series 

Classification 

Real 

800 

2020 

 

Image Recognition Task Execution Times in Mobile Edge Computing

Univariate, Sequential, Time-Series 

Regression 

Real 

4000 

2020 

 

Room Occupancy Estimation

Multivariate, Time-Series 

Classification 

Real 

10129 

16 

2021 

 

Hungarian Chickenpox Cases

Time-Series 

Regression 

Real 

521 

20 

2021 

 

Power consumption of Tetouan city

Multivariate, Time-Series 

Regression 

Integer, Real 

52417 

2021 

 

Wikipedia Math Essentials

Time-Series 

Regression 

Real 

731 

1068 

2021 

 

Wikipedia Math Essentials

Time-Series 

Regression 

Real 

731 

1068 

2021 

 

Pedal Me Bicycle Deliveries

Time-Series 

Regression 

Real 

36 

15 

2021 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML