Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Browse Through:

Default Task - Undo

Classification (324)
Regression (139)
Clustering (94)
Other (13)

Attribute Type - Undo

Categorical (1)
Numerical (139)
Mixed (5)

Data Type

Multivariate (121)
Univariate (9)
Sequential (17)
Time-Series (49)
Text (11)
Domain-Theory (3)
Other (0)

Area

Life Sciences (23)
Physical Sciences (15)
CS / Engineering (65)
Social Sciences (9)
Business (17)
Game (1)
Other (8)

# Attributes

Less than 10 (41)
10 to 100 (77)
Greater than 100 (18)

# Instances

Less than 100 (7)
100 to 1000 (42)
Greater than 1000 (88)

Format Type

Matrix (110)
Non-Matrix (29)

139 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

IIWA14-R820-Gazebo-Dataset-10Trajectories

 

Regression 

Integer 

 

 

2020 

 

Open University Learning Analytics dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

 

 

2015 

 

Container Crane Controller Data Set

Univariate, Domain-Theory 

Classification, Regression 

Real 

15 

2018 

 

Challenger USA Space Shuttle O-Ring

Multivariate 

Regression 

Integer 

23 

1993 

 

Pedal Me Bicycle Deliveries

Time-Series 

Regression 

Real 

36 

15 

2021 

 

Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease

Multivariate 

Classification, Regression, Clustering 

Real 

40 

2016 

 

Gas sensor array under flow modulation

Multivariate, Time-Series 

Classification, Regression 

Real 

58 

120432 

2014 

 

Daily Demand Forecasting Orders

Time-Series 

Regression 

Integer 

60 

13 

2017 

 

Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet

Multivariate 

Classification, Regression, Clustering 

Integer 

77 

2017 

 

Fertility

Multivariate 

Classification, Regression 

Real 

100 

10 

2013 

 

Concrete Slump Test

Multivariate 

Regression 

Real 

103 

10 

2009 

 

Average Localization Error (ALE) in sensor node localization process in WSNs

Multivariate 

Regression 

Real 

107 

2021 

 

Alcohol QCM Sensor Dataset

Multivariate 

Classification, Regression, Clustering 

Real 

125 

2019 

 

Tennis Major Tournament Match Statistics

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

127 

42 

2014 

 

Early biomarkers of Parkinsons disease based on natural connected speech

Multivariate 

Classification, Regression 

Integer, Real 

130 

65 

2017 

 

Behavior of the urban traffic of the city of Sao Paulo in Brazil

Multivariate, Time-Series 

Classification, Regression 

Integer, Real 

135 

18 

2018 

 

GPS Trajectories

Multivariate 

Classification, Regression 

Real 

163 

15 

2016 

 

Gas sensor array exposed to turbulent gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

180 

150000 

2014 

 

Bone marrow transplant: children

Multivariate 

Classification, Regression 

Integer, Real 

187 

39 

2020 

 

Breast Cancer Wisconsin (Prognostic)

Multivariate 

Classification, Regression 

Real 

198 

34 

1995 

 

Risk Factor prediction of Chronic Kidney Disease

Multivariate 

Classification, Regression 

Real 

202 

29 

2021 

 

Computer Hardware

Multivariate 

Regression 

Integer 

209 

1987 

 

NoisyOffice

Multivariate 

Classification, Regression 

Real 

216 

216 

2015 

 

CSM (Conventional and Social Media Movies) Dataset 2014 and 2015

Multivariate 

Classification, Regression 

Integer 

217 

12 

2017 

 

Algerian Forest Fires Dataset

Multivariate 

Classification, Regression 

Real 

244 

12 

2019 

 

Heart failure clinical records

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

299 

13 

2020 

 

Yacht Hydrodynamics

Multivariate 

Regression 

Real 

308 

2013 

 

Stock portfolio performance

Multivariate 

Regression 

Real 

315 

12 

2016 

 

ElectricityLoadDiagrams20112014

Time-Series 

Regression, Clustering 

Real 

370 

140256 

2015 

 

Residential Building Data Set

Multivariate 

Regression 

Real 

372 

105 

2018 

 

Paper Reviews

Text 

Classification, Regression 

Integer 

405 

10 

2017 

 

Real estate valuation data set

Multivariate 

Regression 

Integer, Real 

414 

2018 

 

Facebook metrics

Multivariate 

Regression 

Integer 

500 

19 

2016 

 

Las Vegas Strip

 

Classification, Regression 

Integer 

504 

20 

2017 

 

Forest Fires

Multivariate 

Regression 

Real 

517 

13 

2008 

 

Hungarian Chickenpox Cases

Time-Series 

Regression 

Real 

521 

20 

2021 

 

ISTANBUL STOCK EXCHANGE

Multivariate, Univariate, Time-Series 

Classification, Regression 

Real 

536 

2013 

 

QSAR aquatic toxicity

Multivariate 

Regression 

Real 

546 

2019 

 

Synchronous Machine Data Set

Multivariate 

Regression 

Real 

557 

2021 

 

Synchronous Machine Data Set

Multivariate 

Regression 

Real 

557 

2021 

 

DrivFace

Multivariate 

Classification, Regression, Clustering 

Real 

606 

6400 

2016 

 

Twin gas sensor arrays

Multivariate, Time-Series, Domain-Theory 

Classification, Regression 

Real 

640 

480000 

2016 

 

Optical Interconnection Network

Multivariate 

Classification, Regression 

Integer, Real 

640 

10 

2018 

 

Student Performance

Multivariate 

Classification, Regression 

Integer 

649 

33 

2014 

 

Water Quality Prediction

Multivariate 

Regression 

Real 

705 

11 

2020 

 

Wikipedia Math Essentials

Time-Series 

Regression 

Real 

731 

1068 

2021 

 

Wikipedia Math Essentials

Time-Series 

Regression 

Real 

731 

1068 

2021 

 

Energy efficiency

Multivariate 

Classification, Regression 

Integer, Real 

768 

2012 

 

QSAR fish toxicity

Multivariate 

Regression 

Real 

908 

2019 

 

South German Credit

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

1000 

21 

2019 

 

South German Credit (UPDATE)

Multivariate 

Classification, Regression, Clustering 

Integer, Real 

1000 

21 

2020 

 

Concrete Compressive Strength

Multivariate 

Regression 

Real 

1030 

2007 

 

Parkinson Speech Dataset with Multiple Types of Sound Recordings

Multivariate 

Classification, Regression 

Integer, Real 

1040 

26 

2014 

 

QSAR fish bioconcentration factor (BCF)

Multivariate 

Regression 

Integer, Real 

1056 

2019 

 

Geographical Original of Music

Multivariate 

Classification, Regression 

Real 

1059 

68 

2014 

 

Productivity Prediction of Garment Employees

Multivariate, Time-Series 

Classification, Regression 

Integer, Real 

1197 

15 

2020 

 

Airfoil Self-Noise

Multivariate 

Regression 

Real 

1503 

2014 

 

GNFUV Unmanned Surface Vehicles Sensor Data

Multivariate, Time-Series 

Regression 

Real 

1672 

2018 

 

CNNpred: CNN-based stock market prediction using a diverse set of variables

Sequential, Time-Series 

Classification, Regression 

Real 

1985 

84 

2019 

 

Communities and Crime

Multivariate 

Regression 

Real 

1994 

128 

2009 

 

Traffic Flow Forecasting

Multivariate 

Regression 

Real 

2101 

47 

2020 

 

Estimation of obesity levels based on eating habits and physical condition

Multivariate 

Classification, Regression, Clustering 

Integer 

2111 

17 

2019 

 

Condition monitoring of hydraulic systems

Multivariate, Time-Series 

Classification, Regression 

Real 

2205 

43680 

2018 

 

Communities and Crime Unnormalized

Multivariate 

Regression 

Real 

2215 

147 

2011 

 

Greenhouse Gas Observing Network

Multivariate, Time-Series 

Regression 

Real 

2921 

5232 

2015 

 

Iranian Churn Dataset

Multivariate 

Classification, Regression 

Integer 

3150 

13 

2020 

 

Iranian Churn Dataset

Multivariate 

Classification, Regression 

Integer 

3150 

13 

2020 

 

SkillCraft1 Master Table Dataset

Multivariate 

Regression 

Integer, Real 

3395 

20 

2013 

 

Cargo 2000 Freight Tracking and Tracing

Multivariate, Sequential 

Classification, Regression 

Integer 

3942 

98 

2016 

 

Image Recognition Task Execution Times in Mobile Edge Computing

Univariate, Sequential, Time-Series 

Regression 

Real 

4000 

2020 

 

Image Recognition Task Execution Times in Mobile Edge Computing

Univariate 

Regression 

Real 

4000 

2021 

 

KDC-4007 dataset Collection

Multivariate, Text 

Classification, Regression 

Integer 

4007 

 

2017 

 

SML2010

Multivariate, Sequential, Time-Series, Text 

Regression 

Real 

4137 

24 

2014 

 

Drug Review Dataset (Druglib.com)

Multivariate, Text 

Classification, Regression, Clustering 

Integer 

4143 

2018 

 

Pedestrian in Traffic Dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

4760 

14 

2019 

 

Wine Quality

Multivariate 

Classification, Regression 

Real 

4898 

12 

2009 

 

Parkinsons Telemonitoring

Multivariate 

Regression 

Integer, Real 

5875 

26 

2009 

 

Bias correction of numerical prediction model temperature forecast

Multivariate 

Regression 

Real 

7750 

25 

2020 

 

Seoul Bike Sharing Demand

Multivariate 

Regression 

Integer, Real 

8760 

14 

2020 

 

EEG Steady-State Visual Evoked Potential Signals

Multivariate, Time-Series 

Classification, Regression 

Integer 

9200 

16 

2018 

 

Air Quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Air quality

Multivariate, Time-Series 

Regression 

Real 

9358 

15 

2016 

 

Combined Cycle Power Plant

Multivariate 

Regression 

Real 

9568 

2014 

 

AI4I 2020 Predictive Maintenance Dataset

Multivariate, Time-Series 

Classification, Regression, Causal-Discovery 

Real 

10000 

14 

2020 

 

Electrical Grid Stability Simulated Data

Multivariate 

Classification, Regression 

Real 

10000 

14 

2018 

 

GNFUV Unmanned Surface Vehicles Sensor Data Set 2

Multivariate, Sequential, Time-Series 

Regression 

Real 

10190 

2018 

 

Carbon Nanotubes

Univariate 

Regression 

Real 

10721 

2018 

 

Condition Based Maintenance of Naval Propulsion Plants

Multivariate 

Regression 

Real 

11934 

16 

2014 

 

Cuff-Less Blood Pressure Estimation

Multivariate 

Classification, Regression 

Real 

12000 

2015 

 

Gas Sensor Array Drift Dataset at Different Concentrations

Multivariate, Time-Series 

Classification, Regression, Clustering, Causa 

Real 

13910 

129 

2013 

 

Bike Sharing Dataset

Univariate 

Regression 

Integer, Real 

17389 

16 

2013 

 

Appliances energy prediction

Multivariate, Time-Series 

Regression 

Real 

19735 

29 

2017 

 

UJIIndoorLoc

Multivariate 

Classification, Regression 

Integer, Real 

21048 

529 

2014 

 

Superconductivty Data

Multivariate 

Regression 

Real 

21263 

81 

2018 

 

Real-time Election Results: Portugal 2019

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

21643 

29 

2019 

 

Demand Forecasting for a store

Multivariate 

Regression 

Integer 

28764 

2019 

 

Steel Industry Energy Consumption Dataset

Multivariate 

Regression 

Integer 

35040 

11 

2020 

 

Parking Birmingham

Multivariate, Univariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Real 

35717 

2019 

 

Gas Turbine CO and NOx Emission Data Set

Multivariate 

Regression, Clustering 

Real 

36733 

11 

2019 

 

Online News Popularity

Multivariate 

Classification, Regression 

Integer, Real 

39797 

61 

2015 

 

UJIIndoorLoc-Mag

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

40000 

13 

2015 

 

Facebook Comment Volume Dataset

Multivariate 

Regression 

Integer, Real 

40949 

54 

2016 

 

Beijing PM2.5 Data

Multivariate, Time-Series 

Regression 

Integer, Real 

43824 

13 

2017 

 

Physicochemical Properties of Protein Tertiary Structure

Multivariate 

Regression 

Real 

45730 

2013 

 

Tamilnadu Electricity Board Hourly Readings

Multivariate 

Classification, Regression, Clustering 

Real 

45781 

2013 

 

Metro Interstate Traffic Volume

Multivariate, Sequential, Time-Series 

Regression 

Integer, Real 

48204 

2019 

 

Power consumption of Tetouan city

Multivariate, Time-Series 

Regression 

Integer, Real 

52417 

2021 

 

PM2.5 Data of Five Chinese Cities

Multivariate, Time-Series 

Regression 

Integer, Real 

52854 

86 

2017 

 

KEGG Metabolic Relation Network (Directed)

Multivariate, Univariate, Text 

Classification, Regression, Clustering 

Integer, Real 

53414 

24 

2011 

 

Relative location of CT slices on axial axis

Domain-Theory 

Regression 

Real 

53500 

386 

2011 

 

BlogFeedback

Multivariate 

Regression 

Integer, Real 

60021 

281 

2014 

 

KEGG Metabolic Reaction Network (Undirected)

Multivariate, Univariate, Text 

Classification, Regression, Clustering 

Integer, Real 

65554 

29 

2011 

 

News Popularity in Multiple Social Media Platforms

Multivariate, Time-Series, Text 

Regression 

Integer, Real 

93239 

11 

2018 

 

Dynamic Features of VirusShare Executables

Multivariate, Time-Series 

Classification, Regression 

Integer 

107888 

482 

2017 

 

Simulated data for survival modelling

Multivariate, Time-Series 

Regression 

Integer, Real 

120000 

25 

2018 

 

Buzz in social media

Time-Series, Multivariate 

Regression, Classification 

Integer, Real 

140000 

77 

2013 

 

Incident management process enriched event log

Multivariate, Sequential 

Regression, Clustering 

Integer 

141712 

36 

2019 

 

Accelerometer

Multivariate 

Classification, Regression 

Integer, Real 

153000 

2021 

 

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

153540 

25 

2017 

 

9mers from cullpdb

Sequential 

Classification, Regression 

Real 

158716 

2021 

 

clickstream data for online shopping

Multivariate, Sequential 

Classification, Regression, Clustering 

Integer, Real 

165474 

14 

2019 

 

Online Video Characteristics and Transcoding Time Dataset

Multivariate 

Regression 

Integer, Real 

168286 

11 

2015 

 

Drug Review Dataset (Drugs.com)

Multivariate, Text 

Classification, Regression, Clustering 

Integer 

215063 

2018 

 

Educational Process Mining (EPM): A Learning Analytics Data Set

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

230318 

13 

2015 

 

SGEMM GPU kernel performance

Multivariate 

Regression 

Integer 

241600 

18 

2018 

 

Query Analytics Workloads Dataset

Multivariate 

Regression, Clustering 

Real 

260000 

2019 

 

Wave Energy Converters

Multivariate 

Regression 

Real 

288000 

49 

2019 

 

Wave Energy Converters

Multivariate 

Regression 

Real 

288000 

49 

2019 

 

Beijing Multi-Site Air-Quality Data

Multivariate, Time-Series 

Regression 

Integer, Real 

420768 

18 

2019 

 

3D Road Network (North Jutland, Denmark)

Sequential, Text 

Regression, Clustering 

Real 

434874 

2013 

 

YearPredictionMSD

Multivariate 

Regression 

Real 

515345 

90 

2011 

 

Online Retail II

Multivariate, Sequential, Time-Series, Text 

Classification, Regression, Clustering 

Integer, Real 

1067371 

2019 

 

Individual household electric power consumption

Multivariate, Time-Series 

Regression, Clustering 

Real 

2075259 

2012 

 

Gas sensor array temperature modulation

Multivariate, Time-Series 

Classification, Regression 

Real 

4095000 

20 

2019 

 

Gas sensor array under dynamic gas mixtures

Multivariate, Time-Series 

Classification, Regression 

Real 

4178504 

19 

2015 

 

PPG-DaLiA

Multivariate, Time-Series 

Regression 

Real 

8300000 

11 

2019 

 

Bar Crawl: Detecting Heavy Drinking

Multivariate, Time-Series 

Classification, Regression 

Real 

14057567 

2020 

 

Bar Crawl: Detecting Heavy Drinking

Multivariate, Time-Series 

Classification, Regression 

Real 

14057567 

2020 

 

WESAD (Wearable Stress and Affect Detection)

Multivariate, Time-Series 

Classification, Regression 

Real 

63000000 

12 

2018 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML