Center for Machine Learning and Intelligent Systems
101 Data Sets (Attribute Type: Numerical; # Attributes: Less than 10)

| Name | Data Types | Default Task | Attribute Types | # Instances | Year |
|---|---|---|---|---|---|
| 2.4 GHZ Indoor Channel Measurements | Multivariate | Classification | Real | 7840 | 2018 |
| 3D Road Network (North Jutland, Denmark) | Sequential, Text | Regression, Clustering | Real | 434874 | 2013 |
| 3W dataset | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 1984 | 2019 |
| 9mers from cullpdb | Sequential | Classification, Regression | Real | 158716 | 2021 |
| Accelerometer | Multivariate | Classification, Regression | Integer, Real | 153000 | 2021 |
| Activity Recognition system based on Multisensor data fusion (AReM) | Multivariate, Sequential, Time-Series | Classification | Real | 42240 | 2016 |
| Activity recognition with healthy older people using a batteryless wearable sensor | Sequential | Classification | Real | 75128 | 2016 |
| Airfoil Self-Noise | Multivariate | Regression | Real | 1503 | 2014 |
| Alcohol QCM Sensor Dataset | Multivariate | Classification, Regression, Clustering | Real | 125 | 2019 |
| Average Localization Error (ALE) in sensor node localization process in WSNs | Multivariate | Regression | Real | 107 | 2021 |
| banknote authentication | Multivariate | Classification | Real | 1372 | 2013 |
| Bar Crawl: Detecting Heavy Drinking | Multivariate, Time-Series | Classification, Regression | Real | 14057567 | 2020 |
| Bar Crawl: Detecting Heavy Drinking | Multivariate, Time-Series | Classification, Regression | Real | 14057567 | 2020 |
| Basketball dataset | Time-Series | Classification | Integer | 10000 | 2019 |
| BLE RSSI dataset for Indoor localization | Sequential, Time-Series | Classification | Integer | 23570 | 2019 |
| Blood Transfusion Service Center | Multivariate | Classification | Real | 748 | 2008 |
| BuddyMove Data Set | Multivariate, Text | Classification, Clustering | Real | 249 | 2018 |
| Caesarian Section Classification Dataset | Univariate | Classification | Integer | 80 | 2018 |
| Carbon Nanotubes | Univariate | Regression | Real | 10721 | 2018 |
| Challenger USA Space Shuttle O-Ring | Multivariate | Regression | Integer | 23 | 1993 |
| Character Trajectories | Time-Series | Classification, Clustering | Real | 2858 | 2008 |
| Combined Cycle Power Plant | Multivariate | Regression | Real | 9568 | 2014 |
| Computer Hardware | Multivariate | Regression | Integer | 209 | 1987 |
| Concrete Compressive Strength | Multivariate | Regression | Real | 1030 | 2007 |
| Container Crane Controller Data Set | Univariate, Domain-Theory | Classification, Regression | Real | 15 | 2018 |
| Cryotherapy Dataset | Univariate | Classification | Integer, Real | 90 | 2018 |
| Cuff-Less Blood Pressure Estimation | Multivariate | Classification, Regression | Real | 12000 | 2015 |
| Daphnet Freezing of Gait | Multivariate, Time-Series | Classification | Real | 237 | 2013 |
| Demand Forecasting for a store | Multivariate | Regression | Integer | 28764 | 2019 |
| Drug Review Dataset (Druglib.com) | Multivariate, Text | Classification, Regression, Clustering | Integer | 4143 | 2018 |
| Drug Review Dataset (Drugs.com) | Multivariate, Text | Classification, Regression, Clustering | Integer | 215063 | 2018 |
| DSRC Vehicle Communications | Sequential, Text | Clustering | Real | 10000 | 2017 |
| Ecoli | Multivariate | Classification | Real | 336 | 1996 |
| EMG data for gestures | Time-Series | Classification | Real | 30000 | 2019 |
| EMG dataset in Lower Limb | Multivariate, Time-Series |  | Real | 132 | 2014 |
| EMG Physical Action Data Set | Time-Series | Classification | Real | 10000 | 2011 |
| Energy efficiency | Multivariate | Classification, Regression | Integer, Real | 768 | 2012 |
| Exasens | Multivariate | Classification, Clustering | Integer | 399 | 2020 |
| Exasens | Multivariate | Classification, Clustering | Integer | 399 | 2020 |
| GNFUV Unmanned Surface Vehicles Sensor Data | Multivariate, Time-Series | Regression | Real | 1672 | 2018 |
| GNFUV Unmanned Surface Vehicles Sensor Data Set 2 | Multivariate, Sequential, Time-Series | Regression | Real | 10190 | 2018 |
| Haberman's Survival | Multivariate | Classification | Integer | 306 | 1999 |
| Horton General Hospital | Multivariate, Time-Series | Causal-Discovery | Integer | 139 | 2019 |
| HTRU2 | Multivariate | Classification, Clustering | Real | 17898 | 2017 |
| Image Recognition Task Execution Times in Mobile Edge Computing | Univariate, Sequential, Time-Series | Regression | Real | 4000 | 2020 |
| Image Recognition Task Execution Times in Mobile Edge Computing | Univariate | Regression | Real | 4000 | 2021 |
| Immunotherapy Dataset | Univariate | Classification | Integer, Real | 90 | 2018 |
| Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease | Multivariate | Classification, Regression, Clustering | Real | 40 | 2016 |
| Individual household electric power consumption | Multivariate, Time-Series | Regression, Clustering | Real | 2075259 | 2012 |
| Indoor User Movement Prediction from RSS data | Multivariate, Sequential, Time-Series | Classification | Real | 13197 | 2016 |
| Intelligent Media Accelerometer and Gyroscope (IM-AccGyro) Dataset | Time-Series | Classification | Real | 800 | 2020 |
| Iris | Multivariate | Classification | Real | 150 | 1988 |
| ISTANBUL STOCK EXCHANGE | Multivariate, Univariate, Time-Series | Classification, Regression | Real | 536 | 2013 |
| Labeled Text Forum Threads Dataset | Text | Classification | Integer | 200 | 2019 |
| Localization Data for Person Activity | Univariate, Sequential, Time-Series | Classification | Real | 164860 | 2010 |
| Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014 | Sequential, Time-Series | Classification | Real | 314080 | 2015 |
| Mammographic Mass | Multivariate | Classification | Integer | 961 | 2007 |
| Metro Interstate Traffic Volume | Multivariate, Sequential, Time-Series | Regression | Integer, Real | 48204 | 2019 |
| Occupancy Detection | Multivariate, Time-Series | Classification | Real | 20560 | 2016 |
| OCT data & Color Fundus Images of Left & Right Eyes | Multivariate | Classification | Real | 50 | 2016 |
| Online Retail | Multivariate, Sequential, Time-Series | Classification, Clustering | Integer, Real | 541909 | 2015 |
| Online Retail II | Multivariate, Sequential, Time-Series, Text | Classification, Regression, Clustering | Integer, Real | 1067371 | 2019 |
| Parking Birmingham | Multivariate, Univariate, Sequential, Time-Series | Classification, Regression, Clustering | Real | 35717 | 2019 |
| Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet | Multivariate | Classification, Regression, Clustering | Integer | 77 | 2017 |
| Perfume Data | Univariate, Domain-Theory | Classification, Clustering | Integer | 560 | 2014 |
| Physicochemical Properties of Protein Tertiary Structure | Multivariate | Regression | Real | 45730 | 2013 |
| Power consumption of Tetouan city | Multivariate, Time-Series | Regression | Integer, Real | 52417 | 2021 |
| QSAR aquatic toxicity | Multivariate | Regression | Real | 546 | 2019 |
| QSAR fish bioconcentration factor (BCF) | Multivariate | Regression | Integer, Real | 1056 | 2019 |
| QSAR fish toxicity | Multivariate | Regression | Real | 908 | 2019 |
| QtyT40I10D100K | Sequential |  | Integer | 3960456 | 2012 |
| Query Analytics Workloads Dataset | Multivariate | Regression, Clustering | Real | 260000 | 2019 |
| Raisin Dataset | Multivariate | Classification | Integer, Real | 900 | 2021 |
| Real estate valuation data set | Multivariate | Regression | Integer, Real | 414 | 2018 |
| Rice (Cammeo and Osmancik) | Multivariate | Classification | Real | 3810 | 2019 |
| seeds | Multivariate | Classification, Clustering | Real | 210 | 2012 |
| selfBACK | Time-Series | Classification, Clustering | Real | 26136 | 2020 |
| Sepsis survival minimal clinical records | Multivariate | Classification | Integer | 110341 | 2020 |
| User Knowledge Modeling Data (Students' Knowledge Levels on DC Electrical Machines) | Multivariate | Classification | Real | 403 | 2013 |
| Shoulder Implant X-Ray Manufacturer Classification | Multivariate | Classification | Real | 597 | 2020 |
| Shoulder Implant X-Ray Manufacturer Classification | Multivariate | Classification | Real | 597 | 2020 |
| Skin Segmentation | Univariate | Classification | Real | 245057 | 2012 |
| Somerville Happiness Survey |  | Classification | Integer | 143 | 2018 |
| Statlog (Shuttle) | Multivariate | Classification | Integer | 58000 |  |
| Stock keeping units | Multivariate | Clustering | Integer, Real | 2279 | 2019 |
| Stock keeping units | Multivariate | Clustering | Integer, Real | 2279 | 2019 |
| StoneFlakes | Multivariate | Classification, Clustering, Causal-Discovery | Real | 79 | 2014 |
| Synchronous Machine Data Set | Multivariate | Regression | Real | 557 | 2021 |
| Synchronous Machine Data Set | Multivariate | Regression | Real | 557 | 2021 |
| Tamilnadu Electricity Board Hourly Readings | Multivariate | Classification, Regression, Clustering | Real | 45781 | 2013 |
| Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015 | Multivariate, Sequential, Time-Series, Domain-Theory | Clustering, Causal-Discovery | Real | 1710671 | 2015 |
| UrbanGB, urban road accidents coordinates labelled by the urban center | Univariate | Clustering | Real | 360177 | 2019 |
| User Knowledge Modeling | Multivariate | Classification, Clustering | Integer | 403 | 2013 |
| USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat | Domain-Theory | Classification | Integer | 306 | 2013 |
| Vehicle routing and scheduling problems | Multivariate | Clustering | Integer, Real | 18 | 2019 |
| Vertebral Column | Multivariate | Classification | Real | 310 | 2011 |
| Wholesale customers | Multivariate | Classification, Clustering | Integer | 440 | 2014 |
| Wireless Indoor Localization | Multivariate | Classification | Real | 2000 | 2017 |
| WISDM Smartphone and Smartwatch Activity and Biometrics Dataset | Multivariate, Time-Series | Classification | Real | 15630426 | 2019 |
| Yacht Hydrodynamics | Multivariate | Regression | Real | 308 | 2013 |
| Yeast | Multivariate | Classification | Real | 1484 | 1996 |
