Center for Machine Learning and Intelligent Systems

Browse Through:

Default Task

Classification (268)
Regression (121)
Clustering (77)
Other (9)

Attribute Type

Categorical (2)
Numerical (77)
Mixed (1)

Data Type

Multivariate (77)
Univariate (8)
Sequential (20)
Time-Series (30)
Text (20)
Domain-Theory (4)
Other (0)

Area

Life Sciences (18)
Physical Sciences (4)
CS / Engineering (31)
Social Sciences (1)
Business (14)
Game (0)
Other (7)

# Attributes

Less than 10 (24)
10 to 100 (37)
Greater than 100 (14)

# Instances

Less than 100 (8)
100 to 1000 (22)
Greater than 1000 (46)

Format Type

Matrix (64)
Non-Matrix (13)

77 Data Sets


| Name | Data Types | Default Task | Attribute Types | # Instances | # Attributes | Year |
|---|---|---|---|---|---|---|
| Open University Learning Analytics dataset | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer | | | 2015 |
| SMS Spam Collection | Multivariate, Text, Domain-Theory | Classification, Clustering | Real | 5574 | | 2012 |
| Exasens | Multivariate | Classification, Clustering | Integer | 399 | | 2020 |
| Exasens | Multivariate | Classification, Clustering | Integer | 399 | | 2020 |
| Parking Birmingham | Multivariate, Univariate, Sequential, Time-Series | Classification, Regression, Clustering | Real | 35717 | | 2019 |
| User Knowledge Modeling | Multivariate | Classification, Clustering | Integer | 403 | | 2013 |
| Tamilnadu Electricity Board Hourly Readings | Multivariate | Classification, Regression, Clustering | Real | 45781 | | 2013 |
| Drug Review Dataset (Drugs.com) | Multivariate, Text | Classification, Regression, Clustering | Integer | 215063 | | 2018 |
| Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease | Multivariate | Classification, Regression, Clustering | Real | 40 | | 2016 |
| Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet | Multivariate | Classification, Regression, Clustering | Integer | 77 | | 2017 |
| BuddyMove Data Set | Multivariate, Text | Classification, Clustering | Real | 249 | | 2018 |
| seeds | Multivariate | Classification, Clustering | Real | 210 | | 2012 |
| 3W dataset | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 1984 | | 2019 |
| Wholesale customers | Multivariate | Classification, Clustering | Integer | 440 | | 2014 |
| StoneFlakes | Multivariate | Classification, Clustering, Causal-Discovery | Real | 79 | | 2014 |
| Online Retail | Multivariate, Sequential, Time-Series | Classification, Clustering | Integer, Real | 541909 | | 2015 |
| Drug Review Dataset (Druglib.com) | Multivariate, Text | Classification, Regression, Clustering | Integer | 4143 | | 2018 |
| Query Analytics Workloads Dataset | Multivariate | Regression, Clustering | Real | 260000 | | 2019 |
| Alcohol QCM Sensor Dataset | Multivariate | Classification, Regression, Clustering | Real | 125 | | 2019 |
| Online Retail II | Multivariate, Sequential, Time-Series, Text | Classification, Regression, Clustering | Integer, Real | 1067371 | | 2019 |
| Stock keeping units | Multivariate | Clustering | Integer, Real | 2279 | | 2019 |
| Vehicle routing and scheduling problems | Multivariate | Clustering | Integer, Real | 18 | | 2019 |
| Stock keeping units | Multivariate | Clustering | Integer, Real | 2279 | | 2019 |
| Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015 | Multivariate, Sequential, Time-Series, Domain-Theory | Clustering, Causal-Discovery | Real | 1710671 | | 2015 |
| HTRU2 | Multivariate | Classification, Clustering | Real | 17898 | | 2017 |
| Individual household electric power consumption | Multivariate, Time-Series | Regression, Clustering | Real | 2075259 | | 2012 |
| BitcoinHeistRansomwareAddressDataset | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 2916697 | 10 | 2020 |
| Gas Turbine CO and NOx Emission Data Set | Multivariate | Regression, Clustering | Real | 36733 | 11 | 2019 |
| Travel Reviews | Multivariate, Text | Classification, Clustering | Real | 980 | 11 | 2018 |
| TV News Channel Commercial Detection Dataset | Multivariate | Classification, Clustering | Real | 129685 | 12 | 2015 |
| Facebook Live Sellers in Thailand | Multivariate | Clustering | Integer | 7051 | 12 | 2019 |
| Heart failure clinical records | Multivariate | Classification, Regression, Clustering | Integer, Real | 299 | 13 | 2020 |
| UJIIndoorLoc-Mag | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer, Real | 40000 | 13 | 2015 |
| Educational Process Mining (EPM): A Learning Analytics Data Set | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer | 230318 | 13 | 2015 |
| clickstream data for online shopping | Multivariate, Sequential | Classification, Regression, Clustering | Integer, Real | 165474 | 14 | 2019 |
| HCV data | Multivariate | Classification, Clustering | Integer, Real | 615 | 14 | 2020 |
| BLE RSSI Dataset for Indoor localization and Navigation | Multivariate, Sequential, Time-Series | Classification, Clustering | Integer | 6611 | 15 | 2018 |
| Heterogeneity Activity Recognition | Multivariate, Time-Series | Classification, Clustering | Real | 43930257 | 16 | 2015 |
| Estimation of obesity levels based on eating habits and physical condition | Multivariate | Classification, Regression, Clustering | Integer | 2111 | 17 | 2019 |
| Online Shoppers Purchasing Intention Dataset | Multivariate | Classification, Clustering | Integer, Real | 12330 | 18 | 2018 |
| Cervical Cancer Behavior Risk | Multivariate, Univariate | Classification, Clustering | Integer | 72 | 19 | 2019 |
| Chemical Composition of Ceramic Samples | Multivariate | Classification, Clustering | Real | 88 | 19 | 2019 |
| South German Credit | Multivariate | Classification, Regression, Clustering | Integer, Real | 1000 | 21 | 2019 |
| South German Credit (UPDATE) | Multivariate | Classification, Regression, Clustering | Integer, Real | 1000 | 21 | 2020 |
| Absenteeism at work | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 740 | 21 | 2018 |
| Non verbal tourists data | Multivariate | Classification, Clustering | Integer, Real | 73 | 22 | 2021 |
| Anuran Calls (MFCCs) | Multivariate | Classification, Clustering | Real | 7195 | 22 | 2017 |
| KEGG Metabolic Relation Network (Directed) | Multivariate, Univariate, Text | Classification, Regression, Clustering | Integer, Real | 53414 | 24 | 2011 |
| Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone | Multivariate, Sequential, Time-Series | Classification, Regression, Clustering | Integer, Real | 153540 | 25 | 2017 |
| Travel Review Ratings | Multivariate, Text | Classification, Clustering | Real | 5456 | 25 | 2018 |
| KEGG Metabolic Reaction Network (Undirected) | Multivariate, Univariate, Text | Classification, Regression, Clustering | Integer, Real | 65554 | 29 | 2011 |
| Incident management process enriched event log | Multivariate, Sequential | Regression, Clustering | Integer | 141712 | 36 | 2019 |
| Water Treatment Plant | Multivariate | Clustering | Integer, Real | 527 | 38 | 1993 |
| MoCap Hand Postures | Multivariate | Classification, Clustering | Integer, Real | 78095 | 38 | 2016 |
| Motion Capture Hand Postures | Multivariate | Classification, Clustering | Real | 78095 | 38 | 2017 |
| Tennis Major Tournament Match Statistics | Multivariate | Classification, Regression, Clustering | Integer, Real | 127 | 42 | 2014 |
| Gesture Phase Segmentation | Multivariate, Sequential, Time-Series | Classification, Clustering | Real | 9900 | 50 | 2014 |
| Sales_Transactions_Dataset_Weekly | Multivariate, Time-Series | Clustering | Integer, Real | 811 | 53 | 2017 |
| Diabetes 130-US hospitals for years 1999-2008 | Multivariate | Classification, Clustering | Integer | 100000 | 55 | 2014 |
| Multi-view Brain Networks | Multivariate | Classification, Clustering | Integer | 70 | 70 | 2020 |
| Mice Protein Expression | Multivariate | Classification, Clustering | Real | 1080 | 82 | 2015 |
| Libras Movement | Multivariate, Sequential | Classification, Clustering | Real | 360 | 91 | 2009 |
| Grammatical Facial Expressions | Multivariate, Sequential | Classification, Clustering | Real | 27965 | 100 | 2014 |
| Kitsune Network Attack Dataset | Multivariate, Sequential, Time-Series | Classification, Clustering, Causal-Discovery | Real | 27170754 | 115 | 2019 |
| detection_of_IoT_botnet_attacks_N_BaIoT | Multivariate, Sequential | Classification, Clustering | Real | 7062606 | 115 | 2018 |
| Gas Sensor Array Drift Dataset at Different Concentrations | Multivariate, Time-Series | Classification, Regression, Clustering, Causal-Discovery | Real | 13910 | 129 | 2013 |
| Epileptic Seizure Recognition | Multivariate, Time-Series | Classification, Clustering | Integer, Real | 11500 | 179 | 2017 |
| Mturk User-Perceived Clusters over Images | Multivariate, Text | Clustering | Integer | 180 | 500 | 2016 |
| FMA: A Dataset For Music Analysis | Multivariate, Time-Series | Classification, Clustering | Real | 106574 | 518 | 2017 |
| Breath Metabolomics | Multivariate, Time-Series | Classification, Clustering | Real | 104 | 1656 | 2019 |
| Daily and Sports Activities | Multivariate, Time-Series | Classification, Clustering | Real | 9120 | 5625 | 2013 |
| DrivFace | Multivariate | Classification, Regression, Clustering | Real | 606 | 6400 | 2016 |
| A study of Asian Religious and Biblical Texts | Multivariate, Text | Classification, Clustering | Integer | 590 | 8265 | 2019 |
| Reuter_50_50 | Multivariate, Text, Domain-Theory | Classification, Clustering | Real | 2500 | 10000 | 2011 |
| gene expression cancer RNA-Seq | Multivariate | Classification, Clustering | Real | 801 | 20531 | 2016 |
| Repeat Consumption Matrices | Multivariate | Clustering | Real | 130000 | 21000 | 2018 |
| YouTube Multiview Video Games Dataset | Multivariate, Text | Classification, Clustering | Integer, Real | 120000 | 1000000 | 2013 |
