Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (19)
Regression (8)
Clustering (12)
Other (3)

Attribute Type

Categorical (1)
Numerical (24)
Mixed (0)

Data Type - Undo

Multivariate (127)
Univariate (10)
Sequential (27)
Time-Series (50)
Text (28)
Domain-Theory (10)
Other (3)

Area - Undo

Life Sciences (7)
Physical Sciences (1)
CS / Engineering (27)
Social Sciences (1)
Business (3)
Game (0)
Other (7)

# Attributes

Less than 10 (7)
10 to 100 (11)
Greater than 100 (2)

# Instances

Less than 100 (2)
100 to 1000 (0)
Greater than 1000 (23)

Format Type

Matrix (12)
Non-Matrix (15)

27 Data Sets

Table View  List View

Name

Data Types

Default Task

Attribute Types

# Instances

# Attributes

Year

 

MSNBC.com Anonymous Web Data

Sequential 

 

Categorical 

989818 

 

 

 

UNIX User Data

Text, Sequential 

 

 

 

 

 

 

Predict keywords activities in a online social media

Multivariate, Sequential, Time-Series 

 

Integer, Real 

51 

35 

2013 

 

Indoor User Movement Prediction from RSS data

Multivariate, Sequential, Time-Series 

Classification 

Real 

13197 

2016 

 

Activity Recognition system based on Multisensor data fusion (AReM)

Multivariate, Sequential, Time-Series 

Classification 

Real 

42240 

2016 

 

Data for Software Engineering Teamwork Assessment in Education Setting

Sequential, Time-Series 

Classification 

Integer, Real 

74 

102 

2017 

 

Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer

Multivariate, Sequential, Time-Series 

Classification 

Real 

1540 

65 

2016 

 

UJI Pen Characters

Multivariate, Sequential 

Classification 

Integer 

1364 

 

2007 

 

UJI Pen Characters (Version 2)

Multivariate, Sequential 

Classification 

Integer 

11640 

 

2009 

 

Wall-Following Robot Navigation Data

Multivariate, Sequential 

Classification 

Real 

5456 

24 

2010 

 

Online Handwritten Assamese Characters Dataset

Multivariate, Sequential 

Classification 

Integer 

8235 

 

2011 

 

Wearable Computing: Classification of Body Postures and Movements (PUC-Rio)

Sequential 

Classification 

Integer, Real 

165632 

18 

2013 

 

microblogPCU

Multivariate, Univariate, Sequential, Text 

Classification, Causal-Discovery 

Integer, Real 

221579 

20 

2015 

 

Activities of Daily Living (ADLs) Recognition Using Binary Sensors

Multivariate, Sequential, Time-Series 

Classification, Clustering 

 

2747 

 

2013 

 

Grammatical Facial Expressions

Multivariate, Sequential 

Classification, Clustering 

Real 

27965 

100 

2014 

 

BLE RSSI Dataset for Indoor localization and Navigation

Multivariate, Sequential, Time-Series 

Classification, Clustering 

Integer 

6611 

15 

2018 

 

detection_of_IoT_botnet_attacks_N_BaIoT

Multivariate, Sequential 

Classification, Clustering 

Real 

7062606 

115 

2018 

 

UJIIndoorLoc-Mag

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

40000 

13 

2015 

 

Educational Process Mining (EPM): A Learning Analytics Data Set

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

230318 

13 

2015 

 

Open University Learning Analytics dataset

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer 

 

 

2015 

 

Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone

Multivariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Integer, Real 

153540 

25 

2017 

 

Parking Birmingham

Multivariate, Univariate, Sequential, Time-Series 

Classification, Regression, Clustering 

Real 

35717 

2019 

 

DSRC Vehicle Communications

Sequential, Text 

Clustering 

Real 

10000 

2017 

 

Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015

Multivariate, Sequential, Time-Series, Domain-Theory 

Clustering, Causal-Discovery 

Real 

1710671 

2015 

 

SML2010

Multivariate, Sequential, Time-Series, Text 

Regression 

Real 

4137 

24 

2014 

 

GNFUV Unmanned Surface Vehicles Sensor Data Set 2

Multivariate, Sequential, Time-Series 

Regression 

Real 

10190 

2018 

 

3D Road Network (North Jutland, Denmark)

Sequential, Text 

Regression, Clustering 

Real 

434874 

2013 

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML