Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (13)
Regression (11)
Clustering (4)
Other (0)

Attribute Type

Categorical (0)
Numerical (17)
Mixed (0)

Data Type - Undo

Multivariate (50)
Univariate (1)
Sequential (9)
Time-Series (17)
Text (3)
Domain-Theory (0)
Other (0)

Area - Undo

Life Sciences (4)
Physical Sciences (5)
CS / Engineering (17)
Social Sciences (1)
Business (4)
Game (0)
Other (5)

# Attributes - Undo

Less than 10 (6)
10 to 100 (17)
Greater than 100 (16)

# Instances

Less than 100 (0)
100 to 1000 (1)
Greater than 1000 (16)

Format Type - Undo

Matrix (17)
Non-Matrix (7)

17 Data Sets

Table View  List View


1. AI4I 2020 Predictive Maintenance Dataset: The AI4I 2020 Predictive Maintenance Dataset is a synthetic dataset that reflects real predictive maintenance data encountered in industry.

2. Air Quality: Contains the responses of a gas multisensor device deployed on the field in an Italian city. Hourly responses averages are recorded along with gas concentrations references from a certified analyzer.

3. BLE RSSI Dataset for Indoor localization and Navigation: This dataset contains RSSI readings gathered from an array of Bluetooth Low Energy (BLE) iBeacons in a real-world and operational indoor environment for localization and navigation purposes.

4. Buzz in social media : This data-set contains examples of buzz events from two different social networks: Twitter, and Tom's Hardware, a forum network focusing on new technology with more conservative dynamics.

5. CNNpred: CNN-based stock market prediction using a diverse set of variables: This dataset contains several daily features of S&P 500, NASDAQ Composite, Dow Jones Industrial Average, RUSSELL 2000, and NYSE Composite from 2010 to 2017.

6. Educational Process Mining (EPM): A Learning Analytics Data Set: Educational Process Mining data set is built from the recordings of 115 subjects' activities through a logging application while learning with an educational simulator.

7. Gas sensor array temperature modulation: A chemical detection platform composed of 14 temperature-modulated metal oxide (MOX) gas sensors was exposed during 3 weeks to mixtures of carbon monoxide and humid synthetic air in a gas chamber.

8. Gas sensors for home activity monitoring: 100 recordings of a sensor array under different conditions in a home setting: background, wine and banana presentations. The array includes 8 MOX gas sensors, and humidity and temperature sensors.

9. Heterogeneity Activity Recognition: The Heterogeneity Human Activity Recognition (HHAR) dataset from Smartphones and Smartwatches is a dataset devised to benchmark human activity recognition algorithms (classification, automatic data segmentation, sensor fusion, feature extraction, etc.) in real-world contexts; specifically, the dataset is gathered with a variety of different device models and use-scenarios, in order to reflect sensing heterogeneities to be expected in real deployments.

10. Hybrid Indoor Positioning Dataset from WiFi RSSI, Bluetooth and magnetometer: The dataset was created for the comparison and evaluation of hybrid indoor positioning methods. The dataset presented contains data from W-LAN and Bluetooth interfaces, and Magnetometer.

11. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

12. News Popularity in Multiple Social Media Platforms: Large data set of news items and their respective social feedback on multiple platforms: Facebook, Google+ and LinkedIn.

13. PAMAP2 Physical Activity Monitoring: The PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities, performed by 9 subjects wearing 3 inertial measurement units and a heart rate monitor.

14. PPG-DaLiA: PPG-DaLiA contains data from 15 subjects wearing physiological and motion sensors, providing a PPG dataset for motion compensation and heart rate estimation in Daily Life Activities.

15. SML2010: This dataset is collected from a monitor system mounted in a domotic house. It corresponds to approximately 40 days of monitoring data.

16. UJIIndoorLoc-Mag: The UJIIndoorLoc-Mag is an indoor localization database to test Indoor Positioning System that rely on Earth's magnetic field variations.

17. WESAD (Wearable Stress and Affect Detection): WESAD (Wearable Stress and Affect Detection) contains data of 15 subjects during a stress-affect lab study, while wearing physiological and motion sensors.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML