Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (23)
Regression (16)
Clustering (8)
Other (0)

Attribute Type

Categorical (0)
Numerical (16)
Mixed (0)

Data Type - Undo

Multivariate (56)
Univariate (2)
Sequential (5)
Time-Series (16)
Text (5)
Domain-Theory (0)
Other (0)

Area

Life Sciences (1)
Physical Sciences (3)
CS / Engineering (10)
Social Sciences (1)
Business (1)
Game (0)
Other (0)

# Attributes - Undo

Less than 10 (5)
10 to 100 (16)
Greater than 100 (6)

# Instances

Less than 100 (1)
100 to 1000 (0)
Greater than 1000 (15)

Format Type - Undo

Matrix (16)
Non-Matrix (6)

16 Data Sets

Table View  List View


1. Air Quality: Contains the responses of a gas multisensor device deployed on the field in an Italian city. Hourly responses averages are recorded along with gas concentrations references from a certified analyzer.

2. Beijing Multi-Site Air-Quality Data: This hourly data set considers 6 main air pollutants and 6 relevant meteorological variables at multiple sites in Beijing.

3. Beijing PM2.5 Data: This hourly data set contains the PM2.5 data of US Embassy in Beijing. Meanwhile, meteorological data from Beijing Capital International Airport are also included.

4. Buzz in social media : This data-set contains examples of buzz events from two different social networks: Twitter, and Tom's Hardware, a forum network focusing on new technology with more conservative dynamics.

5. CNNpred: CNN-based stock market prediction using a diverse set of variables: This dataset contains several daily features of S&P 500, NASDAQ Composite, Dow Jones Industrial Average, RUSSELL 2000, and NYSE Composite from 2010 to 2017.

6. Daily Demand Forecasting Orders: The dataset was collected during 60 days, this is a real database of a brazilian logistics company.

7. Educational Process Mining (EPM): A Learning Analytics Data Set: Educational Process Mining data set is built from the recordings of 115 subjects' activities through a logging application while learning with an educational simulator.

8. EEG Steady-State Visual Evoked Potential Signals: This database consists on 30 subjects performing Brain Computer Interface for Steady State Visual Evoked Potentials (BCI-SSVEP).

9. Gas sensor array temperature modulation: A chemical detection platform composed of 14 temperature-modulated metal oxide (MOX) gas sensors was exposed during 3 weeks to mixtures of carbon monoxide and humid synthetic air in a gas chamber.

10. News Popularity in Multiple Social Media Platforms: Large data set of news items and their respective social feedback on multiple platforms: Facebook, Google+ and LinkedIn.

11. PM2.5 Data of Five Chinese Cities: This hourly data set contains the PM2.5 data in Beijing, Shanghai, Guangzhou, Chengdu and Shenyang. Meanwhile, meteorological data for each city are also included.

12. PPG-DaLiA: PPG-DaLiA contains data from 15 subjects wearing physiological and motion sensors, providing a PPG dataset for motion compensation and heart rate estimation in Daily Life Activities.

13. Real-time Election Results: Portugal 2019: Data set of the real-time election results of the 2019 Portuguese Parliamentary Election.

14. SML2010: This dataset is collected from a monitor system mounted in a domotic house. It corresponds to approximately 40 days of monitoring data.

15. UJIIndoorLoc-Mag: The UJIIndoorLoc-Mag is an indoor localization database to test Indoor Positioning System that rely on Earth's magnetic field variations.

16. WESAD (Wearable Stress and Affect Detection): WESAD (Wearable Stress and Affect Detection) contains data of 15 subjects during a stress-affect lab study, while wearing physiological and motion sensors.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML