Center for Machine Learning and Intelligent Systems

19 Data Sets



1. Bach Chorales: Time-series data based on chorales; the challenge is to learn a generative grammar; the data is in Lisp.

2. EEG Database: This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp, sampled at 256 Hz.

3. EMG dataset in Lower Limb: 3 different exercises (sitting, standing and walking) recorded in the muscles biceps femoris, vastus medialis, rectus femoris and semitendinosus, in addition to goniometry during the exercises.

4. Horton General Hospital: Horton General Hospital is in the town of Banbury, not far from Oxford, UK.

5. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

6. Japanese Vowels: This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers.

7. Robot Execution Failures: This dataset contains force and torque measurements on a robot after failure detection. Each failure is characterized by 15 force/torque samples collected at regular time intervals.

8. PEMS-SF: 15 months' worth of daily data (440 daily records) that describe the occupancy rate, between 0 and 1, of different car lanes of the San Francisco Bay Area freeways across time.

9. Daphnet Freezing of Gait: This dataset contains the annotated readings of 3 acceleration sensors at the hip and leg of Parkinson's disease patients who experience freezing of gait (FoG) during walking tasks.

10. Breath Metabolomics: Breath analysis is a pivotal method for biological phenotyping. In a pilot study, 100 experiments with four subjects were performed to study the reproducibility of this technique.

11. Dow Jones Index: This dataset contains weekly data for the Dow Jones Industrial Index. It has been used in computational investing research.

12. Synthetic Control Chart Time Series: This data consists of synthetically generated control charts.

13. Absenteeism at work: The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil.

14. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

15. Twin gas sensor arrays: 5 replicates of an 8-MOX gas sensor array were exposed to different gas conditions (4 volatiles at 10 concentration levels each).

16. Behavior of the urban traffic of the city of Sao Paulo in Brazil: The database was created with records of the behavior of urban traffic in the city of Sao Paulo, Brazil.

17. ISTANBUL STOCK EXCHANGE: The data set includes returns of the Istanbul Stock Exchange along with seven other international indices (SP, DAX, FTSE, NIKKEI, BOVESPA, MSCE_EU, MSCI_EM) from Jun 5, 2009 to Feb 22, 2011.

18. Sales_Transactions_Dataset_Weekly: Contains weekly purchased quantities of over 800 products over 52 weeks. Normalised values are provided too.

19. ElectricityLoadDiagrams20112014: This data set contains electricity consumption of 370 points/clients.

