Center for Machine Learning and Intelligent Systems




13 Data Sets



1. Absenteeism at work: The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil.

2. Breath Metabolomics: Breath analysis is a pivotal method for biological phenotyping. In a pilot study, 100 experiments with four subjects were performed to study the reproducibility of this technique.

3. Daphnet Freezing of Gait: This dataset contains the annotated readings of 3 acceleration sensors at the hip and leg of Parkinson's disease patients who experience freezing of gait (FoG) during walking tasks.

4. Dow Jones Index: This dataset contains weekly data for the Dow Jones Industrial Index. It has been used in computational investing research.

5. EEG Database: This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp, sampled at 256 Hz.

6. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

7. ISTANBUL STOCK EXCHANGE: The data set includes returns of the Istanbul Stock Exchange along with seven other international indices (SP, DAX, FTSE, NIKKEI, BOVESPA, MSCE_EU, MSCI_EM) from Jun 5, 2009 to Feb 22, 2011.

8. Japanese Vowels: This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers.

9. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

10. PEMS-SF: 15 months' worth of daily data (440 daily records) describing the occupancy rate, between 0 and 1, of different car lanes on San Francisco Bay Area freeways over time.

11. Sales_Transactions_Dataset_Weekly: Contains weekly purchased quantities of over 800 products over 52 weeks. Normalised values are provided too.

12. Synthetic Control Chart Time Series: This data consists of synthetically generated control charts.

13. Twin gas sensor arrays: 5 replicates of an 8-MOX gas sensor array were exposed to different gas conditions (4 volatiles at 10 concentration levels each).

