Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (11)
Regression (4)
Clustering (5)
Other (0)

Attribute Type - Undo

Categorical (0)
Numerical (11)
Mixed (0)

Data Type - Undo

Multivariate (69)
Univariate (5)
Sequential (1)
Time-Series (11)
Text (4)
Domain-Theory (2)
Other (0)

Area

Life Sciences (2)
Physical Sciences (0)
CS / Engineering (4)
Social Sciences (0)
Business (3)
Game (0)
Other (2)

# Attributes

Less than 10 (2)
10 to 100 (4)
Greater than 100 (4)

# Instances - Undo

Less than 100 (2)
100 to 1000 (11)
Greater than 1000 (41)

Format Type - Undo

Matrix (11)
Non-Matrix (3)

11 Data Sets

Table View  List View


1. Absenteeism at work: The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil.

2. Breath Metabolomics: Breath analysis is a pivotal method for biological phenotyping. In a pilot study, 100 experiments with four subjects have been performed to study the reproducibility of this technique.

3. Daphnet Freezing of Gait: This dataset contains the annotated readings of 3 acceleration sensors at the hip and leg of Parkinson's disease patients that experience freezing of gait (FoG) during walking tasks.

4. Dow Jones Index: This dataset contains weekly data for the Dow Jones Industrial Index. It has been used in computational investing research.

5. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

6. ISTANBUL STOCK EXCHANGE: Data sets includes returns of Istanbul Stock Exchange with seven other international index; SP, DAX, FTSE, NIKKEI, BOVESPA, MSCE_EU, MSCI_EM from Jun 5, 2009 to Feb 22, 2011.

7. Japanese Vowels: This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers.

8. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

9. PEMS-SF: 15 months worth of daily data (440 daily records) that describes the occupancy rate, between 0 and 1, of different car lanes of the San Francisco bay area freeways across time.

10. Synthetic Control Chart Time Series: This data consists of synthetically generated control charts.

11. Twin gas sensor arrays: 5 replicates of an 8-MOX gas sensor array were exposed to different gas conditions (4 volatiles at 10 concentration levels each).


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML