Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (10)
Regression (3)
Clustering (4)
Other (1)

Attribute Type

Categorical (0)
Numerical (11)
Mixed (1)

Data Type - Undo

Multivariate (113)
Univariate (5)
Sequential (2)
Time-Series (12)
Text (7)
Domain-Theory (3)
Other (0)

Area

Life Sciences (2)
Physical Sciences (0)
CS / Engineering (4)
Social Sciences (0)
Business (3)
Game (0)
Other (2)

# Attributes

Less than 10 (3)
10 to 100 (5)
Greater than 100 (3)

# Instances - Undo

Less than 100 (3)
100 to 1000 (12)
Greater than 1000 (52)

Format Type - Undo

Matrix (12)
Non-Matrix (5)

12 Data Sets

Table View  List View


1. EEG Database: This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp sampled at 256 Hz

2. Dow Jones Index: This dataset contains weekly data for the Dow Jones Industrial Index. It has been used in computational investing research.

3. Sales_Transactions_Dataset_Weekly: Contains weekly purchased quantities of 800 over products over 52 weeks. Normalised values are provided too.

4. Absenteeism at work: The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil.

5. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

6. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

7. Twin gas sensor arrays: 5 replicates of an 8-MOX gas sensor array were exposed to different gas conditions (4 volatiles at 10 concentration levels each).

8. Japanese Vowels: This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers.

9. Synthetic Control Chart Time Series: This data consists of synthetically generated control charts.

10. PEMS-SF: 15 months worth of daily data (440 daily records) that describes the occupancy rate, between 0 and 1, of different car lanes of the San Francisco bay area freeways across time.

11. Daphnet Freezing of Gait: This dataset contains the annotated readings of 3 acceleration sensors at the hip and leg of Parkinson's disease patients that experience freezing of gait (FoG) during walking tasks.

12. ISTANBUL STOCK EXCHANGE: Data sets includes returns of Istanbul Stock Exchange with seven other international index; SP, DAX, FTSE, NIKKEI, BOVESPA, MSCE_EU, MSCI_EM from Jun 5, 2009 to Feb 22, 2011.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML