Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (17)
Regression (10)
Clustering (7)
Other (6)

Attribute Type

Categorical (0)
Numerical (3)
Mixed (2)

Data Type - Undo

Multivariate (8)
Univariate (2)
Sequential (4)
Time-Series (6)
Text (5)
Domain-Theory (9)
Other (11)

Area

Life Sciences (2)
Physical Sciences (0)
CS / Engineering (2)
Social Sciences (0)
Business (0)
Game (0)
Other (2)

# Attributes

Less than 10 (2)
10 to 100 (2)
Greater than 100 (0)

# Instances

Less than 100 (1)
100 to 1000 (2)
Greater than 1000 (1)

Format Type - Undo

Matrix (4)
Non-Matrix (6)

6 Data Sets

Table View  List View


1. Bach Chorales: Time-series data based on chorales; challenge is to learn generative grammar; data in Lisp

2. Diabetes: This diabetes dataset is from AIM '94

3. EMG dataset in Lower Limb: 3 different exercises: sitting, standing and walking in the muscles: biceps femoris, vastus medialis, rectus femoris and semitendinosus addition to goniometry in the exercises.

4. ICU: Data set prepared for the use of participants for the 1994 AAAI Spring Symposium on Artificial Intelligence in Medicine.

5. Predict keywords activities in a online social media: The data from Twitter was collected during 360 consecutive days. It was done by querying 1497 English keywords sampled from Wikipedia. This dataset is proposed in a Learning to rank setting.

6. Pseudo Periodic Synthetic Time Series: This data set is designed for testing indexing schemes in time series databases. The data appears highly periodic, but never exactly repeats itself.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML