Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (118)
Regression (26)
Clustering (19)
Other (12)

Attribute Type

Categorical (1)
Numerical (3)
Mixed (4)

Data Type

Multivariate (6)
Univariate (1)
Sequential (0)
Time-Series (3)
Text (1)
Domain-Theory (2)
Other (2)


Life Sciences (3)
Physical Sciences (2)
CS / Engineering (3)
Social Sciences (2)
Business (1)
Game (0)
Other (1)

# Attributes

Less than 10 (5)
10 to 100 (4)
Greater than 100 (0)

# Instances - Undo

Less than 100 (3)
100 to 1000 (12)
Greater than 1000 (19)

Format Type

Matrix (5)
Non-Matrix (7)

12 Data Sets

Table View  List View

1. Abscisic Acid Signaling Network: The objective is to determine the set of boolean rules that describe the interactions of the nodes within this plant signaling network. The dataset includes 300 separate boolean pseudodynamic simulations using an asynchronous update scheme.

2. Bach Chorales: Time-series data based on chorales; challenge is to learn generative grammar; data in Lisp

3. Coil 1999 Competition Data: This data set is from the 1999 Computational Intelligence and Learning (COIL) competition. The data contains measurements of river chemical concentrations and algae densities.

4. Eco-hotel: This dataset includes Online Textual Reviews from both online (e.g., TripAdvisor) and offline (e.g., Guests' book) sources from the Areias do Seixo Eco-Resort.

5. EEG Database: This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp sampled at 256 Hz

6. EMG dataset in Lower Limb: 3 different exercises: sitting, standing and walking in the muscles: biceps femoris, vastus medialis, rectus femoris and semitendinosus addition to goniometry in the exercises.

7. Function Finding: Cases collected mostly from investigations in physical science; intention is to evaluate function-finding algorithms

8. Kinship: Relational dataset

9. Liver Disorders: BUPA Medical Research Ltd. database donated by Richard S. Forsyth

10. Moral Reasoner: Horn-clause model that qualitatively simulates moral reasoning; Theory includes negated literals

11. Restaurant & consumer data: The dataset was obtained from a recommender system prototype. The task was to generate a top-n list of restaurants according to the consumer preferences.

12. Student Loan Relational: Student Loan Relational Domain

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML