Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (5)
Regression (0)
Clustering (0)
Other (0)

Attribute Type

Categorical (0)
Numerical (3)
Mixed (0)

Data Type

Multivariate (5)
Univariate (0)
Sequential (0)
Time-Series (0)
Text (0)
Domain-Theory (0)
Other (0)

Area - Undo

Life Sciences (13)
Physical Sciences (5)
CS / Engineering (42)
Social Sciences (2)
Business (1)
Game (1)
Other (3)

# Attributes - Undo

Less than 10 (4)
10 to 100 (18)
Greater than 100 (5)

# Instances

Less than 100 (0)
100 to 1000 (1)
Greater than 1000 (4)

Format Type - Undo

Matrix (5)
Non-Matrix (3)

5 Data Sets

Table View  List View


1. Musk (Version 1): The goal is to learn to predict whether new molecules will be musks or non-musks

2. Musk (Version 2): The goal is to learn to predict whether new molecules will be musks or non-musks

3. QSAR androgen receptor: 1024 binary attributes (molecular fingerprints) used to classify 1687 chemicals into 2 classes (binder to androgen receptor/positive, non-binder to androgen receptor /negative)

4. QSAR oral toxicity: Data set containing values for 1024 binary attributes (molecular fingerprints) used to classify 8992 chemicals into 2 classes (very toxic/positive, not very toxic/negative)

5. Weight Lifting Exercises monitored with Inertial Measurement Units: Six young health subjects were asked to perform 5 variations of the biceps curl weight lifting exercise. One of the variations is the one predicted by the health professional.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML