Center for Machine Learning and Intelligent Systems


9 Data Sets


1. Paper Reviews: This sentiment analysis data set contains scientific paper reviews from an international conference on computing and informatics. The task is to predict the orientation or the evaluation of a review.

2. CSM (Conventional and Social Media Movies) Dataset 2014 and 2015: 12 features, categorized as conventional and social media features. The conventional features were collected from movie databases on the Web; the social media features come from YouTube and Twitter.

3. Optical Interconnection Network: This dataset contains 640 performance measurements from a simulation of a 2-dimensional multiprocessor optical interconnection network.

4. Behavior of the urban traffic of the city of Sao Paulo in Brazil: The database was created from records of the behavior of urban traffic in the city of Sao Paulo, Brazil.

5. Leaf: This dataset consists of a collection of shape and texture features extracted from digital images of leaf specimens originating from a total of 40 different plant species.

6. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

7. Mesothelioma’s disease data set: The Mesothelioma’s disease data set was prepared at Dicle University Faculty of Medicine in Turkey. It contains data on 324 mesothelioma patients. All samples in the dataset have 34 features.

8. GPS Trajectories: The dataset was fed by an Android app called Go!Track, which is available on the Google Play Store (https://play.google.com/store/apps/details?id=com.go.router).

9. Planning Relax: The dataset concerns the classification of two mental stages from recorded EEG signals: Planning (during imagination of a motor act) and Relax.

