Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task

Classification (11)
Regression (5)
Clustering (2)
Other (1)

Attribute Type - Undo

Categorical (1)
Numerical (14)
Mixed (2)

Data Type

Multivariate (12)
Univariate (3)
Sequential (0)
Time-Series (4)
Text (1)
Domain-Theory (1)
Other (0)

Area - Undo

Life Sciences (21)
Physical Sciences (12)
CS / Engineering (14)
Social Sciences (1)
Business (4)
Game (0)
Other (9)

# Attributes

Less than 10 (6)
10 to 100 (4)
Greater than 100 (4)

# Instances - Undo

Less than 100 (2)
100 to 1000 (14)
Greater than 1000 (39)

Format Type

Matrix (10)
Non-Matrix (4)

14 Data Sets

Table View  List View

1. Perfume Data: This data consists of odors of 20 different perfumes. Data was obtained by using a handheld odor meter (OMX-GR sensor) per second for 28 seconds period.

2. User Knowledge Modeling: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D. Thesis.

3. ser Knowledge Modeling Data (Students' Knowledge Levels on DC Electrical Machines): The dataset is about the users' learning activities and knowledge levels on subjects of DC Electrical Machines. The dataset had been obtained from online web-courses and reported in my Ph.D. Thesis.

4. EMG dataset in Lower Limb: 3 different exercises: sitting, standing and walking in the muscles: biceps femoris, vastus medialis, rectus femoris and semitendinosus addition to goniometry in the exercises.

5. Energy efficiency: This study looked into assessing the heating load and cooling load requirements of buildings (that is, energy efficiency) as a function of building parameters.

6. Computer Hardware: Relative CPU Performance Data, described in terms of its cycle time, memory size, etc.

7. Concrete Slump Test: Concrete is a highly complex material. The slump flow of concrete is not only determined by the water content, but that is also influenced by other concrete ingredients.

8. Planning Relax: The dataset concerns with the classification of two mental stages from recorded EEG signals: Planning (during imagination of motor act) and Relax state.

9. Leaf: This dataset consists in a collection of shape and texture features extracted from digital images of leaf specimens originating from a total of 40 different plant species.

10. MHEALTH Dataset: The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.

11. Northix: Northix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases.

12. NoisyOffice: Corpus intended to do cleaning (or binarization) and enhancement of noisy grayscale printed text images using supervised learning methods. Noisy images and their corresponding ground truth provided.

13. PEMS-SF: 15 months worth of daily data (440 daily records) that describes the occupancy rate, between 0 and 1, of different car lanes of the San Francisco bay area freeways across time.

14. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML