Center for Machine Learning and Intelligent Systems


22 Data Sets



1. 2.4 GHZ Indoor Channel Measurements: Measurements of S21, consisting of 10 sweeps; each sweep contains 601 frequency points spaced 0.167 MHz apart, covering a 100 MHz band centered at 2.4 GHz.

2. AAAI 2013 Accepted Papers: This data set comprises the metadata for the 2013 AAAI conference's accepted papers (main track only), including paper titles, abstracts, and keywords of varying granularity.

3. AAAI 2014 Accepted Papers: This data set comprises the metadata for the 2014 AAAI conference's accepted papers, including paper titles, authors, abstracts, and keywords of varying granularity.

4. banknote authentication: Data were extracted from images that were taken for the evaluation of an authentication procedure for banknotes.

5. BLOGGER: This paper seeks to identify the reasons users turn to cyberspace in Kohkiloye and Boyer Ahmad Province, Iran.

6. Combined Cycle Power Plant: The dataset contains 9568 data points collected from a Combined Cycle Power Plant over 6 years (2006-2011), when the plant was set to work with full load.

7. Computer Hardware: Relative CPU Performance Data, described in terms of its cycle time, memory size, etc.

8. COVID-19 Surveillance: Coronavirus Disease (COVID-19) Surveillance.

9. Dataset for ADL Recognition with Wrist-worn Accelerometer: Recordings of 16 volunteers performing 14 Activities of Daily Living (ADL) while carrying a single wrist-worn tri-axial accelerometer.

10. Dishonest Internet users Dataset: The dataset was used to test an architecture based on a trust model capable of evaluating the trustworthiness of users interacting in pervasive environments.

11. Energy efficiency: This study looked into assessing the heating load and cooling load requirements of buildings (that is, energy efficiency) as a function of building parameters.

12. GNFUV Unmanned Surface Vehicles Sensor Data: The data-set contains four (4) sets of mobile sensor readings data (humidity, temperature) corresponding to a swarm of four (4) Unmanned Surface Vehicles (USVs) in a test-bed in Athens (Greece).

13. GNFUV Unmanned Surface Vehicles Sensor Data Set 2: The data-set contains eight (2x4) data-sets of mobile sensor readings data (humidity, temperature) corresponding to a swarm of four Unmanned Surface Vehicles (USVs) in a test-bed, Athens, Greece.

14. LED Display Domain: From the book Classification and Regression Trees; two C programs are provided for generating sample databases.

15. Occupancy Detection: Experimental data used for binary classification (room occupancy) from Temperature, Humidity, Light and CO2. Ground-truth occupancy was obtained from time-stamped pictures that were taken every minute.

16. OCT data & Color Fundus Images of Left & Right Eyes: This dataset contains OCT data (in mat format) and color fundus data (in jpg format) of left & right eyes of 50 healthy persons.

17. Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet: The handwriting database consists of recordings from 62 PWP (People with Parkinson) and 15 healthy individuals. Three types of recordings (Static Spiral Test, Dynamic Spiral Test and Stability Test) were taken.

18. Qualitative_Bankruptcy: Predict bankruptcy from qualitative parameters provided by experts.

19. Rice (Cammeo and Osmancik): A total of 3810 rice grain images were taken for the two species and processed, and features were inferred from them. 7 morphological features were obtained for each grain of rice.

20. Servo: Data from a simulation of a servo system.

21. User Knowledge Modeling: A real dataset about students' knowledge status on the subject of Electrical DC Machines. The dataset was obtained from a Ph.D. thesis.

22. Wireless Indoor Localization: Collected in an indoor space by observing the signal strengths of seven WiFi signals visible on a smartphone. The decision variable is one of the four rooms.
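
The sweep parameters quoted for entry 1 (601 points, 0.167 MHz spacing, a 100 MHz band centered at 2.4 GHz) can be checked with a few lines of Python. This is only a sketch of the arithmetic. It assumes the points are evenly spaced and that both band edges (2.35 GHz and 2.45 GHz) are included, which the listing does not state explicitly. The function name `sweep_frequencies` is hypothetical and not part of the data set.

```python
# Sketch: rebuild the frequency grid implied by the entry 1 description.
# Assumption (not stated in the listing): evenly spaced points, both band
# edges included, so 601 points give 600 intervals across 100 MHz.

def sweep_frequencies(center_hz=2.4e9, span_hz=100e6, n_points=601):
    """Return n_points evenly spaced frequencies (Hz) covering the band."""
    step_hz = span_hz / (n_points - 1)   # 100 MHz / 600 intervals
    start_hz = center_hz - span_hz / 2   # lower band edge
    return [start_hz + i * step_hz for i in range(n_points)]

freqs = sweep_frequencies()
spacing_mhz = (freqs[1] - freqs[0]) / 1e6

print(f"points: {len(freqs)}")
print(f"spacing: {spacing_mhz:.3f} MHz")
print(f"band: {freqs[0] / 1e9:.3f} GHz to {freqs[-1] / 1e9:.3f} GHz")
```

Under these assumptions the spacing works out to 100/600 MHz, roughly 0.167 MHz, which agrees with the figure given in the description.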

