Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (20)
Regression (5)
Clustering (5)
Other (0)

Attribute Type

Categorical (5)
Numerical (11)
Mixed (1)

Data Type

Multivariate (15)
Univariate (3)
Sequential (1)
Time-Series (2)
Text (1)
Domain-Theory (1)
Other (0)

Area

Life Sciences (6)
Physical Sciences (1)
CS / Engineering (9)
Social Sciences (1)
Business (0)
Game (0)
Other (3)

# Attributes

Less than 10 (11)
10 to 100 (4)
Greater than 100 (5)

# Instances - Undo

Less than 100 (20)
100 to 1000 (114)
Greater than 1000 (161)

Format Type - Undo

Matrix (20)
Non-Matrix (5)

20 Data Sets

Table View  List View


1. Balloons: Data previously used in cognitive psychology experiment; 4 data sets represent different conditions of an experiment

2. Caesarian Section Classification Dataset: This dataset contains information about caesarian section results of 80 pregnant women with the most important characteristics of delivery problems in the medical field.

3. Container Crane Controller Data Set: A container crane has the function of transporting containers from one point to another point.

4. COVID-19 Surveillance: Coronavirus Disease (COVID-19) Surveillance.

5. Cryotherapy Dataset : This dataset contains information about wart treatment results of 90 patients using cryotherapy.

6. Data for Software Engineering Teamwork Assessment in Education Setting: Data include over 100 Team Activity Measures and outcomes (ML classes) obtained from activities of 74 student teams during the creation of final class project in SW Eng. classes at SFSU, Fulda, FAU

7. DBWorld e-mails: It contains 64 e-mails which I have manually collected from DBWorld mailing list. They are classified in: 'announces of conferences' and 'everything else'.

8. Gas sensor array under flow modulation: The data set contains 58 time series acquired from 16 chemical sensors under gas flow modulation conditions. The sensors were exposed to different gaseous binary mixtures of acetone and ethanol.

9. Lenses: Database for fitting contact lenses

10. Lung Cancer: Lung cancer data; no attribute definitions

11. Monolithic Columns in Troad and Mysia Region: These data have been constituted to clarify the distribution in Northwestern Anatolia of the monolithic columns produced in the ancient granite quarries located in Troad and Mysia Regions.

12. OCT data & Color Fundus Images of Left & Right Eyes: This dataset contains OCT data (in mat format) and color fundus data (in jpg format) of left & right eyes of 50 healthy persons.

13. Parkinson Disease Spiral Drawings Using Digitized Graphics Tablet: Handwriting database consists of 62 PWP(People with Parkinson) and 15 healthy individuals. Three types of recordings (Static Spiral Test, Dynamic Spiral Test and Stability Test) are taken.

14. Person Classification Gait Data: Gait is considered a biometric criterion. Therefore, we tried to classify people with gait analysis with this gait data set.

15. Post-Operative Patient: Dataset of patient features

16. SCADI: First self-care activities dataset based on ICF-CY.

17. Shuttle Landing Control: Tiny database; all nominal values

18. Soybean (Small): Michalski's famous soybean disease database

19. StoneFlakes: Stone flakes are waste products of the stone tool production in the prehistoric era. The variables are means of geometric and stylistic features of the flakes contained in different inventories.

20. Trains: 2 data formats (structured, one-instance-per-line)


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML