Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (49)
Regression (10)
Clustering (5)
Other (3)

Attribute Type

Categorical (0)
Numerical (7)
Mixed (2)

Data Type - Undo

Multivariate (10)
Univariate (0)
Sequential (0)
Time-Series (0)
Text (0)
Domain-Theory (0)
Other (0)

Area

Life Sciences (2)
Physical Sciences (1)
CS / Engineering (2)
Social Sciences (2)
Business (0)
Game (0)
Other (3)

# Attributes - Undo

Less than 10 (6)
10 to 100 (10)
Greater than 100 (2)

# Instances - Undo

Less than 100 (0)
100 to 1000 (10)
Greater than 1000 (19)

Format Type

Matrix (8)
Non-Matrix (2)

10 Data Sets

Table View  List View


1. Concrete Slump Test: Concrete is a highly complex material. The slump flow of concrete is not only determined by the water content, but that is also influenced by other concrete ingredients.

2. Fertility: 100 volunteers provide a semen sample analyzed according to the WHO 2010 criteria. Sperm concentration are related to socio-demographic data, environmental factors, health status, and life habits

3. Forest Fires: This is a difficult regression task, where the aim is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data (see details at: http://www.dsi.uminho.pt/~pcortez/forestfires).

4. Housing: Taken from StatLib library

5. GPS Trajectories: The dataset has been feed by Android app called Go!Track. It is available at Goolge Play Store(https://play.google.com/store/apps/details?id=com.go.router).

6. Automobile: From 1985 Ward's Automotive Yearbook

7. Student Performance: Predict student performance in secondary education (high school).

8. Breast Cancer Wisconsin (Prognostic): Prognostic Wisconsin Breast Cancer Database

9. Tennis Major Tournament Match Statistics: This is a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.

10. wiki4HE: Survey of faculty members from two Spanish universities on teaching uses of Wikipedia


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML