Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (44)
Regression (12)
Clustering (9)
Other (0)

Attribute Type - Undo

Categorical (0)
Numerical (12)
Mixed (4)

Data Type

Multivariate (12)
Univariate (1)
Sequential (0)
Time-Series (3)
Text (0)
Domain-Theory (1)
Other (0)


Life Sciences (1)
Physical Sciences (2)
CS / Engineering (5)
Social Sciences (1)
Business (2)
Game (0)
Other (1)

# Attributes

Less than 10 (4)
10 to 100 (6)
Greater than 100 (2)

# Instances - Undo

Less than 100 (2)
100 to 1000 (12)
Greater than 1000 (27)

Format Type - Undo

Matrix (12)
Non-Matrix (4)

12 Data Sets

Table View  List View

1. Computer Hardware: Relative CPU Performance Data, described in terms of its cycle time, memory size, etc.

2. Facebook metrics: Facebook performance metrics of a renowned cosmetic's brand Facebook page.

3. Forest Fires: This is a difficult regression task, where the aim is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data (see details at:

4. Concrete Slump Test: Concrete is a highly complex material. The slump flow of concrete is not only determined by the water content, but that is also influenced by other concrete ingredients.

5. Yacht Hydrodynamics: Delft data set, used to predict the hydodynamic performance of sailing yachts from dimensions and velocity.

6. Tennis Major Tournament Match Statistics: This is a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.

7. Breast Cancer Wisconsin (Prognostic): Prognostic Wisconsin Breast Cancer Database

8. Gas sensor array exposed to turbulent gas mixtures: A chemical detection platform composed of 8 chemoresistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The acquired time series of the sensors are provided.

9. Student Performance: Predict student performance in secondary education (high school).

10. Twin gas sensor arrays: 5 replicates of an 8-MOX gas sensor array were exposed to different gas conditions (4 volatiles at 10 concentration levels each).

11. Energy efficiency: This study looked into assessing the heating load and cooling load requirements of buildings (that is, energy efficiency) as a function of building parameters.

12. ISTANBUL STOCK EXCHANGE: Data sets includes returns of Istanbul Stock Exchange with seven other international index; SP, DAX, FTSE, NIKKEI, BOVESPA, MSCE_EU, MSCI_EM from Jun 5, 2009 to Feb 22, 2011.

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML