Center for Machine Learning and Intelligent Systems


5 Data Sets



1. NYSK: NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique Strauss-Kahn (May 2011).

2. Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015: An accurate dataset describing the trajectories of all 442 taxis operating in the city of Porto, Portugal.

3. Amazon book reviews: 213,335 book reviews for 8 different books. Some of the books are rated very negatively overall, while others are rated very positively.

4. Individual household electric power consumption: Measurements of electric power consumption in one household with a one-minute sampling rate over a period of almost 4 years. Different electrical quantities and some sub-metering values are available.

5. 3D Road Network (North Jutland, Denmark): 3D road network with highly accurate elevation information (±20 cm) from Denmark, used in eco-routing and fuel/CO2-estimation routing algorithms.

