Center for Machine Learning and Intelligent Systems


5 Data Sets


1. Cargo 2000 Freight Tracking and Tracing: Sanitized and anonymized Cargo 2000 (C2K) airfreight tracking and tracing events, covering five months of business execution (3,942 process instances, 7,932 transport legs, 56,082 activities).

2. Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone: A multisource and multivariate dataset for indoor localisation methods based on WLAN and Geo-Magnetic field fingerprinting.

3. Incident management process enriched event log: This event log was extracted from data gathered from the audit system of an instance of the ServiceNow platform used by an IT company and enriched with data loaded from a relational database.

4. microblogPCU: The microblogPCU data was crawled from the Sina Weibo microblog (http://weibo.com/). It can be used to study machine learning methods as well as for social network research.

5. Predict keywords activities in a online social media: The data was collected from Twitter over 360 consecutive days by querying 1,497 English keywords sampled from Wikipedia. The dataset is proposed for a learning-to-rank setting.

