Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (39)
Regression (12)
Clustering (21)
Other (5)

Attribute Type

Categorical (0)
Numerical (12)
Mixed (0)

Data Type - Undo

Multivariate (102)
Univariate (7)
Sequential (12)
Time-Series (38)
Text (11)
Domain-Theory (4)
Other (0)

Area

Life Sciences (0)
Physical Sciences (0)
CS / Engineering (8)
Social Sciences (0)
Business (3)
Game (0)
Other (1)

# Attributes

Less than 10 (5)
10 to 100 (6)
Greater than 100 (0)

# Instances

Less than 100 (0)
100 to 1000 (0)
Greater than 1000 (11)

Format Type

Matrix (7)
Non-Matrix (5)

12 Data Sets

Table View  List View


1. Open University Learning Analytics dataset: Open University Learning Analytics Dataset contains data about courses, students and their interactions with Virtual Learning Environment for seven selected courses and more than 30000 students.

2. Parking Birmingham: Data collected from car parks in Birmingham that are operated by NCP from Birmingham City Council. UK Open Government Licence (OGL). https://data.birmingham.gov.uk/dataset/birmingham-parking

3. 3D Road Network (North Jutland, Denmark): 3D road network with highly accurate elevation information (+-20cm) from Denmark used in eco-routing and fuel/Co2-estimation routing algorithms.

4. GNFUV Unmanned Surface Vehicles Sensor Data Set 2: The data-set contains eight (2x4) data-sets of mobile sensor readings data (humidity, temperature) corresponding to a swarm of four Unmanned Surface Vehicles (USVs) in a test-bed, Athens, Greece.

5. Online Retail II: A real online retail transaction data set of two years.

6. Metro Interstate Traffic Volume: Hourly Minneapolis-St Paul, MN traffic volume for westbound I-94. Includes weather and holiday features from 2012-2018.

7. UJIIndoorLoc-Mag: The UJIIndoorLoc-Mag is an indoor localization database to test Indoor Positioning System that rely on Earth's magnetic field variations.

8. Educational Process Mining (EPM): A Learning Analytics Data Set: Educational Process Mining data set is built from the recordings of 115 subjects' activities through a logging application while learning with an educational simulator.

9. SML2010: This dataset is collected from a monitor system mounted in a domotic house. It corresponds to approximately 40 days of monitoring data.

10. Geo-Magnetic field and WLAN dataset for indoor localisation from wristband and smartphone: A multisource and multivariate dataset for indoor localisation methods based on WLAN and Geo-Magnetic field fingerprinting

11. Incident management process enriched event log: This event log was extracted from data gathered from the audit system of an instance of the ServiceNow platform used by an IT company and enriched with data loaded from a relational database.

12. Cargo 2000 Freight Tracking and Tracing: Sanitized and anonymized Cargo 2000 (C2K) airfreight tracking and tracing events, covering five months of business execution (3,942 process instances, 7,932 transport legs, 56,082 activities).


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML