Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (18)
Regression (8)
Clustering (10)
Other (1)

Attribute Type - Undo

Categorical (0)
Numerical (10)
Mixed (0)

Data Type - Undo

Multivariate (36)
Univariate (3)
Sequential (10)
Time-Series (14)
Text (12)
Domain-Theory (1)
Other (0)


Life Sciences (0)
Physical Sciences (0)
CS / Engineering (6)
Social Sciences (0)
Business (3)
Game (0)
Other (1)

# Attributes

Less than 10 (2)
10 to 100 (6)
Greater than 100 (2)

# Instances - Undo

Less than 100 (0)
100 to 1000 (1)
Greater than 1000 (10)

Format Type - Undo

Matrix (10)
Non-Matrix (6)

10 Data Sets

Table View  List View

1. BLE RSSI Dataset for Indoor localization and Navigation: This dataset contains RSSI readings gathered from an array of Bluetooth Low Energy (BLE) iBeacons in a real-world and operational indoor environment for localization and navigation purposes.

2. clickstream data for online shopping: The dataset contains information on clickstream from online store offering clothing for pregnant women.

3. detection_of_IoT_botnet_attacks_N_BaIoT: This dataset addresses the lack of public botnet datasets, especially for the IoT. It suggests *real* traffic data, gathered from 9 commercial IoT devices authentically infected by Mirai and BASHLITE.

4. Educational Process Mining (EPM): A Learning Analytics Data Set: Educational Process Mining data set is built from the recordings of 115 subjects' activities through a logging application while learning with an educational simulator.

5. Gesture Phase Segmentation: The dataset is composed by features extracted from 7 videos with people gesticulating, aiming at studying Gesture Phase Segmentation. It contains 50 attributes divided into two files for each video.

6. Grammatical Facial Expressions: This dataset supports the development of models that make possible to interpret Grammatical Facial Expressions from Brazilian Sign Language (Libras).

7. Kitsune Network Attack Dataset: A cybersecurity dataset containing nine different network attacks on a commercial IP-based surveillance system and an IoT network. The dataset includes reconnaissance, MitM, DoS, and botnet attacks.

8. Online Retail: This is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail.

9. Online Retail II: A real online retail transaction data set of two years.

10. UJIIndoorLoc-Mag: The UJIIndoorLoc-Mag is an indoor localization database to test Indoor Positioning System that rely on Earth's magnetic field variations.

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML