Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Browse Through:

Default Task - Undo

Classification (21)
Regression (9)
Clustering (10)
Other (2)

Attribute Type - Undo

Categorical (3)
Numerical (21)
Mixed (2)

Data Type

Multivariate (12)
Univariate (3)
Sequential (6)
Time-Series (12)
Text (1)
Domain-Theory (1)
Other (0)

Area

Life Sciences (7)
Physical Sciences (1)
CS / Engineering (10)
Social Sciences (0)
Business (1)
Game (1)
Other (1)

# Attributes - Undo

Less than 10 (21)
10 to 100 (30)
Greater than 100 (11)

# Instances

Less than 100 (2)
100 to 1000 (7)
Greater than 1000 (12)

Format Type - Undo

Matrix (42)
Non-Matrix (21)

21 Data Sets

Table View  List View


1. WISDM Smartphone and Smartwatch Activity and Biometrics Dataset : Contains accelerometer and gyroscope time-series sensor data collected from a smartphone and smartwatch as 51 test subjects perform 18 activities for 3 minutes each.

2. USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder Problem: Pat: Data used for USPTO Algorithm Competition. Contains drawing pages from US patents with manually labeled figure and part labels.

3. Shoulder Implant X-Ray Manufacturer Classification: 597 de-identified raw X-ray scans of implanted shoulder prostheses from four manufactures.

4. Shoulder Implant X-Ray Manufacturer Classification: 597 de-identified raw X-ray scans of implanted shoulder prostheses from four manufactures.

5. ser Knowledge Modeling Data (Students' Knowledge Levels on DC Electrical Machines): The dataset is about the users' learning activities and knowledge levels on subjects of DC Electrical Machines. The dataset had been obtained from online web-courses and reported in my Ph.D. Thesis.

6. selfBACK: The SELFBACK dataset is a Human Activity Recognition Dataset of 9 activity classes recorded with two tri-axial accelerometers.

7. Parking Birmingham: Data collected from car parks in Birmingham that are operated by NCP from Birmingham City Council. UK Open Government Licence (OGL). https://data.birmingham.gov.uk/dataset/birmingham-parking

8. Machine Learning based ZZAlpha Ltd. Stock Recommendations 2012-2014: The data here are the ZZAlpha® machine learning recommendations made for various US traded stock portfolios the morning of each day during the 3 year period Jan 1, 2012 - Dec 31, 2014.

9. Localization Data for Person Activity: Data contains recordings of five people performing different activities. Each person wore four sensors (tags) while performing the same scenario five times.

10. Labeled Text Forum Threads Dataset: The dataset is a collection of text forum threads with class labels reflects the reply quality to the Initial-Post, 3 for complete relevant, 2 for partially relevant, and 1 for irrelevant

11. Intelligent Media Accelerometer and Gyroscope (IM-AccGyro) Dataset: The IM-AccGyro dataset is devised to benchmark techniques dealing with human activity recognition based on inertial sensors.

12. Indoor User Movement Prediction from RSS data: This dataset contains temporal data from a Wireless Sensor Network deployed in real-world office environments. The task is intended as real-life benchmark in the area of Ambient Assisted Living.

13. Improved Spiral Test Using Digitized Graphics Tablet for Monitoring Parkinson’s Disease: Handwriting database consists of 25 PWP(People with Parkinson) and 15 healthy individuals.Three types of recordings (Static Spiral Test, Dynamic Spiral Test and Stability Test) are taken.

14. Immunotherapy Dataset: This dataset contains information about wart treatment results of 90 patients using immunotherapy.

15. Basketball dataset: It's data collected from different volunteers that are done in a basketball practice: dribbling, pass, shoot, picking the ball, and holding the ball.

16. Bar Crawl: Detecting Heavy Drinking: Accelerometer and transdermal alcohol content data from a college bar crawl. Used to predict heavy drinking episodes via mobile data.

17. Bar Crawl: Detecting Heavy Drinking: Accelerometer and transdermal alcohol content data from a college bar crawl. Used to predict heavy drinking episodes via mobile data.

18. Alcohol QCM Sensor Dataset: Five different QCM gas sensors are used, and five different gas measurements (1-octanol, 1-propanol, 2-butanol, 2-propanol and 1-isobutanol) are conducted in each of these sensors.

19. Activity recognition with healthy older people using a batteryless wearable sensor: Sequential motion data from 14 healthy older people aged 66 to 86 years old using a batteryless, wearable sensor on top of their clothing for the recognition of activities in clinical environments.

20. Activity Recognition system based on Multisensor data fusion (AReM): This dataset contains temporal data from a Wireless Sensor Network worn by an actor performing the activities: bending, cycling, lying down, sitting, standing, walking.

21. 3W dataset: The first realistic and public dataset with rare undesirable real events in oil wells.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML