Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact



Welcome to the UC Irvine Machine Learning Repository!

We currently maintain 585 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians.


Latest News:
09-24-2018: Welcome to the new Repository admins Dheeru Dua and Efi Karra Taniskidou!
04-04-2013: Welcome to the new Repository admins Kevin Bache and Moshe Lichman!
03-01-2010: Note from donor regarding Netflix data
10-16-2009: Two new data sets have been added.
09-14-2009: Several data sets have been added.
03-24-2008: New data sets have been added!
06-25-2007: Two new data sets have been added: UJI Pen Characters, MAGIC Gamma Telescope


Featured Data Set:  URL Reputation

Task: Classification
Data Type: Multivariate, Time-Series
# Attributes: 3231961
# Instances: 2396130

Anonymized 120-day subset of the ICML-09 URL data containing 2.4 million examples and 3.2 million features.

Newest Data Sets:
02-17-2021: Hungarian Chickenpox Cases
12-09-2020: Myocardial infarction complications
10-14-2020: Gait Classification
10-03-2020: Codon usage
09-15-2020: in-vehicle coupon recommendation
09-14-2020: Dry Bean Dataset
09-03-2020: Intelligent Media Accelerometer and Gyroscope (IM-AccGyro) Dataset
08-30-2020: AI4I 2020 Predictive Maintenance Dataset
08-25-2020: Wisesight Sentiment Corpus
08-22-2020: LastFM Asia Social Network
08-06-2020: Multi-view Brain Networks
08-03-2020: Wheat kernels

Most Popular Data Sets (hits since 2007):
3856179: Iris
2085875: Adult
1612564: Wine
1457738: Heart Disease
1448657: Wine Quality
1447436: Breast Cancer Wisconsin (Diagnostic)
1410919: Bank Marketing
1334309: Car Evaluation
1094865: Human Activity Recognition Using Smartphones
1060223: Abalone
981831: Forest Fires
886969: Student Performance
