Browse Datasets

Car Evaluation

Derived from simple hierarchical decision model, this database may be useful for testing constructive induction and structure discovery methods.

Mushroom

From Audobon Society Field Guide; mushrooms described in terms of physical characteristics; classification: poisonous or edible

Breast Cancer

This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also lymphography and primary-tumor.)

National Poll on Healthy Aging (NPHA)

This is a subset of the NPHA dataset filtered down to develop and validate machine learning algorithms for predicting the number of doctors a survey respondent sees in a year. This dataset’s records represent seniors who responded to the NPHA survey.

Tic-Tac-Toe Endgame

Binary classification task on possible configurations of tic-tac-toe game

Congressional Voting Records

1984 United Stated Congressional Voting Records; Classify as Republican or Democrat

Nursery

Nursery Database was derived from a hierarchical decision model originally developed to rank applications for nursery schools.

Solar Flare

Each class attribute counts the number of solar flares of a certain class that occur in a 24 hour period

Balance Scale

Balance scale weight & distance database

Drug Induced Autoimmunity Prediction

This dataset comprises molecular descriptors generated using RDKit, specifically curated for the study of drug-induced autoimmunity through ensemble machine learning approaches. It is divided into a training set and a testing set, containing numerical features that represent molecular properties and structural characteristics of drugs. The dataset supports predictive modeling tasks aimed at identifying potential autoimmune risks associated with drug candidates. These molecular descriptors include physicochemical properties, providing a comprehensive foundation for machine learning analysis. The dataset facilitates the development of interpretable models for drug toxicity prediction, contributing to advancements in computational toxicology and drug safety assessment.

0 to 10 of 45

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy