Browse Datasets
Sort by # Views, desc
Car Evaluation
Derived from simple hierarchical decision model, this database may be useful for testing constructive induction and structure discovery methods.
Mushroom
From Audobon Society Field Guide; mushrooms described in terms of physical characteristics; classification: poisonous or edible
Breast Cancer
This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also lymphography and primary-tumor.)
National Poll on Healthy Aging (NPHA)
This is a subset of the NPHA dataset filtered down to develop and validate machine learning algorithms for predicting the number of doctors a survey respondent sees in a year. This dataset’s records represent seniors who responded to the NPHA survey.
Tic-Tac-Toe Endgame
Binary classification task on possible configurations of tic-tac-toe game
Congressional Voting Records
1984 United Stated Congressional Voting Records; Classify as Republican or Democrat
Nursery
Nursery Database was derived from a hierarchical decision model originally developed to rank applications for nursery schools.
Solar Flare
Each class attribute counts the number of solar flares of a certain class that occur in a 24 hour period
Balance Scale
Balance scale weight & distance database
Drug Induced Autoimmunity Prediction
This dataset comprises molecular descriptors generated using RDKit, specifically curated for the study of drug-induced autoimmunity through ensemble machine learning approaches. It is divided into a training set and a testing set, containing numerical features that represent molecular properties and structural characteristics of drugs. The dataset supports predictive modeling tasks aimed at identifying potential autoimmune risks associated with drug candidates. These molecular descriptors include physicochemical properties, providing a comprehensive foundation for machine learning analysis. The dataset facilitates the development of interpretable models for drug toxicity prediction, contributing to advancements in computational toxicology and drug safety assessment.
0 to 10 of 45
