1. Thyroid Disease: 10 separate databases from Garavan Institute 2. Australian Sign Language signs: This data consists of sample of Auslan (Australian Sign Language) signs. Examples of 95 signs were collected from five signers with a total of 6650 sign samples. 3. Abalone: Predict the age of abalone from physical measurements 4. Artificial Characters: Dataset artificially generated by using first order theory which describes structure of ten capital letters of English alphabet 5. Internet Advertisements: This dataset represents a set of possible advertisements on Internet pages. 6. Adult: Predict whether income exceeds $50K/yr based on census data. Also known as "Census Income" dataset. 7. Census Income: Predict whether income exceeds $50K/yr based on census data. Also known as "Adult" dataset. 8. Chess (King-Rook vs. King): Chess Endgame Database for White King and Rook against Black King (KRK). 9. Contraceptive Method Choice: Dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. 10. Covertype: Forest CoverType dataset 11. Census-Income (KDD): This data set contains weighted census data extracted from the 1994 and 1995 current population surveys conducted by the U.S. Census Bureau. 12. KDD Cup 1999 Data: This is the data set used for The Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99 13. Poker Hand: Purpose is to predict poker hands |