1. Blood Transfusion Service Center: Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem. 2. Credit Approval: This data concerns credit card applications; good mix of attributes 3. Japanese Credit Screening: Includes domain theory (generated by talking to Japanese domain experts); data in Lisp 4. Reuters Transcribed Subset: This dataset is created by reading out 200 files from the 10 largest Reuters
classes and using an Automatic Speech Recognition system to create
corresponding transcriptions. 5. Statlog (Australian Credit Approval): This file concerns credit card applications. This database exists elsewhere in the repository (Credit Screening Database) in a slightly different form 6. Statlog (German Credit Data): This dataset classifies people described by a set of attributes as good or bad credit risks. Comes in two formats (one all numeric). Also comes with a cost matrix |