1. DGP2 - The Second Data Generation Program: Generates application domains based on specific parameters, number of features, and proportion of positive to negative examples 2. Document Understanding: Five concepts, expressed as predicates, to be learned 3. Movie: This data set contains a list of over 10000 films including many older, odd, and cult films. There is information on actors, casts, directors, producers, studios, etc. 4. Statlog Project: Various Databases: Vehicle silhouttes, Landsat Sattelite, Shuttle, Australian Credit Approval, Heart Disease, Image Segmentation, German Credit 5. Undocumented: Various datasets without documentation (feel free to explore!) |