1. KDD Cup 1998 Data: This is the data set used for The Second International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-98
2. Automobile: From 1985 Ward's Automotive Yearbook
3. Auto MPG: Revised from CMU StatLib library, data concerns city-cycle fuel consumption
4. Demand Forecasting for a store: Contains data for a store from week 1 to week 146.
5. Tennis Major Tournament Match Statistics: This is a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.
6. Facebook Comment Volume Dataset: Instances in this dataset contain features extracted from facebook posts. The task associated with the data is to predict how many comments the post will receive.
7. Metro Interstate Traffic Volume: Hourly Minneapolis-St Paul, MN traffic volume for westbound I-94. Includes weather and holiday features from 2012-2018.
8. Geographical Original of Music: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the geographical origin of music.
9. Air quality: Contains the responses of a gas multisensor device deployed on the field in an Italian city.
10. YearPredictionMSD: Prediction of the release year of a song from audio features. Songs are mostly western, commercial tracks ranging from 1922 to 2011, with a peak in the year 2000s.