1. YearPredictionMSD: Prediction of the release year of a song from audio features. Songs are mostly western, commercial tracks ranging from 1922 to 2011, with a peak in the year 2000s.
2. Tennis Major Tournament Match Statistics: This is a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.
3. KDD Cup 1998 Data: This is the data set used for The Second International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-98
4. Housing: Taken from StatLib library
5. Geographical Original of Music: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the geographical origin of music.
6. Facebook Comment Volume Dataset: Instances in this dataset contain features extracted from facebook posts. The task associated with the data is to predict how many comments the post will receive.
7. Automobile: From 1985 Ward's Automotive Yearbook
8. Auto MPG: Revised from CMU StatLib library, data concerns city-cycle fuel consumption