1. KDD Cup 1998 Data: This is the dataset used for the Second International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-98.
2. Facebook Comment Volume Dataset: Instances in this dataset contain features extracted from Facebook posts. The task associated with the data is to predict how many comments a post will receive.
3. Geographical Original of Music: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the geographical origin of music.
4. YearPredictionMSD: Prediction of the release year of a song from audio features. Songs are mostly Western, commercial tracks ranging from 1922 to 2011, with a peak in the 2000s.
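
All of the datasets above pose a prediction task over numeric features, so a first step is usually to separate the target from the feature columns. The following is a minimal sketch of that step for a file laid out like YearPredictionMSD, assuming comma-separated rows with the release year in the first column and the audio features after it. The function name split_target_and_features and the sample rows are hypothetical and only for illustration; the rows are made-up placeholders, not values from the real dataset.

```python
import csv
import io


def split_target_and_features(csv_text):
    """Split comma-separated rows into targets and feature vectors.

    Assumed layout (an assumption of this sketch): the first field of each
    row is the integer target (e.g. release year) and the remaining fields
    are numeric features.
    """
    years = []
    features = []
    for row in csv.reader(io.StringIO(csv_text)):
        if not row:
            continue  # skip blank lines
        years.append(int(row[0]))
        features.append([float(value) for value in row[1:]])
    return years, features


# Made-up placeholder rows, NOT real dataset values.
sample = "2001,49.9,21.4,73.0\n1987,50.5,-1.2,33.4\n"
years, features = split_target_and_features(sample)
```

For a real file, the same function could be fed the file's contents as a string; for large files, reading row by row from an open file handle would avoid loading everything into memory at once.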