1. Turkish Spam V01: The TurkishSpam data set contains spam and normal emails written in Turkish.
2. Twitter Data set for Arabic Sentiment Analysis: Sentiment Analysis (SA) has been studied extensively for English but much less for Arabic. Two main approaches have been devised: corpus-based and lexicon-based.
3. Balance Scale: Balance scale weight & distance database
4. Balloons: Data previously used in a cognitive psychology experiment; 4 data sets represent different conditions of the experiment
5. Hayes-Roth: Topic: human subjects study
6. NYSK: NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique Strauss-Kahn (May 2011).
7. Nursery: The Nursery Database was derived from a hierarchical decision model originally developed to rank applications for nursery schools.