1. Speaker Accent Recognition: Data set featuring single English words read by speakers from six different countries, for accent detection and recognition.
2. Drug consumption (quantified): Classify the type of drug consumer from personality data.
3. Student Performance: Predict student performance in secondary education (high school).
4. Sports articles for objectivity analysis: 1000 sports articles were labeled using Amazon Mechanical Turk as objective or subjective. The raw texts, extracted features, and the URLs from which the articles were retrieved are provided.
5. A study of Asian Religious and Biblical Texts: Drawing mainly on Project Gutenberg, we combine four Asian religious texts (the Upanishads, Yoga Sutras, Buddha Sutras, and Tao Te Ching) with four Biblical texts (the Book of Wisdom, Book of Proverbs, Book of Ecclesiastes, and Book of Ecclesiasticus).