1. Speaker Accent Recognition: Data set featuring single English words read by speakers from six different countries for accent detection and recognition.
2. Gender Gap in Spanish WP: Data set used to estimate the number of women editors and their editing practices in the Spanish Wikipedia.
3. Real-time Election Results: Portugal 2019: Data set of the real-time election results of the 2019 Portuguese Parliamentary Election.
4. Drug consumption (quantified): Classify the type of drug consumer from personality data.
5. Student Performance: Predict student performance in secondary education (high school).
6. Higher Education Students Performance Evaluation Dataset: The data were collected in 2019 from students of the Faculty of Engineering and the Faculty of Educational Sciences. The purpose is to predict students' end-of-term performance using machine learning techniques.
7. Sports articles for objectivity analysis: 1000 sports articles were labeled using Amazon Mechanical Turk as objective or subjective. The raw texts, extracted features, and the URLs from which the articles were retrieved are provided.