1. Multimodal Damage Identification for Humanitarian Computing: 5879 captioned images (image and text) from social media related to damage during natural disasters/wars, and belong to 6 classes: Fires, Floods, Natural landscape, Infrastructural, Human, Non-damage.
2. Gender Gap in Spanish WP: Data set used to estimate the number of women editors and their editing practices in the Spanish Wikipedia
3. Real-time Election Results: Portugal 2019: Data set of the real-time election results of the 2019 Portuguese Parliamentary Election.
4. Drug consumption (quantified): Classify type of drug consumer by personality data
5. Communities and Crime: Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from the 1995 FBI UCR.
6. Communities and Crime Unnormalized: Communities in the US. Data combines socio-economic data from the '90 Census, law enforcement data from the 1990 Law Enforcement Management and Admin Stats survey, and crime data from the 1995 FBI UCR
7. BlogFeedback: Instances in this dataset contain features extracted from blog posts. The task associated with the data is to predict how many comments the post will receive.