Sentiment Labelled Sentences
Donated on 5/29/2015
The dataset contains sentences labelled with positive or negative sentiment.
Dataset Characteristics
Text
Subject Area
Other
Associated Tasks
Classification
Feature Type
-
# Instances
3000
# Features
-
Dataset Information
Additional Information
This dataset was created for the Paper 'From Group to Individual Labels using Deep Features', Kotzias et. al,. KDD 2015 Please cite the paper if you want to use it :) It contains sentences labelled with positive or negative sentiment. ======= Format: ======= sentence score ======= Details: ======= Score is either 1 (for positive) or 0 (for negative) The sentences come from three different websites/fields: imdb.com amazon.com yelp.com For each website, there exist 500 positive and 500 negative sentences. Those were selected randomly for larger datasets of reviews. We attempted to select sentences that have a clearly positive or negative connotaton, the goal was for no neutral sentences to be selected.
Has Missing Values?
No
Introductory Paper
By Dimitrios Kotzias, Misha Denil, Nando de Freitas, Padhraic Smyth. 2015
Published in Knowledge Discovery and Data Mining
Variable Information
The attributes are text sentences, extracted from reviews of products, movies, and restaurants
Dataset Files
File | Size |
---|---|
sentiment labelled sentences/imdb_labelled.txt | 83.3 KB |
sentiment labelled sentences/yelp_labelled.txt | 59.9 KB |
sentiment labelled sentences/amazon_cells_labelled.txt | 56.9 KB |
sentiment labelled sentences/.DS_Store | 6 KB |
sentiment labelled sentences/readme.txt | 1 KB |
0 to 5 of 9
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset sentiment_labelled_sentences = fetch_ucirepo(id=331) # data (as pandas dataframes) X = sentiment_labelled_sentences.data.features y = sentiment_labelled_sentences.data.targets # metadata print(sentiment_labelled_sentences.metadata) # variable information print(sentiment_labelled_sentences.variables)
Kotzias, D. (2015). Sentiment Labelled Sentences [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C57604.
Creators
Dimitrios Kotzias
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.