Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Sentiment Labelled Sentences Data Set
Download: Data Folder, Data Set Description

Abstract: The dataset contains sentences labelled with positive or negative sentiment.

Data Set Characteristics:  

Text

Number of Instances:

3000

Area:

N/A

Attribute Characteristics:

N/A

Number of Attributes:

N/A

Date Donated

2015-05-30

Associated Tasks:

Classification

Missing Values?

N/A

Number of Web Hits:

212074


Source:

Dimitrios Kotzias dkotzias '@' ics.uci.edu


Data Set Information:

This dataset was created for the Paper 'From Group to Individual Labels using Deep Features', Kotzias et. al,. KDD 2015
Please cite the paper if you want to use it :)

It contains sentences labelled with positive or negative sentiment.

=======
Format:
=======
sentence score



=======
Details:
=======
Score is either 1 (for positive) or 0 (for negative)
The sentences come from three different websites/fields:

imdb.com
amazon.com
yelp.com

For each website, there exist 500 positive and 500 negative sentences. Those were selected randomly for larger datasets of reviews.
We attempted to select sentences that have a clearly positive or negative connotaton, the goal was for no neutral sentences to be selected.


Attribute Information:

The attributes are text sentences, extracted from reviews of products, movies, and restaurants


Relevant Papers:

'From Group to Individual Labels using Deep Features', Kotzias et. al,. KDD 2015



Citation Request:

'From Group to Individual Labels using Deep Features', Kotzias et. al,. KDD 2015


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML