Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Opinosis Opinion ⁄ Review Data Set
Download: Data Folder, Data Set Description

Abstract: This dataset contains sentences extracted from user reviews on a given topic. Example topics are “performance of Toyota Camry” and “sound quality of ipod nano”.

Data Set Characteristics:  

Text

Number of Instances:

51

Area:

Computer

Attribute Characteristics:

N/A

Number of Attributes:

N/A

Date Donated

2010-07-06

Associated Tasks:

N/A

Missing Values?

N/A

Number of Web Hits:

35091


Source:

Kavita Ganesan
kganes2 '@' illinois.edu
http://kavita-ganesan.com/opinosis-opinion-dataset


Data Set Information:

This dataset contains sentences extracted from user reviews on a given topic. Example topics are “performance of Toyota Camry” and “sound quality of ipod nano”, etc. In total there are 51 such topics with each topic having approximately 100 sentences (on the average). The reviews were obtained from various sources - Tripadvisor (hotels), Edmunds.com (cars) and Amazon.com (various electronics). The dataset file also comes with gold standard summaries used for the Opinosis summarization paper (see relevant papers).


Attribute Information:

N/A


Relevant Papers:

Kavita Ganesan, ChengXiang Zhai, Jiawei Han. Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010). Beijing, China.



Citation Request:

Kavita Ganesan, ChengXiang Zhai, Jiawei Han. Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010). Beijing, China.


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML