Center for Machine Learning and Intelligent Systems

OpinRank Review Dataset

Abstract: This data set contains user reviews of cars and hotels, collected from Edmunds (~42,230 car reviews) and Tripadvisor (~259,000 hotel reviews).

Kavita Ganesan & ChengXiang Zhai
University of Illinois at Urbana-Champaign

Data Set Information:

Car Reviews
-Full reviews of cars for model-years 2007, 2008, and 2009
-There are about 140-250 cars for each model year
-Extracted fields include dates, author names, favorites and the full textual review
-Total number of reviews: ~42,230
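
The fields listed above can be held in a small record type. The following is a minimal sketch only: this page does not describe the on-disk file format, so the tag-delimited layout (`<DOC>`, `<DATE>`, `<AUTHOR>`, `<TEXT>`, `<FAVORITE>`) is an assumption, and `CarReview`, `parse_car_reviews` and the sample string are hypothetical names and made-up data used for illustration.

```python
import re
from dataclasses import dataclass
from typing import Iterator


@dataclass
class CarReview:
    date: str
    author: str
    text: str
    favorite: str


# ASSUMPTION: each review is wrapped in <DOC>...</DOC> with one tag per field.
_DOC = re.compile(r"<DOC>(.*?)</DOC>", re.DOTALL)


def _field(block: str, tag: str) -> str:
    """Return the stripped contents of <tag>...</tag>, or "" if absent."""
    match = re.search(rf"<{tag}>(.*?)</{tag}>", block, re.DOTALL)
    return match.group(1).strip() if match else ""


def parse_car_reviews(raw: str) -> Iterator[CarReview]:
    """Yield one CarReview per <DOC> block found in the raw text."""
    for match in _DOC.finditer(raw):
        block = match.group(1)
        yield CarReview(
            date=_field(block, "DATE"),
            author=_field(block, "AUTHOR"),
            text=_field(block, "TEXT"),
            favorite=_field(block, "FAVORITE"),
        )


# Made-up sample, not taken from the data set.
sample_cars = (
    "<DOC>\n<DATE>01/02/2008</DATE>\n<AUTHOR>example_user</AUTHOR>\n"
    "<TEXT>Comfortable ride and good mileage.</TEXT>\n"
    "<FAVORITE>Seats</FAVORITE>\n</DOC>\n"
)
car_reviews = list(parse_car_reviews(sample_cars))
```

If the downloaded files turn out to use a different layout, only `_DOC` and `_field` would need to change; the record type follows the field list above.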

Hotel Reviews
-Full reviews of hotels in 10 different cities (Dubai, Beijing, London, New York City, New Delhi, San Francisco, Shanghai, Montreal, Las Vegas, Chicago)
-There are about 80-700 hotels in each city
-Extracted fields include date, review title and the full review
-Total number of reviews: ~259,000
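
A hotel review can be sketched the same way. This is an illustration under stated assumptions: the page lists the three fields but not how they are stored, so the one-review-per-line, tab-separated layout is assumed, and `HotelReview`, `parse_hotel_reviews` and the sample line are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass
class HotelReview:
    date: str
    title: str
    text: str


def parse_hotel_reviews(lines: Iterable[str]) -> Iterator[HotelReview]:
    """Yield one HotelReview per line.

    ASSUMPTION: each line holds date, title and full review, tab-separated.
    """
    for line in lines:
        if not line.strip():
            continue  # ignore blank lines
        # Split at most twice so any tabs inside the review stay in `text`.
        parts = line.rstrip("\n").split("\t", 2)
        if len(parts) < 3:
            continue  # skip lines that do not match the assumed layout
        date, title, text = (part.strip() for part in parts)
        yield HotelReview(date=date, title=title, text=text)


# Made-up sample, not taken from the data set.
sample_hotels = [
    "Nov 20 2008\tGreat location\tClean rooms and friendly staff.\n",
    "\n",
    "malformed line without tabs\n",
]
hotel_reviews = list(parse_hotel_reviews(sample_hotels))
```

Lines that do not split into three fields are skipped rather than raising, which keeps a long pass over a city's files from stopping on one bad record.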

Relevant Papers:

Ganesan, K. A., and C. X. Zhai, 'Opinion-Based Entity Ranking', Information Retrieval, 2011.

Citation Request:

Bibtex as follows:

@article {opinrank,
title = {Opinion-Based Entity Ranking},
journal = {Information Retrieval},
year = {2011},
keywords = {adhoc multifaceted search, entity oriented search, entity ranking, entity retrieval, product search},
doi = {10.1007/s10791-011-9174-8},
attachments = {[Web Link]},
author = {Kavita Ganesan and ChengXiang Zhai}
}
