Center for Machine Learning and Intelligent Systems

Amazon book reviews Data Set

Abstract: 213,335 book reviews for 8 different books. Some books are scored very negatively in general, while others are scored very positively.

Data Set Characteristics: Multivariate, Text
Number of Instances:
Attribute Characteristics: Integer, Real
Number of Attributes:
Date Donated:
Associated Tasks: Classification, Clustering
Missing Values?
Number of Web Hits:


Source:

Ahmet Taspinar, info '@',

Data Set Information:

Number of reviews per book:

Gone Girl: 41,974
The Girl on the Train: 37,139
The Fault in Our Stars: 35,844
Fifty Shades of Grey: 32,977
Unbroken: 25,876
The Hunger Games: 24,027
The Goldfinch: 22,861
The Martian: 22,571

Attribute Information:

Each entry is separated by a newline character ('\n'). Each entry contains four attributes, which are separated by a space (' '):
1. review score
2. tail of review url ([Web Link])
3. review title
4. HTML of review text
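
The four-attribute layout above can be read with a few lines of Python. This is a minimal sketch, not the donor's own loader. The names Review, parse_entry and strip_html are hypothetical. Because review titles and HTML bodies themselves contain spaces, the sketch takes the separator as a parameter and defaults to a tab; that default is an assumption and should be checked against the actual files. The score is read as a float, consistent with the "Integer, Real" attribute types.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


@dataclass
class Review:
    score: float    # 1. review score
    url_tail: str   # 2. tail of review url
    title: str      # 3. review title
    html: str       # 4. HTML of review text


class _TextExtractor(HTMLParser):
    """Collects text nodes and discards the tags around them."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_html(html: str) -> str:
    """Return the visible text of an HTML fragment with whitespace collapsed."""
    extractor = _TextExtractor()
    extractor.feed(html)
    extractor.close()
    return " ".join("".join(extractor.parts).split())


def parse_entry(line: str, sep: str = "\t") -> Review:
    """Split one entry into its four attributes.

    `sep` is an assumption: pass " " if the files really use single spaces.
    Splitting at most three times keeps separators inside the HTML intact.
    """
    score, url_tail, title, html = line.rstrip("\n").split(sep, 3)
    return Review(float(score), url_tail, title, html)
```

A file can then be processed one entry per line, for example `reviews = [parse_entry(line) for line in open(path, encoding="utf-8")]`, where `path` and the encoding are placeholders for whichever file and encoding the download actually uses.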

Relevant Papers:

[Web Link]

Citation Request:

[Web Link]
