Opinion Corpus for Lebanese Arabic Reviews (OCLAR)

Donated on 6/16/2019

Opinion Corpus for Lebanese Arabic Reviews (OCLAR) corpus is utilizable for Arabic sentiment classification on services’ reviews, including hotels, restaurants, shops, and others.

Dataset Characteristics

Text

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

Integer

# Instances

3916

# Features

3916

Dataset Information

Additional Information

The researchers of OCLAR Marwan et al. (2019), they gathered Arabic costumer reviews from (https://maps.google.com) and Zomato website (https://www.zomato.com/lebanon) on wide scope of domain, including restaurants, hotels, hospitals, local shops, etc. The corpus finally contains 3916 reviews in 5-rating scale. For this research purpose, the positive class considers rating stars from 5 to 3 of 3465 reviews, and the negative class is represented from values of 1 and 2 of about 451 texts.

Has Missing Values?

No

Variable Information

1- 3916 text reviews 2- 5-rating scale: 1: 303 2: 148 3: 418 4: 734 5: 2313 Positive class includes rating stars from 5 to 3 of 3465 total. Negative class include rating stars from 1 to 2 of 451 total.

Dataset Files

FileSize
OCLAR - Opinion Corpus for Lebanese Arabic Reviews.csv374 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (374.2 KB)
1 citations
848 views

Creators

Marwan Omari

Moustafa Al-Hajj

Nacereddine Hammami

Amani Sabra

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy