Roman Urdu Sentiment Analysis Dataset (RUSAD)
Donated on 8/14/2023
The dataset was collected to support research on sentiment analysis of Roman Urdu text.
Dataset Characteristics
Text
Subject Area
Computer Science
Associated Tasks
Classification
Feature Type
Categorical
# Instances
11000
# Features
2
Dataset Information
Additional Information
The dataset has two columns: the first contains the binary sentiment label (positive or negative), and the second contains the review text.
Has Missing Values?
No
Introductory Paper
By Khawar Mehmood, D. Essam, Kamran Shafi. 2018
Published in Advances in Intelligent Systems and Computing
Variable Information
The dataset has two attributes. The first holds the binary sentiment label (positive or negative); the second holds the review text.
Class Labels
positive, negative
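The two-attribute layout described above can be read with a few lines of standard-library Python. This is a minimal sketch, not the creators' loading code: it assumes the label comes first and the review second, that labels can be compared case-insensitively, and that fields are unquoted. The `read_reviews` function and its `has_header` switch are hypothetical names introduced for illustration (the page does not say whether the file has a header row), and the two sample rows are made up.

```python
import csv
import io
from collections import Counter

# Class labels as listed for this dataset.
VALID_LABELS = {"positive", "negative"}


def read_reviews(handle, has_header=False):
    """Parse a two-column TSV stream into (label, review) pairs.

    Assumptions: column 1 is the label, column 2 is the review text,
    and labels match case-insensitively. `has_header` is a hypothetical
    switch, since the source does not state whether a header row exists.
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    rows = []
    for line_no, fields in enumerate(reader, start=1):
        if not fields:
            continue  # skip blank lines
        if has_header and line_no == 1:
            continue  # skip the header row if the caller says there is one
        if len(fields) != 2:
            raise ValueError(f"line {line_no}: expected 2 columns, got {len(fields)}")
        label, review = fields[0].strip().lower(), fields[1].strip()
        if label not in VALID_LABELS:
            raise ValueError(f"line {line_no}: unexpected label {label!r}")
        rows.append((label, review))
    return rows


# Tiny in-memory stand-in for the real file; these rows are invented.
sample = io.StringIO("positive\tbohat acha hai\nnegative\tbilkul bekaar\n")
rows = read_reviews(sample)
label_counts = Counter(label for label, _ in rows)
print(label_counts)
```

To run this on the downloaded file, pass an open file handle instead of the `io.StringIO` object, for example `open("Dataset 11000 Reviews.tsv", encoding="utf-8", newline="")`. The UTF-8 encoding is an assumption and may need adjusting.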
Dataset Files
File | Size |
---|---|
Dataset 11000 Reviews.tsv | 1.3 MB |
pip install ucimlrepo

from ucimlrepo import fetch_ucirepo

# fetch dataset
roman_urdu_sentiment_analysis_dataset_rusad = fetch_ucirepo(id=854)

# data (as pandas dataframes)
X = roman_urdu_sentiment_analysis_dataset_rusad.data.features
y = roman_urdu_sentiment_analysis_dataset_rusad.data.targets

# metadata
print(roman_urdu_sentiment_analysis_dataset_rusad.metadata)

# variable information
print(roman_urdu_sentiment_analysis_dataset_rusad.variables)

Mehmood, K., Essam, D., & Malik, M. (2018). Roman Urdu Sentiment Analysis Dataset (RUSAD) [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C59046.
Creators
Khawar Mehmood
k.mehmood@unsw.edu.au
Daryl Essam
d.essam@unsw.edu.au
Muhammad Kamran Malik
kamran.malik@pucit.edu.pk
DOI
10.24432/C59046
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the dataset for any purpose, provided that appropriate credit is given.