Roman Urdu Data Set

Donated on 8/28/2018

Roman Urdu (a writing style for the Urdu language that uses the Latin script) is a limited-resource language. A data corpus comprising more than 20,000 records was collected.

Dataset Characteristics

Text

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

-

# Instances

20000

# Features

-

Dataset Information

Additional Information

Tagged for Sentiment (Positive, Negative, Neutral)
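
Because the associated task is classification, the three tags usually need to become integer class ids. The snippet below is a minimal sketch of one way to do that. The name `encode_sentiment`, the id assignment, and the exact label spellings are assumptions for illustration; the label strings in the file may differ.

```python
from typing import Optional

# Assumed spellings of the three tags; the file's actual label strings may differ.
LABEL_TO_ID = {"negative": 0, "neutral": 1, "positive": 2}


def encode_sentiment(label: str) -> Optional[int]:
    """Return an integer class id, or None for an unrecognized label."""
    # Ignore case and surrounding whitespace before the lookup.
    return LABEL_TO_ID.get(label.strip().lower())
```

Returning `None` rather than raising makes it easy to count or inspect unexpected labels before deciding how to handle them.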

Has Missing Values?

No

Variables Table

Variable Name | Role | Type | Description | Units | Missing Values
- | - | - | - | - | no
- | - | - | - | - | no

Additional Variable Information

Each record comprises two string values: one for the comment or review and one for its sentiment.
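
The two-field record layout described above can be read with Python's standard `csv` module. This is a sketch under stated assumptions: the comment is taken to be the first column and the sentiment the second, and no header row is assumed. `Record` and `read_records` are hypothetical names, and the sample rows are invented placeholders, not data from the file.

```python
import csv
import io
from collections import Counter
from typing import List, NamedTuple, TextIO


class Record(NamedTuple):
    comment: str
    sentiment: str


def read_records(handle: TextIO) -> List[Record]:
    """Parse (comment, sentiment) pairs from CSV text, skipping short rows."""
    records = []
    for row in csv.reader(handle):
        if len(row) < 2:
            continue  # blank or malformed line
        # Assumed column order: comment first, sentiment second.
        records.append(Record(row[0].strip(), row[1].strip()))
    return records


# Invented placeholder rows, not taken from the dataset.
sample = io.StringIO(
    "placeholder comment one,Positive\n"
    '"placeholder, with a comma",Negative\n'
    "placeholder comment three,Neutral\n"
)
records = read_records(sample)
label_counts = Counter(r.sentiment for r in records)
```

To read the downloaded file instead, pass a handle such as `open("Roman Urdu DataSet.csv", newline="", encoding="utf-8")`. The encoding is an assumption and may need adjusting.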

Dataset Files

File | Size
Roman Urdu DataSet.csv | 1.6 MB

Creators

Zareen Sharf
