Sundanese Twitter Dataset

Donated on 11/26/2021

This dataset contains tweet of the second-largest local language in Indonesia and is used for emotion classification.

Tabular

Computer Science

Classification

2510

For what purpose was the dataset created?

This dataset is created as contribution for NLP research particularly in Indonesia

Who funded the creation of the dataset?

This dataset is self-funded

What do the instances in this dataset represent?

Are there recommended data splits?

Was there any data preprocessing performed?

tokenization, stopword removal, stemming

Has Missing Values?

By Oddy Virgantara Putra; Fathin Muhammad Wasmanson; Triana Harmini; Shoffin Nahwa Utama. 2020

Published in Conference

Variable Name	Role	Type	Description	Units	Missing Values
label	Target	Categorical			no
data	Feature	Categorical			no

Rows per page

0 to 2 of 2

1 citations

4279 views

Oddy Virgantara Putra

oddy@unida.gontor.ac.id

This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.