Youtube cookery channels viewers comments in Hinglish
Donated on 7/2/2019
The datasets are taken from the top two Indian cooking channels, Nisha Madhulika and Kabita’s Kitchen. The comments are written in Hinglish.
Dataset Characteristics
Multivariate, Text
Subject Area
Computer Science
Associated Tasks
Classification
Feature Type
-
# Instances
9800
# Features
3
Dataset Information
Additional Information
The datasets are taken from the top two Indian cooking channels, Nisha Madhulika and Kabita’s Kitchen. Each dataset is divided into seven categories:
• Label 1: Gratitude
• Label 2: About the recipe
• Label 3: About the video
• Label 4: Praising
• Label 5: Hybrid
• Label 6: Undefined
• Label 7: Suggestions and queries
All labelling was done manually.
Nisha Madhulika dataset:
• Dataset characteristics: Multivariate
• Number of instances: 4900
• Area: Cooking
• Attribute characteristics: Real
• Number of attributes: 3
• Date donated: March, 2019
• Associated tasks: Classification
• Missing values: None
• Number of subscribers: 7,063,604
Kabita Kitchen dataset:
• Dataset characteristics: Multivariate
• Number of instances: 4900
• Area: Cooking
• Attribute characteristics: Real
• Number of attributes: 3
• Date donated: March, 2019
• Associated tasks: Classification
• Missing values: None
• Number of subscribers: 4,867,502
Each channel has two separate dataset files. The files with "preprocessing" in their names were generated after preprocessing and exploratory data analysis on both datasets. These files include:
• Id
• Comment text
• Labels
• Count of stop-words
• Uppercase words
• Hashtags
• Word count
• Char count
• Average words
• Numeric
The main files include:
• Id
• Comment text
• Labels
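The derived columns in the preprocessing files (stop-word count, uppercase words, hashtags, word count, char count, average words, numeric) can be approximated per comment with a few string operations. The following is only a sketch under stated assumptions, not the donors' actual preprocessing: the function name `comment_features`, the tiny `STOP_WORDS` set, whitespace tokenization, and the reading of "Average words" as average word length are all assumptions made for illustration.

```python
# Hypothetical, deliberately tiny stop-word list. The dataset description
# does not say which stop-word list was used in the original preprocessing.
STOP_WORDS = {"the", "is", "a", "and", "hai", "ka", "ki", "ko", "se"}


def comment_features(text: str) -> dict:
    """Compute simple per-comment text statistics (illustrative only)."""
    words = text.split()  # assumption: whitespace tokenization
    word_count = len(words)
    return {
        "word_count": word_count,
        "char_count": len(text),  # counts spaces as characters
        # Assumption: "Average words" means average word length.
        "avg_word_length": (
            sum(len(w) for w in words) / word_count if words else 0.0
        ),
        "stopword_count": sum(1 for w in words if w.lower() in STOP_WORDS),
        "uppercase_count": sum(1 for w in words if w.isupper()),
        "hashtag_count": sum(1 for w in words if w.startswith("#")),
        "numeric_count": sum(1 for w in words if w.isdigit()),
    }


# Invented example comment, not taken from the dataset.
features = comment_features("Thank you mam recipe bahut ACHHI hai #yummy 2")
print(features)
```

Values computed this way may differ from the shipped preprocessing columns if the donors used a different tokenizer or stop-word list.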
Has Missing Values?
No
Dataset Files
File | Size |
---|---|
Cooking Data/nisha_preprocessing.csv | 408.1 KB |
Cooking Data/kabita_preprocessing.csv | 400 KB |
Cooking Data/Nishafin.csv | 374.3 KB |
Cooking Data/kabitakitchen.csv | 359.8 KB |
Cooking Data/readme.docx | 14.7 KB |
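As a rough sketch of how one of the main files could be inspected, the snippet below counts comments per category using only the standard library. It assumes the column order Id, comment text, label, a header row, and integer labels 1 to 7; none of these details are confirmed by the dataset description, and the helper name `label_distribution` is hypothetical.

```python
import csv
import io
from collections import Counter

# Category names as listed in the dataset description.
LABEL_NAMES = {
    1: "Gratitude",
    2: "About the recipe",
    3: "About the video",
    4: "Praising",
    5: "Hybrid",
    6: "Undefined",
    7: "Suggestions and queries",
}


def label_distribution(lines, label_index=2, has_header=True):
    """Count rows per category from an iterable of CSV lines.

    Assumption: the label sits in the third column as an integer.
    Rows that are too short or carry a non-integer label are skipped.
    """
    reader = csv.reader(lines)
    if has_header:
        next(reader, None)  # drop the assumed header row
    counts = Counter()
    for row in reader:
        if len(row) <= label_index:
            continue
        raw = row[label_index].strip()
        if not raw.isdigit():
            continue
        counts[LABEL_NAMES.get(int(raw), "Unknown label")] += 1
    return counts


# Invented rows standing in for a real file such as Nishafin.csv.
sample = io.StringIO(
    "id,comment,label\n"
    "1,thank you mam,1\n"
    '2,"nice recipe, mam",2\n'
    "3,thanks,1\n"
)
demo_counts = label_distribution(sample)
print(demo_counts)
```

For a real file, an open file handle can be passed instead of the in-memory sample, for example `open("Cooking Data/Nishafin.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is also an assumption.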
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo

# fetch dataset
youtube_cookery_channels_viewers_comments_in_hinglish = fetch_ucirepo(id=535)

# data (as pandas dataframes)
X = youtube_cookery_channels_viewers_comments_in_hinglish.data.features
y = youtube_cookery_channels_viewers_comments_in_hinglish.data.targets

# metadata
print(youtube_cookery_channels_viewers_comments_in_hinglish.metadata)

# variable information
print(youtube_cookery_channels_viewers_comments_in_hinglish.variables)
Youtube cookery channels viewers comments in Hinglish [Dataset]. (2019). UCI Machine Learning Repository. https://doi.org/10.24432/C55026.
DOI
10.24432/C55026
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.