Turkish Spam V01
Donated on 4/6/2019
The TurkishSpam dataset contains spam and normal emails written in Turkish.
Dataset Characteristics
Text
Subject Area
Social Science
Associated Tasks
Classification
Feature Type
-
# Instances
826
# Features
2
Dataset Information
Additional Information
A total of 330 spam and 496 normal emails were collected from several personal accounts.
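The class counts above imply a moderately imbalanced dataset. As a quick sanity check, the sketch below works out the spam share and the accuracy that a hypothetical "always predict normal" classifier would reach. Only the two counts come from the dataset description; the variable names are illustrative.

```python
# Class counts as stated in the dataset description.
n_spam = 330
n_normal = 496
n_total = n_spam + n_normal  # should equal the listed number of instances (826)

# Fraction of messages that are spam.
spam_share = n_spam / n_total

# Accuracy of a hypothetical classifier that always predicts the majority class.
majority_baseline = max(n_spam, n_normal) / n_total

print(f"total={n_total} spam_share={spam_share:.3f} majority_baseline={majority_baseline:.3f}")
```

Any classifier trained on this data should be compared against that majority-class baseline (roughly 60%) rather than against 50%.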
Has Missing Values?
No
Variable Information
text = contains the email message text
classification = contains the class label (spam or normal) for the message
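Given the two variables described above, a minimal sketch of counting messages per class with the Python standard library might look like the following. It assumes the CSV has a header row whose column names match the variable names (`text`, `classification`); the sample rows and the label values `spam` and `normal` are made up for illustration and may not match the encoding used in trspam.csv.

```python
import csv
import io
from collections import Counter


def count_labels(handle, label_column="classification"):
    """Count rows per label in a CSV that has a header row."""
    reader = csv.DictReader(handle)
    return Counter(row[label_column] for row in reader)


# Tiny made-up sample standing in for trspam.csv (placeholder rows and labels).
sample = io.StringIO(
    "text,classification\n"
    '"Merhaba, toplantı yarın saat onda.",normal\n'
    '"Hemen tıklayın, büyük indirim!",spam\n'
    '"Raporu ekte gönderiyorum.",normal\n'
)

counts = count_labels(sample)
print(counts)
```

For the real file, the same function could be called on `open("trspam.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption and may need adjusting.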
Dataset Files
File | Size |
---|---|
trspam.csv | 2.1 MB |
Install the ucimlrepo package
pip install ucimlrepo
Import the dataset into your code
from ucimlrepo import fetch_ucirepo

# fetch dataset
turkish_spam_v01 = fetch_ucirepo(id=530)

# data (as pandas dataframes)
X = turkish_spam_v01.data.features
y = turkish_spam_v01.data.targets

# metadata
print(turkish_spam_v01.metadata)

# variable information
print(turkish_spam_v01.variables)
Citation
Turkish Spam V01 [Dataset]. (2019). UCI Machine Learning Repository. https://doi.org/10.24432/C5WG7F.
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.