
Turkish Spam V01
Donated on 4/6/2019
The TurkishSpam data set contains spam and normal emails written in Turkish.
Dataset Characteristics
Text
Subject Area
Social Science
Associated Tasks
Classification
Feature Type
-
# Instances
826
# Features
2
Dataset Information
Additional Information
A total of 330 spam and 496 normal e-mails were collected from several personal accounts.
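The two class counts above imply a moderately imbalanced data set. As a quick sanity check, the sketch below derives the class shares and the accuracy of a trivial majority-class baseline from those counts alone. The variable names are illustrative and are not part of the data set.

```python
# Class counts as stated in the dataset description.
SPAM_COUNT = 330
NORMAL_COUNT = 496

# Should match the listed number of instances (826).
total = SPAM_COUNT + NORMAL_COUNT

spam_share = SPAM_COUNT / total
normal_share = NORMAL_COUNT / total

# A classifier that always predicts the majority class ("normal") would reach
# this accuracy, so it is a useful floor when evaluating real models.
majority_baseline_accuracy = max(spam_share, normal_share)

print(f"total e-mails: {total}")
print(f"spam share: {spam_share:.3f}")
print(f"normal share: {normal_share:.3f}")
print(f"majority-class baseline accuracy: {majority_baseline_accuracy:.3f}")
```

Roughly 40% of the e-mails are spam, so plain accuracy should probably be reported alongside per-class metrics such as precision and recall.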
Has Missing Values?
No
Variable Information
text = contains the e-mail messages
classification = contains the classification information (spam or normal)
Dataset Files
| File | Size |
|---|---|
| trspam.csv | 2.1 MB |
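Once trspam.csv is downloaded, the class counts can be checked with the standard library alone. The following is a minimal sketch, not a definitive loader: it assumes the file has a header row whose label column is named `classification` (taken from the variable description, not verified against the file), and the helper `count_labels`, the sample rows, and the label strings are hypothetical.

```python
import csv
import io
from collections import Counter


def count_labels(csv_file, label_column="classification"):
    """Count rows per label in an open CSV file object.

    Assumes a header row containing `label_column`; the real header of
    trspam.csv may differ, so adjust the name after inspecting the file.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[label_column] for row in reader)


# Tiny in-memory stand-in for the real file (made-up rows, hypothetical labels).
sample = io.StringIO(
    "text,classification\n"
    '"Merhaba, toplantı yarın saat 10\'da.",normal\n'
    '"Tebrikler! Hemen tıklayın, ödül kazandınız.",spam\n'
    '"Rapor ektedir.",normal\n'
)
label_counts = count_labels(sample)
print(label_counts)

# With the downloaded file (the encoding is an assumption; try another one
# if decoding fails):
# with open("trspam.csv", newline="", encoding="utf-8") as f:
#     print(count_labels(f))
```

If the assumptions hold, the counts from the real file should add up to the 826 instances listed above.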
Install the ucimlrepo package
pip install ucimlrepo
Import the dataset into your code
from ucimlrepo import fetch_ucirepo

# fetch dataset
turkish_spam_v01 = fetch_ucirepo(id=530)

# data (as pandas dataframes)
X = turkish_spam_v01.data.features
y = turkish_spam_v01.data.targets

# metadata
print(turkish_spam_v01.metadata)

# variable information
print(turkish_spam_v01.variables)
Citation
Turkish Spam V01 [Dataset]. (2019). UCI Machine Learning Repository. https://doi.org/10.24432/C5WG7F.
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows the dataset to be shared and adapted for any purpose, provided that appropriate credit is given.