Phishing Websites

Donated on 3/25/2015

This dataset collected mainly from: PhishTank archive, MillerSmiles archive, Google’s searching operators.

Dataset Characteristics

-

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

Integer

# Instances

2456

# Features

-

Dataset Information

Additional Information

One of the challenges faced by our research was the unavailability of reliable training datasets. In fact this challenge faces any researcher in the field. However, although plenty of articles about predicting phishing websites have been disseminated these days, no reliable training dataset has been published publically, may be because there is no agreement in literature on the definitive features that characterize phishing webpages, hence it is difficult to shape a dataset that covers all possible features. In this dataset, we shed light on the important features that have proved to be sound and effective in predicting phishing websites. In addition, we propose some new features.

Has Missing Values?

No

Variables Table

Variable NameRoleTypeDemographicDescriptionUnitsMissing Values
no
no
no
no
no
no
no
no
no
no

0 to 10 of 30

Additional Variable Information

For Further information about the features see the features file in the data folder.

Download
0 citations
12891 views

Creators

Rami Mohammad

Lee McCluskey

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy