Center for Machine Learning and Intelligent Systems

KDD Cup 1999 Data Data Set
Download: Data Folder, Data Set Description

Abstract: This is the data set used for the Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99.

Data Set Characteristics: Multivariate
Number of Instances: 4000000
Area: Computer
Attribute Characteristics: Categorical, Integer
Number of Attributes: 42
Date Donated: 1999-01-01
Associated Tasks: Classification
Missing Values? N/A
Number of Web Hits: 191339


Source:

N/A


Data Set Information:

Please see task description.


Attribute Information:

N/A


Relevant Papers:

Salvatore J. Stolfo, Wei Fan, Wenke Lee, Andreas Prodromidis, and Philip K. Chan. Cost-based Modeling and Evaluation for Data Mining With Application to Fraud and Intrusion Detection: Results from the JAM Project.


Papers That Cite This Data Set [1]:

Stephen D. Bay, Dennis F. Kibler, Michael J. Pazzani, and Padhraic Smyth. The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation. SIGKDD Explorations, 2. 2000.


Citation Request:

Please refer to the Machine Learning Repository's citation policy.


[1] Papers were automatically harvested and associated with this data set, in collaboration with Rexa.info
