Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

KDD Cup 1998 Data Data Set
Download: Data Folder, Data Set Description

Abstract: This is the data set used for The Second International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-98

Data Set Characteristics:  


Number of Instances:




Attribute Characteristics:

Categorical, Integer

Number of Attributes:


Date Donated


Associated Tasks:


Missing Values?


Number of Web Hits:



Ismail Parsa
50 Cambridge Street
Burlington MA 01803 USA
TEL: (781) 685-6734
FAX: (781) 685-0806

Data Set Information:

Please see associated text files in the download folder.

Attribute Information:


Relevant Papers:


Papers That Cite This Data Set1:

Chris Giannella and Bassem Sayrafi. An Information Theoretic Histogram for Single Dimensional Selectivity Estimation. Department of Computer Science, Indiana University Bloomington. [View Context].

Citation Request:

The KDD-CUP-98 data set and the accompanying documentation are now available for general use with the following restrictions:

1. The users of the data must notify Ismail Parsa (iparsa '@' and Ken Howes (khowes '@' in the event they produce results, visuals or tables, etc. from the data and send a note that includes a summary of the final result.
2. The authors of published and/or unpublished articles that use the KDD-Cup-98 data set must also notify the individuals listed above and send a copy of their published and/or unpublished work.
3. If you intend to use this data set for training or educational purposes, you must not reveal the name of the sponsor PVA (Paralyzed Veterans of America) to the trainees or students. You are allowed to say "a national veterans organization"...

For more information regarding the KDD-Cup (including the list of the participants and the results), please visit the KDD-Cup-98 web page at: [Web Link]. While there, scroll down to Data Mining Presentations where you will find the KDD-Cup-98 web page.

[1] Papers were automatically harvested and associated with this data set, in collaboration with

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML