+--------------------------------------------------------------------+ | NOTE TO ALL DOWN-LOADERS | +--------------------------------------------------------------------+ The KDD-CUP-98 data set and the accompanying documentation are now available for general use with the following restrictions: (1) The users of the data must notify Ismail Parsa (iparsa@epsilon.com) and Ken Howes (khowes@epsilon.com) in the event they produce results, visuals or tables, etc. from the data and send a note that includes a summary of the final result. (2) The authors of published and/or unpublished articles that use the KDD-Cup-98 data set must also notify the individuals listed above and send a copy of their published and/or unpublished work. (3) If you intend to use this data set for training or educational purposes, you must not reveal the name of the sponsor PVA (Paralyzed Veterans of America) to the trainees or students. You are allowed to say "a national veterans organization"... For more information regarding the KDD-Cup (including the list of the participants and the results), please visit the KDD-Cup-98 web page at http://www.epsilon.com/new While there, scroll down to Data Mining Presentations where you will find the KDD-Cup-98 web page. Ismail Parsa Epsilon 50 Cambridge Street Burlington MA 01803 USA TEL: (781) 685-6734 FAX: (781) 685-0806 +--------------------------------------------------------------------+ | LISTING of the FILES (README FILE) | +--------------------------------------------------------------------+ File Naming Conventions: o cup98 : KDD-CUP-98 o QUE : QUEstionnaire o DOC : DOCumentation o DIC : DICtionary o LRN : LeaRNing data set o VAL : VALidation data set o VALtargt : TARGeT fields for VALidation data set o .txt : plain ascii text files o .zip : PKZIP compressed files o .txt.Z : UNIX COMPRESSED files FILE NAME DESCRIPTION --------------- ------------------------------------------------------ README This list, listing the files in the FTP server and their contents. cup98NDA.txt The Non-Disclosure Agreement. MUST BE SIGNED BY ALL PARTICIPANTS AND MAILED BACK TO ISMAIL PARSA BEFORE DOWNLOADING THE DATA SETS. cup98DOC.txt This file, an overview and pointer to more detailed information about the competition cup98DIC.txt Data dictionary to accompany the analysis data set. cup98QUE.txt KDD-CUP questionnaire. PARTICIPANTS ARE REQUIRED TO FILL-OUT THE QUESTIONNAIRE and turned in with the results. cup98LRN.zip PKZIP compressed raw LEARNING data set. Internal name: cup98LRN.txt File size: 36,468,735 bytes zipped. 117,167,952 bytes unzipped. Number of Records: 95412. Number of Fields: 481. cup98VAL.zip PKZIP compressed raw VALIDATION data set. Internal name: cup98VAL.txt File size: 36,763,018 bytes zipped. 117,943,347 bytes unzipped. Number of Records: 96367. Number of Fields: 479. cup98LRN.txt.Z UNIX COMPRESSed raw LEARNING data set. Internal name: cup98LRN.txt File size: 36,579,127 bytes compressed. 117,167,952 bytes uncompressed. Number of Records: 95412. Number of Fields: 481. cup98VAL.txt.Z UNIX COMPRESSed raw VALIDATION data set. Internal name: cup98VAL.txt File size: 36,903,761 bytes compressed. 117,943,347 bytes uncompressed. Number of Records: 96367. Number of Fields: 479.