Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

QtyT40I10D100K Data Set
Download: Data Folder, Data Set Description

Abstract: Since there is no numerical sequential data stream available in standard data sets, this data set is generated from the original T40I10D100K data set

Data Set Characteristics:  

Sequential

Number of Instances:

3960456

Area:

N/A

Attribute Characteristics:

Integer

Number of Attributes:

4

Date Donated

2012-10-21

Associated Tasks:

N/A

Missing Values?

N/A

Number of Web Hits:

13338


Source:

Omid Shakeri, M.Sc
omid.shakeri '@' tmu.ac.ir ; omid.shakeri '@' gmail.com
Data Mining Lab., Computer Engineering Department, Kharazmi University, Karaj/Tehran, Iran

Mir Mohsen Pedram, Ph.D
pedram '@' tmu.ac.ir
Data Mining Lab., Computer Engineering Department, Kharazmi University, Karaj/Tehran, Iran


Data Set Information:

This data set is generated from the original T40I10D100K data set, to mine fuzzy sequential patterns over quantitative streams. While the original T40I10D100K is generated from the synthetic data generator described in “R. Agrawal, R. Srikant, Fast algorithms for mining association rules, 20th Intl. Conf. on Very Large Databases (VLDB’94), pp. 487-499. 1994”.
The data set is a SQL Server 2008 database, which can be attached to a SQL Server Instance to use


Attribute Information:

CustomerID: the ID of the customer who has performed the transaction (randomly generated [1 100])
Time: the time that the transaction has been performed
Transaction: the transaction which has been performed
Quantity: the quantity value of each transaction (randomly generated [1 10])


Relevant Papers:

The papers which use this data set are being reviewed by referees.



Citation Request:

Please refer to the Machine Learning Repository's citation policy


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML