Clickstream Data for Online Shopping
Donated on 12/8/2019
The dataset contains information on clickstream from online store offering clothing for pregnant women.
Dataset Characteristics
Multivariate, Sequential
Subject Area
Business
Associated Tasks
Classification, Regression, Clustering
Feature Type
Integer, Real
# Instances
165474
# Features
14
Dataset Information
Additional Information
The dataset contains information on clickstream from online store offering clothing for pregnant women. Data are from five months of 2008 and include, among others, product category, location of the photo on the page, country of origin of the IP address and product price in US dollars.
Has Missing Values?
No
Introductory Paper
By Mariusz Łapczyński, S. Białowąs. 2013
Published in Studia Ekonomiczne
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
year | Feature | Date | 2008 | no | |
month | Feature | Date | from April (4) to August (8) | no | |
day | Feature | Date | day number of the month | no | |
order | Feature | Integer | sequence of clicks during one session | no | |
country | Feature | Categorical | variable indicating the country of origin of the IP address | no | |
session ID | Feature | Integer | variable indicating session id (short record) | no | |
page 1 (main category) | Feature | Categorical | concerns the main product category | no | |
page 2 (clothing model) | Feature | Categorical | contains information about the code for each product (217 products) | no | |
colour | Feature | Categorical | colour of product | no | |
location | Feature | Categorical | photo location on the page, the screen has been divided into six parts | no |
0 to 10 of 14
Additional Variable Information
The dataset contains 14 variables described in a separate file (See 'Data set description')
Dataset Files
File | Size |
---|---|
e-shop clothing 2008.csv | 6.4 MB |
e-shop clothing 2008 data description.txt | 3.2 KB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset clickstream_data_for_online_shopping = fetch_ucirepo(id=553) # data (as pandas dataframes) X = clickstream_data_for_online_shopping.data.features y = clickstream_data_for_online_shopping.data.targets # metadata print(clickstream_data_for_online_shopping.metadata) # variable information print(clickstream_data_for_online_shopping.variables)
Clickstream Data for Online Shopping [Dataset]. (2019). UCI Machine Learning Repository. https://doi.org/10.24432/C5QK7X.
Keywords
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.