Amazon Access Samples
Donated on 9/12/2011
Amazon's InfoSec is getting smarter about the way Access data is leveraged. This is an anonymized sample of access provisioned within the company.
Dataset Characteristics
Time-Series, Domain-Theory
Subject Area
Business
Associated Tasks
Regression, Clustering, Causal-Discovery
Feature Type
-
# Instances
30000
# Features
20000
Dataset Information
Additional Information
This is a sparse data set, less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-access-samples-2.0.csv] this file contains the access for users [amzn-anon-access-samples-history-2.0.csv] this file contains the access history for a given user
Has Missing Values?
No
Variable Information
__amzn-anon-access-samples-2.0.csv__ This is a sparse data set containing users and their assigned access. The file contains 4 categories of attributes. 1) [PERSON_{ATTRIBUTE}] This category describes the 'user' who was given access. The [PERSON_ID] column is the primary key column for the file. There is one row per user. PERSON_ID: id of the user PERSON_MGR_ID: id of the user's manager PERSON_ROLLUP_1: user grouping id PERSON_ROLLUP_2: user grouping id PERSON_ROLLUP_3: user grouping id PERSON_DEPTNAME: department desciption id PERSON_LOCATION: region id PERSON_BUSINESS_TITLE: title id PERSON_BUSINESS_TITLE_DETAIL: description id PERSON_JOB_CODE: job code id PERSON_COMPANY: company id PERSON_JOB_FAMILY: job family id 2) [RESOURCE_{ID}] This category of attributes are the resources that a users can possibly have access to. A user will have a 1 in this column if the have access to it otherwise it will be 0. 3) [GROUP_{ID}] - This category of attributes are the groups that a users can possibly have access to. A user will have a 1 in this column if the have access to it otherwise it will be 0. 4) [SYSTEM_SUPPORT_{ID}] - This category of attributes are the system that a user can possibly be supporting. A user will have a 1 in this column if the have can possibly be supporting it, otherwise it will be 0. __amzn-anon-access-samples-history-2.0.csv__ Permissions Time series data. Here is a short description of the columns: ACTION: either 'remove_access' or 'add_access' TARGET_NAME: either the {RESOURCE_ID} or {GROUP_ID} LOGIN: the id of the user that is obtaining or losing access REQUEST_DATE: YYYY-MM-DD HH:MM:SS AUTHORIZATION_DATE: YYYY-MM-DD HH:MM:SS
Dataset Files
File | Size |
---|---|
amzn-anon-access-samples.tgz | 11.7 MB |
Papers Citing this Dataset
Sort by Year, desc
By Carlos Cotrini, Luca Corinzia, Thilo Weghorn, David Basin. 2019
Published in ArXiv.
0 to 1 of 1
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset amazon_access_samples = fetch_ucirepo(id=216) # data (as pandas dataframes) X = amazon_access_samples.data.features y = amazon_access_samples.data.targets # metadata print(amazon_access_samples.metadata) # variable information print(amazon_access_samples.variables)
Montanez, K. (2011). Amazon Access Samples [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5JW2K.
Creators
Ken Montanez
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.