Sirtuin6 Small Molecules
Donated on 10/27/2022
The dataset contains 100 molecules, each described by the 6 most relevant descriptors for identifying candidate inhibitors of a target protein, Sirtuin6. The molecules are grouped into low and high binding free energy (BFE) classes.
Dataset Characteristics
Tabular
Subject Area
Biology
Associated Tasks
Classification
Feature Type
-
# Instances
100
# Features
6
Dataset Information
What do the instances in this dataset represent?
Small molecules
Was there any data preprocessing performed?
The original data consist of a complete set of 1875 molecular descriptors generated by the PaDEL-Descriptor software. Because some of these features are redundant, feature selection is needed before classification. We reduced the descriptor set by Unsupervised Forward Selection, then used the hyperbox classification method in combination with partial least squares regression to determine the most relevant molecular descriptors of the drug molecules for efficient classification.
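To illustrate the idea of removing redundant descriptors, here is a minimal sketch of a greedy pairwise-correlation filter. It is a simplified stand-in, not the Unsupervised Forward Selection procedure the authors used. The function names, the 0.95 threshold, and the toy values are all assumptions made for the example.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)

def drop_redundant(columns, threshold=0.95):
    """Greedily keep columns whose |r| with every already kept column is <= threshold.

    `columns` maps a descriptor name to its list of values (hypothetical layout).
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept

# Toy values for illustration only, NOT taken from the dataset
toy = {
    "d1": [1.0, 2.0, 3.0, 4.0],
    "d2": [2.0, 4.0, 6.0, 8.0],    # exact multiple of d1, so redundant
    "d3": [1.0, -1.0, 1.0, -1.0],  # weakly correlated with d1, so kept
}
print(drop_redundant(toy))
```

The result depends on column order, because earlier columns are always preferred over later ones that correlate with them.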
Has Missing Values?
No
Introductory Paper
By Mehmet Tardu, F. Rahim, I. Kavakli, Metin Türkay. 2016
Published in RAIRO Oper. Res.
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
SC-5 | Feature | Continuous | | | no |
SP-6 | Feature | Continuous | | | no |
SHBd | Feature | Continuous | | | no |
minHaaCH | Feature | Continuous | | | no |
maxwHBa | Feature | Continuous | | | no |
FMF | Feature | Continuous | | | no |
Class | Target | Categorical | | | no |
Dataset Files
File | Size |
---|---|
SIRTUIN6.csv | 5.7 KB |
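After downloading SIRTUIN6.csv, it can be useful to check the file against the Variables Table. The sketch below assumes that the CSV header uses exactly the variable names listed above, in the same order; this has not been verified against the file. `check_schema` is a hypothetical helper, and the sample rows are placeholders, not real data.

```python
import csv
import io

# Assumed header: the names and order from the Variables Table
EXPECTED_COLUMNS = ["SC-5", "SP-6", "SHBd", "minHaaCH", "maxwHBa", "FMF", "Class"]

def check_schema(csv_file):
    """Return the rows as dicts after verifying the header and numeric features."""
    reader = csv.DictReader(csv_file)
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    rows = list(reader)
    for row in rows:
        for name in EXPECTED_COLUMNS[:-1]:
            float(row[name])  # raises on a missing or non-numeric value
    return rows

# Placeholder rows for illustration only, NOT values from SIRTUIN6.csv
sample = io.StringIO(
    "SC-5,SP-6,SHBd,minHaaCH,maxwHBa,FMF,Class\n"
    "0.1,0.2,0.3,0.4,0.5,0.6,A\n"
    "0.6,0.5,0.4,0.3,0.2,0.1,B\n"
)
rows = check_schema(sample)
print(len(rows), "rows OK")
```

For the real file, pass an open handle instead of the in-memory sample, for example `open("SIRTUIN6.csv", newline="")`.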
pip install ucimlrepo

    from ucimlrepo import fetch_ucirepo

    # fetch dataset
    sirtuin6_small_molecules = fetch_ucirepo(id=748)

    # data (as pandas dataframes)
    X = sirtuin6_small_molecules.data.features
    y = sirtuin6_small_molecules.data.targets

    # metadata
    print(sirtuin6_small_molecules.metadata)

    # variable information
    print(sirtuin6_small_molecules.variables)

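Since the associated task is classification, a quick baseline can help sanity-check the loaded features. The sketch below is a plain nearest-centroid classifier scored with leave-one-out accuracy. It is not the hyperbox method from the introductory paper. The function name and the toy points are assumptions made for the example.

```python
def nearest_centroid_loo(X, y):
    """Leave-one-out accuracy of a nearest-centroid classifier.

    X is a list of numeric feature rows, y the matching list of class labels.
    """
    correct = 0
    for i in range(len(X)):
        # Build one centroid per class from every sample except sample i
        sums, counts = {}, {}
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:
                continue
            acc = sums.setdefault(label, [0.0] * len(row))
            for k, value in enumerate(row):
                acc[k] += value
            counts[label] = counts.get(label, 0) + 1
        centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}
        # Predict the class whose centroid is closest in squared Euclidean distance
        predicted = min(
            centroids,
            key=lambda c: sum((a - b) ** 2 for a, b in zip(X[i], centroids[c])),
        )
        correct += predicted == y[i]
    return correct / len(X)

# Toy, well-separated points for illustration only, NOT values from the dataset
toy_X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]]
toy_y = ["a", "a", "a", "b", "b", "b"]
print(nearest_centroid_loo(toy_X, toy_y))
```

With the fetched dataframes, the inputs could be passed as `X.values.tolist()` and `y.iloc[:, 0].tolist()`. The descriptors may be on different scales, so standardizing them first is probably advisable.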
Tardu, M. & Rahim, F. (2016). Sirtuin6 Small Molecules [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C56C9Z.
Creators
Mehmet Tardu
mtardu@ku.edu.tr
Koç University
Fatih Rahim
frahim@ku.edu.tr
DOI
10.24432/C56C9Z
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.