Bone marrow transplant: children
Donated on 4/20/2020
The data set describes pediatric patients with several hematologic diseases who underwent unmanipulated allogeneic unrelated donor hematopoietic stem cell transplantation.
Dataset Characteristics
Subject Area
Health and Medicine
Associated Tasks
Classification, Regression
Feature Type
Integer, Real
# Instances
# Features
Dataset Information
Additional Information
The data set describes pediatric patients with several hematologic diseases: malignant disorders (including acute lymphoblastic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, and myelodysplastic syndrome) and nonmalignant cases (including severe aplastic anemia, Fanconi anemia, and X-linked adrenoleukodystrophy). All patients underwent unmanipulated allogeneic unrelated donor hematopoietic stem cell transplantation. The motivation of the study was to identify the most important factors influencing the success or failure of the transplantation procedure. In particular, the aim was to verify the hypothesis that an increased dosage of CD34+ cells/kg extends overall survival time without the simultaneous occurrence of undesirable events affecting patients' quality of life (Kawłak et al., 2010). The data set has been used in our work on survival rules (Wróbel et al., 2017) and user-guided rule induction (Sikora et al., 2019). The authors of the research on stem cell transplantation (Kawłak et al., 2010), who inspired our study, also contributed to the set.
Has Missing Values?
Introductory Paper
By M. Sikora, Łukasz Wróbel, Adam Gudyś. 2018
Published in Knowledge-Based Systems
Variables Table
Variable Name | Role | Type | Demographic | Description | Units | Missing Values |
Recipientgender | Feature | Binary | Gender | Male - 1, Female - 0 | | no |
Stemcellsource | Feature | Binary | | Source of hematopoietic stem cells (Peripheral blood - 1, Bone marrow - 0) | | no |
Donorage | Feature | Integer | Age | Age of the donor at the time of hematopoietic stem cells apheresis | | no |
Donorage35 | Feature | Binary | Age | Donor age <35 - 0, Donor age >=35 - 1 | | no |
IIIV | Feature | Binary | | Development of acute graft versus host disease stage II or III or IV (Yes - 1, No - 0) | | no |
Gendermatch | Feature | Binary | Gender | Compatibility of the donor and recipient according to their gender (Female to Male - 1, Other - 0) | | no |
DonorABO | Feature | Categorical | | ABO blood group of the donor of hematopoietic stem cells (0 = 0, A = 1, B = -1, AB = 2) | | no |
RecipientABO | Feature | Categorical | | ABO blood group of the recipient of hematopoietic stem cells (0 = 0, A = 1, B = -1, AB = 2) | | yes |
RecipientRh | Feature | Binary | | Presence of the Rh factor on recipient's red blood cells ('+' - 1, '-' - 0) | | yes |
ABOmatch | Feature | Binary | | Compatibility of the donor and the recipient of hematopoietic stem cells according to ABO blood group (matched - 1, mismatched - 1) | | yes |
Showing the first 10 of 37 variables.
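The binary codings in the table can be sketched in a few lines of Python. This is only an illustration: the codes are copied from the table, while the names `RECIPIENT_GENDER`, `STEM_CELL_SOURCE`, `donor_age_35`, and `describe`, as well as the sample record, are hypothetical and not part of the data set or of any library.

```python
# Codes copied from the variables table; all names below are illustrative only.
RECIPIENT_GENDER = {1: "male", 0: "female"}
STEM_CELL_SOURCE = {1: "peripheral blood", 0: "bone marrow"}


def donor_age_35(donor_age: float) -> int:
    """Return the Donorage35 code: 0 if donor age < 35, 1 if donor age >= 35."""
    return 1 if donor_age >= 35 else 0


def describe(record: dict) -> dict:
    """Translate the coded fields of one record into readable labels."""
    return {
        "recipient_gender": RECIPIENT_GENDER[record["Recipientgender"]],
        "stem_cell_source": STEM_CELL_SOURCE[record["Stemcellsource"]],
        "donor_age_35": donor_age_35(record["Donorage"]),
    }


# Made-up record, not taken from the data set.
sample = {"Recipientgender": 1, "Stemcellsource": 0, "Donorage": 41.5}
print(describe(sample))
```

Keeping the code-to-label mappings in one place makes it easier to check that a derived flag such as Donorage35 agrees with the raw Donorage value.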
Additional Variable Information
donor_age - Age of the donor at the time of hematopoietic stem cells apheresis
donor_age_below_35 - Is donor age less than 35 (yes, no)
donor_ABO - ABO blood group of the donor of hematopoietic stem cells (0, A, B, AB)
donor_CMV - Presence of cytomegalovirus infection in the donor of hematopoietic stem cells prior to transplantation (present, absent)
recipient_age - Age of the recipient of hematopoietic stem cells at the time of transplantation
recipient_age_below_10 - Is recipient age below 10 (yes, no)
recipient_age_int - Age of the recipient discretized to intervals (0, 5], (5, 10], (10, 20]
recipient_gender - Gender of the recipient (female, male)
recipient_body_mass - Body mass of the recipient of hematopoietic stem cells at the time of the transplantation
recipient_ABO - ABO blood group of the recipient of hematopoietic stem cells (0, A, B, AB)
recipient_rh - Presence of the Rh factor on recipient's red blood cells (plus, minus)
recipient_CMV - Presence of cytomegalovirus infection in the recipient of hematopoietic stem cells prior to transplantation (present, absent)
disease - Type of disease (ALL, AML, chronic, nonmalignant, lymphoma)
disease_group - Type of disease (malignant, nonmalignant)
gender_match - Compatibility of the donor and recipient according to their gender (female to male, other)
ABO_match - Compatibility of the donor and the recipient of hematopoietic stem cells according to ABO blood group (matched, mismatched)
CMV_status - Serological compatibility of the donor and the recipient of hematopoietic stem cells according to cytomegalovirus infection prior to transplantation (the higher the value, the lower the compatibility)
HLA_match - Compatibility of antigens of the main histocompatibility complex of the donor and the recipient of hematopoietic stem cells (10/10, 9/10, 8/10, 7/10)
HLA_mismatch - HLA matched or mismatched
antigen - In how many antigens there is a difference between the donor and the recipient (0-3)
allel - In how many alleles there is a difference between the donor and the recipient (0-4)
HLA_group_1 - The difference type between the donor and the recipient (HLA matched, one antigen, one allel, DRB1 cell, two allele or allel+antigen, two antigenes+allel, mismatched)
risk_group - Risk group (high, low)
stem_cell_source - Source of hematopoietic stem cells (peripheral blood, bone marrow)
tx_post_relapse - The second bone marrow transplantation after relapse (yes, no)
CD34_x1e6_per_kg (CD34kgx10d6) - CD34+ cell dose per kg of recipient body weight (10^6/kg)
CD3_x1e8_per_kg - CD3+ cell dose per kg of recipient body weight (10^8/kg)
CD3_to_CD34_ratio - CD3+ cell to CD34+ cell ratio
ANC_recovery - Neutrophils recovery defined as neutrophils count >0.5 x 10^9/L (yes, no)
time_to_ANC_recovery - Time in days to neutrophils recovery
PLT_recovery - Platelet recovery defined as platelet count >50000/mm3 (yes, no)
time_to_PLT_recovery - Time in days to platelet recovery
acute_GvHD_II_III_IV - Development of acute graft versus host disease stage II or III or IV (yes, no)
acute_GvHD_III_IV - Development of acute graft versus host disease stage III or IV (yes, no)
time_to_acute_GvHD_III_IV - Time in days to development of acute graft versus host disease stage III or IV
extensive_chronic_GvHD - Development of extensive chronic graft versus host disease (yes, no)
relapse - Relapse of the disease (yes, no)
survival_time - Time of observation (if alive) or time to event (if dead) in days
survival_status - Survival status (0 - alive, 1 - dead)
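Because survival_time is an observation time for patients still alive and a time to event for those who died, the pair (survival_time, survival_status) is right-censored survival data. As a minimal sketch of how such a pair can be summarized, the snippet below computes a Kaplan-Meier survival estimate in plain Python. The function name `kaplan_meier` and the toy values are assumptions made for illustration; they are not taken from the data set and this is not the method used by the data set's authors.

```python
from collections import Counter


def kaplan_meier(times, statuses):
    """Kaplan-Meier estimate from right-censored data.

    times    -- observation or event times in days (survival_time)
    statuses -- 1 = dead (event observed), 0 = alive (censored), as in survival_status
    Returns a list of (time, survival_probability) pairs, one per distinct event time.
    """
    deaths = Counter(t for t, s in zip(times, statuses) if s == 1)
    exits = Counter(times)      # every patient leaves the risk set at their own time
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:                   # the estimate only steps down at event times
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]     # remove both deaths and censored patients
    return curve


# Toy values for illustration only, not taken from the data set.
toy_times = [10, 20, 20, 35, 50]
toy_status = [1, 0, 1, 1, 0]
for day, prob in kaplan_meier(toy_times, toy_status):
    print(day, round(prob, 3))
```

Censored patients (status 0) never lower the curve directly; they only shrink the risk set for later event times, which is why the status column has to be used together with the time column.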
Dataset Files
File | Size |
bone-marrow.arff | 27.3 KB |
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo

# fetch dataset
bone_marrow_transplant_children = fetch_ucirepo(id=565)

# data (as pandas dataframes)
X = bone_marrow_transplant_children.data.features
y = bone_marrow_transplant_children.data.targets

# metadata
print(bone_marrow_transplant_children.metadata)

# variable information
print(bone_marrow_transplant_children.variables)
Sikora, M., Wróbel, Ł., & Gudyś, A. (2020). Bone marrow transplant: children [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5NP6Z.
Marek Sikora
Silesian University of Technology
Łukasz Wróbel
Adam Gudyś
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.