Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

Bone marrow transplant: children Data Set
Download: Data Folder, Data Set Description

Abstract: The data set describes pediatric patients with several hematologic diseases, who were subject to the unmanipulated allogeneic unrelated donor hematopoietic stem cell transplantation.

Data Set Characteristics:  

Multivariate

Number of Instances:

187

Area:

Life

Attribute Characteristics:

Integer, Real

Number of Attributes:

39

Date Donated

2020-04-21

Associated Tasks:

Classification, Regression

Missing Values?

Yes

Number of Web Hits:

327


Source:

Marek Sikora (marek.sikora '@' polsl.pl), Łukasz Wróbel (lukasz.wrobel '@' polsl.pl), Adam Gudyś (adam.gudys '@' polsl.pl)
Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, 44-100 Gliwice, Poland


Data Set Information:

The data set describes pediatric patients with several hematologic diseases: malignant disorders (i.a. acute lymphoblastic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, myelodysplastic syndrome) and nonmalignant cases (i.a. severe aplastic anemia, Fanconi anemia, with X-linked adrenoleukodystrophy). All patients were subject to the unmanipulated allogeneic unrelated donor hematopoietic stem cell transplantation.

The motivation of the study was to identify the most important factors influencing the success or failure of the transplantation procedure. In particular, the aim was to verify the hypothesis that increased dosage of CD34+ cells / kg extends overall survival time without simultaneous occurrence of undesirable events affecting patients' quality of life (Kawłak et al., 2010).

The data set has been used in our work concerning survival rules (Wróbel et al., 2017) and user-guided rule induction (Sikora et al., 2019). The authors of the research on stem cell transplantation (Kawłak et al., 2010) who inspired our study also contributed to the set.


Attribute Information:

donor_age - Age of the donor at the time of hematopoietic stem cells apheresis
donor_age_below_35 - Is donor age less than 35 (yes, no)
donor_ABO - ABO blood group of the donor of hematopoietic stem cells (0, A, B, AB)
donor_CMV - Presence of cytomegalovirus infection in the donor of hematopoietic stem cells prior to transplantation (present, absent)
recipient_age - Age of the recipient of hematopoietic stem cells at the time of transplantation
recipient_age_below_10 - Is recipient age below 10 (yes, no)
recipient_age_int - Age of the recipient discretized to intervals (0,5], (5, 10], (10, 20]
recipient_gender - Gender of the recipient (female, male)
recipient_body_mass - Body mass of the recipient of hematopoietic stem cells at the time of the transplantation
recipient_ABO - ABO blood group of the recipient of hematopoietic stem cells (0, A, B, AB)
recipient_rh - Presence of the Rh factor on recipient’s red blood cells (plus, minus)
recipient_CMV - Presence of cytomegalovirus infection in the donor of hematopoietic stem cells prior to transplantation (present, absent)
disease - Type of disease (ALL, AML, chronic, nonmalignant, lymphoma)
disease_group - Type of disease (malignant, nonmalignant)
gender_match - Compatibility of the donor and recipient according to their gender (female to male, other)
ABO_match - Compatibility of the donor and the recipient of hematopoietic stem cells according to ABO blood group (matched, mismatched)
CMV_status - Serological compatibility of the donor and the recipient of hematopoietic stem cells according to cytomegalovirus infection prior to transplantation (the higher the value, the lower the compatibility)
HLA_match - Compatibility of antigens of the main histocompatibility complex of the donor and the recipient of hematopoietic stem cells (10/10, 9/10, 8/10, 7/10)
HLA_mismatch - HLA matched or mismatched
antigen - In how many antigens there is a difference between the donor and the recipient (0-3)
allel - In how many allele there is a difference between the donor and the recipient (0-4)
HLA_group_1 - The difference type between the donor and the recipient (HLA matched, one antigen, one allel, DRB1 cell, two allele or allel+antigen, two antigenes+allel, mismatched)
risk_group - Risk group (high, low)
stem_cell_source - Source of hematopoietic stem cells (peripheral blood, bone marrow)
tx_post_relapse - The second bone marrow transplantation after relapse (yes ,no)
CD34_x1e6_per_kg - CD34kgx10d6 - CD34+ cell dose per kg of recipient body weight (10^6/kg)
CD3_x1e8_per_kg - CD3+ cell dose per kg of recipient body weight (10^8/kg)
CD3_to_CD34_ratio - CD3+ cell to CD34+ cell ratio
ANC_recovery - Neutrophils recovery defined as neutrophils count >0.5 x 10^9/L (yes, no)
time_to_ANC_recovery - Time in days to neutrophils recovery
PLT_recovery - Platelet recovery defined as platelet count >50000/mm3 (yes, no)
time_to_PLT_recovery - Time in days to platelet recovery
acute_GvHD_II_III_IV - Development of acute graft versus host disease stage II or III or IV (yes, no)
acute_GvHD_III_IV - Development of acute graft versus host disease stage III or IV (yes, no)
time_to_acute_GvHD_III_IV - Time in days to development of acute graft versus host disease stage III or IV
extensive_chronic_GvHD - Development of extensive chronic graft versus host disease (yes, no)
relapse - Relapse of the disease (yes, no)
survival_time - Time of observation (if alive) or time to event (if dead) in days
survival_status - Survival status (0 - alive, 1 - dead)


Relevant Papers:

Gudyś, A, Sikora, M, Wróbel, Ł (2020) RuleKit: A Comprehensive Suite for Rule-Based Learning,
Knowledge-Based Systems ([Web Link])
Sikora, M, Wróbel, Ł, Gudyś, A (2019) GuideR: a guided separate-and-conquer rule learning in classification, regression, and survival settings,
Knowledge-Based Systems, 173:1-14 ([Web Link])
Wróbel, Ł, Gudyś, A, Sikora, M (2017) Learning rule sets from survival data,
BMC Bioinformatics, 18(1):285 ([Web Link])
Kałwak, K, Porwolik, J, Mielcarek, M et al. (2010) Higher CD34+ and CD3+ cell doses in the graft promote long-term survival,
and have no impact on the incidence of severe acute or chronic graft-versus-host disease after in vivo t cell-depleted
unrelated donor hematopoietic stem cell transplantation in children,
Biology of Blood and Marrow Transplantation, 16(10): 1388-1401 ([Web Link])



Citation Request:

@article{sikora2019guider,
title={{GuideR: A guided separate-and-conquer rule learning in classification, regression, and survival settings}},
author={Sikora, Marek and Wr{'o}bel, {L}ukasz and Gudy{'s}, Adam},
journal={Knowledge-Based Systems},
volume={173},
pages={1--14},
year={2019},
publisher={Elsevier}
}


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML