ILPD (Indian Liver Patient Dataset)
Donated on 5/20/2012
Death by liver cirrhosis continues to increase, given the increase in alcohol consumption rates, chronic hepatitis infections, and obesity-related liver disease. Notwithstanding the high mortality of this disease, liver diseases do not affect all sub-populations equally. The early detection of pathology is a determinant of patient outcomes, yet female patients appear to be marginalized when it comes to early diagnosis of liver pathology. The dataset comprises 584 patient records collected from the NorthEast of Andhra Pradesh, India. The prediction task is to determine whether a patient suffers from liver disease based on the information about several biochemical markers, including albumin and other enzymes required for metabolism.
Dataset Characteristics
Multivariate
Subject Area
Health and Medicine
Associated Tasks
Classification
Feature Type
Integer, Real
# Instances
583
# Features
10
Dataset Information
What do the instances in this dataset represent?
Medical patients
Does the dataset contain data that might be considered sensitive in any way?
Yes. The data contains information about the age and gender of the patients.
Was there any data preprocessing performed?
Any patient whose age exceeded 89 is listed as being of age "90".
Additional Information
This data set contains records of 416 patients diagnosed with liver disease and 167 patients without liver disease. This information is contained in the class label named 'Selector'. There are 10 variables per patient: age, gender, total Bilirubin, direct Bilirubin, total proteins, albumin, A/G ratio, SGPT, SGOT and Alkphos. Of the 583 patient records, 441 are male, and 142 are female. The current dataset has been used to study - differences in patients across US and Indian patients that suffer from liver diseases. - gender-based disparities in predicting liver disease, as previous studies have found that biochemical markers do not have the same effectiveness for male and female patients.
Has Missing Values?
No
Introductory Paper
By I. Straw, Honghan Wu. 2022
Published in BMJ Health & Care Informatics
Variables Table
Variable Name | Role | Type | Demographic | Description | Units | Missing Values |
---|---|---|---|---|---|---|
Age | Feature | Integer | Age | Age of the patient. Any patient whose age exceeded 89 is listed as being of age "90". | years | no |
Gender | Feature | Binary | Gender | Gender of the patient | no | |
TB | Feature | Continuous | Total Bilirubin | no | ||
DB | Feature | Continuous | Direct Bilirubin | no | ||
Alkphos | Feature | Integer | Alkaline Phosphotase | no | ||
Sgpt | Feature | Integer | Alamine Aminotransferase | no | ||
Sgot | Feature | Integer | Aspartate Aminotransferase | no | ||
TP | Feature | Continuous | Total Proteins | no | ||
ALB | Feature | Continuous | Albumin | no | ||
A/G Ratio | Feature | Continuous | Albumin and Globulin Ratio | no |
0 to 10 of 11
Dataset Files
File | Size |
---|---|
Indian Liver Patient Dataset (ILPD).csv | 23.2 KB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset ilpd_indian_liver_patient_dataset = fetch_ucirepo(id=225) # data (as pandas dataframes) X = ilpd_indian_liver_patient_dataset.data.features y = ilpd_indian_liver_patient_dataset.data.targets # metadata print(ilpd_indian_liver_patient_dataset.metadata) # variable information print(ilpd_indian_liver_patient_dataset.variables)
Ramana, B. & Venkateswarlu, N. (2022). ILPD (Indian Liver Patient Dataset) [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5D02C.
Keywords
Creators
Bendi Ramana
N. Venkateswarlu
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.