Gallstone

Donated on 4/19/2025

The clinical dataset was collected from the Internal Medicine Outpatient Clinic of Ankara VM Medical Park Hospital and includes data from 319 individuals (June 2022–June 2023), 161 of whom were diagnosed with gallstone disease. It contains 38 features, including demographic, bioimpedance, and laboratory data, and was ethically approved by the Ankara City Hospital Ethics Committee (E2-23-4632). Demographic variables are age, sex, height, weight, and BMI. Bioimpedance data includes total, extracellular, and intracellular water, muscle and fat mass, protein, visceral fat area, and hepatic fat. Laboratory features are glucose, total cholesterol, HDL, LDL, triglycerides, AST, ALT, ALP, creatinine, GFR, CRP, hemoglobin, and vitamin D. The dataset is complete, with no missing values, and balanced in terms of disease status, eliminating the need for additional preprocessing. It provides a strong foundation for machine learning-based gallstone prediction using non-imaging features.

Dataset Characteristics

Tabular

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

Real

# Instances

320

# Features

38

Dataset Information

Has Missing Values?

No

Introductory Paper

Early prediction of gallstone disease with a machine learning-based method from bioimpedance and laboratory data

By Irfan Esen, Hilal Arslan, Selin Aktürk Esen, Mervenur Gülşen, Nimet Kültekin, Oğuzhan Özdemir. 2024

Published in Medicine

Variables Table

Variable NameRoleTypeDemographicDescriptionUnitsMissing Values
Gallstone StatusTargetBinaryTarget variable, Gallstones present(1), and absent(0)no
AgeFeatureIntegerAgeAge of the personno
GenderFeatureCategoricalGenderGender of the personno
ComorbidityFeatureCategoricalConcomitant diseasesno
Coronary Artery Disease (CAD)FeatureBinaryCardiovascular diseaseno
HypothyroidismFeatureBinaryUnderactive thyroid glandno
HyperlipidemiaFeatureBinaryHigh levels of fat in the bloodno
Diabetes Mellitus (DM)FeatureBinaryHigh blood sugarno
HeightFeatureIntegerHeight is the lengthno
WeightFeatureContinuousBody weightno

0 to 10 of 39

Additional Variable Information

Class Labels

- Gallstone Status: 0 (No), 1 (Yes) - Gender: 0 (Male), 1 (Female) - Comorbidity: 0 (No comorbidities present),1 (One comorbid condition), 2 (Two comorbid conditions), 3 (Three or more comorbid conditions) - Coronary Artery Disease: 0 (No), 1 (Yes) - Hypothyroidism: 0 (No), 1 (Yes) - Hyperlipidemia: 0 (No), 1 (Yes) - Diabetes Mellitus: 0 (No), 1 (Yes) - Hepatic Fat Accumulation (HFA): 0 (No fat accumulation),1 (Grade 1 (mild)), 2 (Grade 2 (moderate)), 3 (Grade 3 (severe)), 4 (Grade 4 (very severe))

Dataset Files

FileSize
dataset-uci.xlsx76.3 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (73.2 KB)
1 citations
1968 views

Creators

Irfan Esen

Hilal Arslan

hilalarslanceng@gmail.com

Ankara Yildirim Beyazit University, Department of Software Engineering

Selin Aktürk

Mervenur Gülşen

Nimet Kültekin

Oğuzhan Özdemir

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy