Cervical Cancer (Risk Factors)

Donated on 3/2/2017

This dataset focuses on the prediction of indicators/diagnosis of cervical cancer. The features cover demographic information, habits, and historic medical records.

Dataset Characteristics

Multivariate

Subject Area

Health and Medicine

Associated Tasks

Classification

Feature Type

Integer, Real

# Instances

858

# Features

36

Dataset Information

Additional Information

The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several patients decided not to answer some of the questions because of privacy concerns (missing values).

Has Missing Values?

Yes

Introductory Paper

Transfer Learning with Partial Observability Applied to Cervical Cancer Screening

By Kelwin Fernandes, Jaime S. Cardoso, Jessica C. Fernandes. 2017

Published in Iberian Conference on Pattern Recognition and Image Analysis

Variables Table

Variable NameRoleTypeDemographicDescriptionUnitsMissing Values
AgeFeatureIntegerAgeno
Number of sexual partnersFeatureContinuousOtheryes
First sexual intercourseFeatureContinuousyes
Num of pregnanciesFeatureContinuousyes
SmokesFeatureContinuousyes
Smokes (years)FeatureContinuousyes
Smokes (packs/year)FeatureContinuousyes
Hormonal ContraceptivesFeatureContinuousyes
Hormonal Contraceptives (years)FeatureContinuousyes
IUDFeatureContinuousyes

0 to 10 of 36

Additional Variable Information

(int) Age (int) Number of sexual partners (int) First sexual intercourse (age) (int) Num of pregnancies (bool) Smokes (bool) Smokes (years) (bool) Smokes (packs/year) (bool) Hormonal Contraceptives (int) Hormonal Contraceptives (years) (bool) IUD (int) IUD (years) (bool) STDs (int) STDs (number) (bool) STDs:condylomatosis (bool) STDs:cervical condylomatosis (bool) STDs:vaginal condylomatosis (bool) STDs:vulvo-perineal condylomatosis (bool) STDs:syphilis (bool) STDs:pelvic inflammatory disease (bool) STDs:genital herpes (bool) STDs:molluscum contagiosum (bool) STDs:AIDS (bool) STDs:HIV (bool) STDs:Hepatitis B (bool) STDs:HPV (int) STDs: Number of diagnosis (int) STDs: Time since first diagnosis (int) STDs: Time since last diagnosis (bool) Dx:Cancer (bool) Dx:CIN (bool) Dx:HPV (bool) Dx (bool) Hinselmann: target variable (bool) Schiller: target variable (bool) Cytology: target variable (bool) Biopsy: target variable

Dataset Files

FileSize
risk_factors_cervical_cancer.csv99.7 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (99.8 KB)
1 citations
24139 views

Creators

Kelwin Fernandes

Jaime Cardoso

Jessica Fernandes

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy