National Poll on Healthy Aging (NPHA)
Donated on 12/5/2023
This is a subset of the NPHA dataset filtered down to develop and validate machine learning algorithms for predicting the number of doctors a survey respondent sees in a year. This dataset’s records represent seniors who responded to the NPHA survey.
Dataset Characteristics
Tabular
Subject Area
Health and Medicine
Associated Tasks
Classification
Feature Type
Categorical
# Instances
714
# Features
14
Dataset Information
For what purpose was the dataset created?
The National Poll on Healthy Aging dataset was created to gather insights on the health, healthcare, and health policy issues affecting Americans aged 50 and older. By focusing on the perspectives of older adults and their caregivers, the University of Michigan aimed to inform the public, healthcare providers, policymakers, and advocates about the various aspects of aging. This includes topics like health insurance, household composition, sleep issues, dental care, prescription medications, and caregiving, thereby providing a comprehensive understanding of the health-related needs and concerns of the older population.
Who funded the creation of the dataset?
The dataset was funded by AARP and Michigan Medicine, the University of Michigan's academic medical centre.
What do the instances in this dataset represent?
Each row represents a survey respondent.
Does the dataset contain data that might be considered sensitive in any way?
Yes. There is information about race/ethnicity, gender, age.
Was there any data preprocessing performed?
For this subset of the original NPHA dataset we chose 14 features related to health and sleep to use for the prediction task. We then removed all survey respondents with missing responses for any of the chosen features.
Has Missing Values?
No
Introductory Paper
By Malani, Preeti N., Kullgren, Jeffrey, and Solway, Erica. 2017
Published in Inter-university Consortium for Political and Social Research
Variables Table
Variable Name | Role | Type | Demographic | Description | Units | Missing Values |
---|---|---|---|---|---|---|
Number_of_Doctors_Visited | Target | Categorical | The total count of different doctors the patient has seen = { 1: 0-1 doctors 2: 2-3 doctors 3: 4 or more doctors } | no | ||
Age | Feature | Categorical | Age | The patient's age group = { 1: 50-64 2: 65-80 } | no | |
Physical_Health | Feature | Categorical | A self-assessment of the patient's physical well-being = { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } | no | ||
Mental_Health | Feature | Categorical | A self-evaluation of the patient's mental or psychological health = { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } | no | ||
Dental_Health | Feature | Categorical | A self-assessment of the patient's oral or dental health= { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } | no | ||
Employment | Feature | Categorical | The patient's employment status or work-related information = { -1: Refused 6 1: Working full-time 2: Working part-time 3: Retired 4: Not working at this time } | no | ||
Stress_Keeps_Patient_from_Sleeping | Feature | Categorical | Whether stress affects the patient's ability to sleep = { 0: No 1: Yes } | no | ||
Medication_Keeps_Patient_from_Sleeping | Feature | Categorical | Whether medication impacts the patient's sleep = { 0: No 1: Yes } | no | ||
Pain_Keeps_Patient_from_Sleeping | Feature | Categorical | Whether physical pain disturbs the patient's sleep = { 0: No 1: Yes } | no | ||
Bathroom_Needs_Keeps_Patient_from_Sleeping | Feature | Categorical | Whether the need to use the bathroom affects the patient's sleep = { 0: No 1: Yes } | no |
0 to 10 of 15
Additional Variable Information
Class Labels
Number of Doctors Visited: The total count of different doctors the patient has seen = { 1: 0-1 doctors 2: 2-3 doctors 3: 4 or more doctors } Age: The patient's age group = { 1: 50-64 2: 65-80 } Physical Health: A self-assessment of the patient's physical well-being = { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } Mental Health: A self-evaluation of the patient's mental or psychological health = { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } Dental Health: A self-assessment of the patient's oral or dental health= { -1: Refused 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor } Employment: The patient's employment status or work-related information = { -1: Refused 6 1: Working full-time 2: Working part-time 3: Retired 4: Not working at this time } Stress Keeps Patient from Sleeping: Whether stress affects the patient's ability to sleep = { 0: No 1: Yes } Medication Keeps Patient from Sleeping: Whether medication impacts the patient's sleep = { 0: No 1: Yes } Pain Keeps Patient from Sleeping: Whether physical pain disturbs the patient's sleep = { 0: No 1: Yes } Bathroom Needs Keeps Patient from Sleeping: Whether the need to use the bathroom affects the patient's sleep = { 0: No 1: Yes } Unknown Keeps Patient from Sleeping: Unidentified factors affecting the patient's sleep = { 0: No 1: Yes } Trouble Sleeping: General issues or difficulties the patient faces with sleeping = { 0: No 1: Yes } Prescription Sleep Medication: Information about any sleep medication prescribed to the patient = { -1: Refused 1: Use regularly 2: Use occasionally 3: Do not use } Race: The patient's racial or ethnic background = { -2: Not asked -1: REFUSED 1: White, Non-Hispanic 2: Black, Non-Hispanic 3: Other, Non-Hispanic 4: Hispanic 5: 2+ Races, Non-Hispanic } Gender: The gender identity of the patient = { -2: Not asked -1: REFUSED 1: Male 2: Female }
Dataset Files
File | Size |
---|---|
NPHA-doctor-visits.csv | 21.3 KB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset national_poll_on_healthy_aging_npha = fetch_ucirepo(id=936) # data (as pandas dataframes) X = national_poll_on_healthy_aging_npha.data.features y = national_poll_on_healthy_aging_npha.data.targets # metadata print(national_poll_on_healthy_aging_npha.metadata) # variable information print(national_poll_on_healthy_aging_npha.variables)
National Poll on Healthy Aging (NPHA) [Dataset]. (2017). UCI Machine Learning Repository. https://doi.org/10.3886/ICPSR37305.v1.
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.