Palmer Penguins
Linked on 11/14/2021
An introductory dataset presented as an alternative to Iris and useful for teaching data exploration/visualization. Data comes from 3 penguin species in the islands of Palmer Archipelago, Antarctica.
Dataset Characteristics
Tabular
Subject Area
Biology
Associated Tasks
Classification
Feature Type
-
# Instances
344
# Features
-
Dataset Information
For what purpose was the dataset created?
The data was collected as part of research to study Antarctic penguins’ foraging behavior and its relationship with environmental variability. The canonical paper from Gorman et al. presents the research in detail.
Who funded the creation of the dataset?
The data was collected for research that was conducted as part of the Palmer Station, Antarctica, Long-Term Ecological Research program which was supported by grants through the National Science Foundation, Office of Polar Programs (NSF-OPP). Please see the funding statement in the canonical paper for further details.
What do the instances in this dataset represent?
Antarctic penguins (Genus Pygoscelis)
Are there recommended data splits?
No
Was there any data preprocessing performed?
Data was originally listed as three separate datasets within the Palmer Station Long-Term Ecological Research (PAL-LTER) data system. Each of these separate datasets corresponds to one of the three species of penguin: Adélie (152 penguins), Gentoo (124 penguins), and Chinstrap (68 penguins). They were combined into the single Palmer penguins dataset (152+124+68 = 344 penguins).
Has Missing Values?
No
Introductory Paper
By Kristen B. Gorman, Tony D. Williams, William R. Fraser. 2014
Published in PLoS ONE
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset palmer_penguins = fetch_ucirepo(id=690) # data (as pandas dataframes) X = palmer_penguins.data.features y = palmer_penguins.data.targets # metadata print(palmer_penguins.metadata) # variable information print(palmer_penguins.variables)
Palmer Penguins [Dataset]. (2014). UCI Machine Learning Repository. https://doi.org/10.24432/C5R89W.
Citations/Acknowledgements
If you use this dataset, please follow the acknowledgment policy on the original dataset website.