Plants
Donated on 12/30/2008
The data was extracted from the USDA PLANTS database. It covers all plants (species and genera) in the database and the states and provinces of the USA and Canada where they occur.
Dataset Characteristics
Multivariate
Subject Area
Biology
Associated Tasks
Clustering
Feature Type
Categorical
# Instances
34781
# Features
70
Dataset Information
Additional Information
The data is in transactional form. Each record contains a Latin name (species or genus) and the state abbreviations where it occurs.
Has Missing Values?
Yes
Variable Information
Each row contains a Latin name (species or genus) and a list of state abbreviations.
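The row layout described above can be turned into a 0/1 indicator table, which is the usual input for clustering. The following is a minimal sketch, not the dataset's official loader. It assumes the fields on each row are comma-separated, with the Latin name first; the separator, the helper names `parse_transactions` and `to_indicator_rows`, and the sample rows are all illustrative assumptions.

```python
from typing import Dict, Iterable, List, Set, Tuple


def parse_transactions(lines: Iterable[str], sep: str = ",") -> Dict[str, Set[str]]:
    """Map each Latin name to the set of state abbreviations on its row.

    Assumption: fields are separated by `sep` and the name comes first.
    """
    records: Dict[str, Set[str]] = {}
    for raw in lines:
        fields = [f.strip() for f in raw.strip().split(sep)]
        if not fields[0]:
            continue  # skip blank lines
        name = fields[0]
        states = {s for s in fields[1:] if s}
        # Merge in case the same name appears on more than one row.
        records.setdefault(name, set()).update(states)
    return records


def to_indicator_rows(
    records: Dict[str, Set[str]],
) -> Tuple[List[str], Dict[str, List[int]]]:
    """Build one 0/1 column per abbreviation seen in the records."""
    columns = sorted(set().union(*records.values()))
    rows = {
        name: [1 if col in states else 0 for col in columns]
        for name, states in records.items()
    }
    return columns, rows


# Hypothetical sample rows, for illustration only.
sample = ["abelia,fl,nc", "abies,ak,az,ca", "abies alba,nc"]
records = parse_transactions(sample)
columns, rows = to_indicator_rows(records)
```

To apply this to the real file, pass an open file handle for `plants.data` instead of `sample`, after checking the actual separator and text encoding in `plants.names`.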
Dataset Files
File | Size |
---|---|
plants.data | 1.6 MB |
plants.names | 1.8 KB |
stateabbr.txt | 1004 Bytes |
Papers Citing this Dataset
By Rares-Darius Buhai, Andrej Risteski, Yoni Halpern, David Sontag. 2019
Published in ArXiv.
By Michael Mampaey, Jilles Vreeken, Nikolaj Tatti. 2019
Published in ArXiv.
By Yanmin Sun, Mohamed Kamel, Andrew Wong, Yang Wang. 2007
Published in Pattern Recognition.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo

# fetch dataset
plants = fetch_ucirepo(id=180)

# data (as pandas dataframes)
X = plants.data.features
y = plants.data.targets

# metadata
print(plants.metadata)

# variable information
print(plants.variables)
Hämäläinen, W. (2008). Plants [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5HS40.
Creators
W. Hämäläinen
DOI
10.24432/C5HS40
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows sharing and adaptation of the dataset for any purpose, provided that appropriate credit is given.