Forty Soybean Cultivars from Subsequent Harvests
Donated on 10/28/2023
Soybean is one of the most important crops because it is used in several segments of the food industry. Evaluating soybean cultivars under different planting and harvesting conditions is an ongoing field of research. We present a dataset obtained from forty soybean cultivars planted in subsequent seasons. The experiment used a randomized block design, arranged in a split-plot scheme, with four replications. The following variables were collected: plant height, insertion of the first pod, number of stems, number of legumes per plant, number of grains per plant, number of grains per pod, thousand seed weight, and grain yield, resulting in 320 data samples. The dataset can be used by researchers from different fields.
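The sample count follows from the experimental design. As a quick check, assuming a balanced design in which every cultivar was grown in two seasons with all four replications (the description implies this but does not state it outright), the arithmetic is:

```python
# Balanced-design check. The two seasons are an assumption drawn from the
# Season variable (values 1 or 2); the other factors come from the description.
n_cultivars = 40
n_seasons = 2
n_replications = 4

n_samples = n_cultivars * n_seasons * n_replications
print(n_samples)  # 320
```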
Dataset Characteristics
Tabular
Subject Area
Other
Associated Tasks
Classification, Regression, Clustering, Other
Feature Type
Real, Categorical, Integer
# Instances
320
# Features
11
Dataset Information
For what purpose was the dataset created?
To study soybean cultivars harvested in subsequent seasons.
Who funded the creation of the dataset?
There was no monetary funding, but Accert Pesquisa e Consultoria Agronomia, located in Balsas, Maranhão, Brazil, supported the execution of the experiments.
What do the instances in this dataset represent?
Each instance is the average of 10 plants per plot, measured at harvest (growth stage R8).
Are there recommended data splits?
We recommend stratified cross-validation, arranged so that the same cultivar never appears in both the training and test sets.
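Keeping every row of a cultivar on one side of the split is usually called grouped cross-validation. The following is a minimal pure-Python sketch of that idea, not the authors' procedure; the function name `group_k_fold` and the toy cultivar labels are hypothetical. scikit-learn's `GroupKFold` offers comparable behavior.

```python
from collections import defaultdict

def group_k_fold(groups, n_splits=5):
    """Yield (train_idx, test_idx) so that no group spans both sides."""
    # Collect row indices per group, preserving first-seen order.
    rows_by_group = defaultdict(list)
    for idx, g in enumerate(groups):
        rows_by_group[g].append(idx)
    unique_groups = list(rows_by_group)
    if n_splits > len(unique_groups):
        raise ValueError("n_splits cannot exceed the number of groups")
    # Deal the groups round-robin into folds.
    folds = [unique_groups[i::n_splits] for i in range(n_splits)]
    for held_out in folds:
        held = set(held_out)
        test_idx = sorted(i for g in held_out for i in rows_by_group[g])
        train_idx = [i for i, g in enumerate(groups) if g not in held]
        yield train_idx, test_idx

# Toy labels (hypothetical): 4 cultivars with 4 rows each.
cultivars = [f"cultivar_{c}" for c in range(4) for _ in range(4)]
splits = list(group_k_fold(cultivars, n_splits=2))
for train_idx, test_idx in splits:
    train_groups = {cultivars[i] for i in train_idx}
    test_groups = {cultivars[i] for i in test_idx}
    # No cultivar may appear on both sides of a split.
    assert train_groups.isdisjoint(test_groups)
```

The round-robin assignment is deterministic; shuffling the group order before dealing would be a reasonable addition when the cultivar order in the file is not arbitrary.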
Does the dataset contain data that might be considered sensitive in any way?
No data is confidential.
Was there any data preprocessing performed?
The data presented are raw.
Additional Information
More details about the dataset can be found in the published article: https://editorapantanal.com.br/journal/index.php/taes/article/view/8/5
Has Missing Values?
No
Introductory Paper
By Bruno Rodrigues de Oliveira, Alan Mario Zuffo, Francisco Charles dos Santos Silva, Ricardo Mezzomo, Leandra Matos Barrozo, Tatiane Scilewski da Costa Zanatta, Joel Cabral dos Santos, Carlos Henrique Conceição Sousa, Yago Pinto Coelho. 2023
Published in Trends in Agricultural and Environmental Sciences
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
Season | Feature | Integer | 1 or 2 | | no |
Cultivar | Feature | Categorical | Cultivar names | | no |
Repetition | Feature | Integer | 1, 2, 3 or 4 | | no |
PH | Feature | Continuous | Plant height, measured from the soil surface to the insertion of the last leaf using a millimeter ruler | cm | no |
IFP | Feature | Continuous | Insertion of the first pod, measured from the soil surface to the insertion of the first pod | cm | no |
NLP | Feature | Continuous | Number of legumes per plant, by manual counting | unit | no |
NGP | Feature | Continuous | Number of grains per plant, by manual counting | unit | no |
NGL | Feature | Continuous | Number of grains per pod, by manual counting | unit | no |
NS | Feature | Continuous | Number of stems, by manual counting | unit | no |
MHG | Feature | Continuous | Thousand seed weight, according to the methodology described in Brasil (2009) | g | no |
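The value ranges listed in the table can be turned into a simple sanity check on loaded rows. This sketch assumes each row is available as a dictionary keyed by the variable names above; the helper `check_row` and the sample values are hypothetical and are not taken from data.csv.

```python
ALLOWED_SEASONS = {1, 2}
ALLOWED_REPETITIONS = {1, 2, 3, 4}
MEASUREMENTS = ("PH", "IFP", "NLP", "NGP", "NGL", "NS", "MHG")

def check_row(row):
    """Return a list of problems found in one record (empty if none)."""
    problems = []
    if row.get("Season") not in ALLOWED_SEASONS:
        problems.append("Season must be 1 or 2")
    if row.get("Repetition") not in ALLOWED_REPETITIONS:
        problems.append("Repetition must be 1, 2, 3 or 4")
    if not row.get("Cultivar"):
        problems.append("Cultivar name is missing")
    for name in MEASUREMENTS:
        value = row.get(name)
        # Heights, counts and weights cannot be negative or absent.
        if not isinstance(value, (int, float)) or value < 0:
            problems.append(f"{name} must be a non-negative number")
    return problems

# Made-up record for illustration only.
example = {"Season": 1, "Cultivar": "example-cultivar", "Repetition": 3,
           "PH": 60.0, "IFP": 15.0, "NLP": 50.0, "NGP": 120.0,
           "NGL": 2.4, "NS": 4.0, "MHG": 150.0}
print(check_row(example))  # []
```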
Dataset Files
File | Size |
---|---|
data.csv | 22.2 KB |
cultivars-description.ods | 4.6 KB |
```shell
pip install ucimlrepo
```

```python
from ucimlrepo import fetch_ucirepo

# fetch dataset
forty_soybean_cultivars_from_subsequent_harvests = fetch_ucirepo(id=913)

# data (as pandas dataframes)
X = forty_soybean_cultivars_from_subsequent_harvests.data.features
y = forty_soybean_cultivars_from_subsequent_harvests.data.targets

# metadata
print(forty_soybean_cultivars_from_subsequent_harvests.metadata)

# variable information
print(forty_soybean_cultivars_from_subsequent_harvests.variables)
```
Rodrigues de Oliveira, B. & Mario Zuffo, A. (2023). Forty Soybean Cultivars from Subsequent Harvests [Dataset]. UCI Machine Learning Repository. https://doi.org/10.46420/TAES.e230005.
Creators
Bruno Rodrigues de Oliveira
bruno@editorapantanal.com.br
Editora Pantanal
Alan Mario Zuffo
State University of Maranhão
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.