Soybean yield modeling using bootstrap methods for small samples

One of the problems that occur when working with regression models is regarding the sample size; once the statistical methods used in inferential analyzes are asymptotic if the sample is small the analysis may be compromised because the estimates will be biased. An alternative is to use the bootstra...

Full description

Bibliographic Details
Main Authors: Gustavo H. Dalposso, Miguel A. Uribe-Opazo, Jerry A. Johann
Format: Article
Language:English
Published: Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria 2016-08-01
Series:Spanish Journal of Agricultural Research
Subjects:
Online Access:http://revistas.inia.es/index.php/sjar/article/view/8635
id doaj-f8c82f59f2e44515a59e317dfb624710
record_format Article
spelling doaj-f8c82f59f2e44515a59e317dfb6247102020-11-25T01:08:04ZengInstituto Nacional de Investigación y Tecnología Agraria y AlimentariaSpanish Journal of Agricultural Research2171-92922016-08-01143e0207e020710.5424/sjar/2016143-86352384Soybean yield modeling using bootstrap methods for small samplesGustavo H. Dalposso0Miguel A. Uribe-Opazo1Jerry A. Johann2Federal Technological University of Paraná (UTFPR), 19 Cristo Rei Street, 85902-490, Toledo, PRWestern Paraná State University (UNIOESTE), 2069 Universitária Street, 85819-110, Cascavel, PRWestern Paraná State University (UNIOESTE), 2069 Universitária Street, 85819-110, Cascavel, PROne of the problems that occur when working with regression models is regarding the sample size; once the statistical methods used in inferential analyzes are asymptotic if the sample is small the analysis may be compromised because the estimates will be biased. An alternative is to use the bootstrap methodology, which in its non-parametric version does not need to guess or know the probability distribution that generated the original sample. In this work we used a set of soybean yield data and physical and chemical soil properties formed with fewer samples to determine a multiple linear regression model. Bootstrap methods were used for variable selection, identification of influential points and for determination of confidence intervals of the model parameters. The results showed that the bootstrap methods enabled us to select the physical and chemical soil properties, which were significant in the construction of the soybean yield regression model, construct the confidence intervals of the parameters and identify the points that had great influence on the estimated parameters.http://revistas.inia.es/index.php/sjar/article/view/8635multiple linear regressionmodel selectionbootstrap global influence diagnosisbootstrap confidence intervals
collection DOAJ
language English
format Article
sources DOAJ
author Gustavo H. Dalposso
Miguel A. Uribe-Opazo
Jerry A. Johann
spellingShingle Gustavo H. Dalposso
Miguel A. Uribe-Opazo
Jerry A. Johann
Soybean yield modeling using bootstrap methods for small samples
Spanish Journal of Agricultural Research
multiple linear regression
model selection
bootstrap global influence diagnosis
bootstrap confidence intervals
author_facet Gustavo H. Dalposso
Miguel A. Uribe-Opazo
Jerry A. Johann
author_sort Gustavo H. Dalposso
title Soybean yield modeling using bootstrap methods for small samples
title_short Soybean yield modeling using bootstrap methods for small samples
title_full Soybean yield modeling using bootstrap methods for small samples
title_fullStr Soybean yield modeling using bootstrap methods for small samples
title_full_unstemmed Soybean yield modeling using bootstrap methods for small samples
title_sort soybean yield modeling using bootstrap methods for small samples
publisher Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria
series Spanish Journal of Agricultural Research
issn 2171-9292
publishDate 2016-08-01
description One of the problems that occur when working with regression models is regarding the sample size; once the statistical methods used in inferential analyzes are asymptotic if the sample is small the analysis may be compromised because the estimates will be biased. An alternative is to use the bootstrap methodology, which in its non-parametric version does not need to guess or know the probability distribution that generated the original sample. In this work we used a set of soybean yield data and physical and chemical soil properties formed with fewer samples to determine a multiple linear regression model. Bootstrap methods were used for variable selection, identification of influential points and for determination of confidence intervals of the model parameters. The results showed that the bootstrap methods enabled us to select the physical and chemical soil properties, which were significant in the construction of the soybean yield regression model, construct the confidence intervals of the parameters and identify the points that had great influence on the estimated parameters.
topic multiple linear regression
model selection
bootstrap global influence diagnosis
bootstrap confidence intervals
url http://revistas.inia.es/index.php/sjar/article/view/8635
work_keys_str_mv AT gustavohdalposso soybeanyieldmodelingusingbootstrapmethodsforsmallsamples
AT miguelauribeopazo soybeanyieldmodelingusingbootstrapmethodsforsmallsamples
AT jerryajohann soybeanyieldmodelingusingbootstrapmethodsforsmallsamples
_version_ 1725184436182450176