Corn Yield Prediction With Ensemble CNN-DNN

We investigate the predictive performance of two novel CNN-DNN machine learning ensemble models in predicting county-level corn yields across the US Corn Belt (12 states). The developed data set is a combination of management, environment, and historical corn yields from 1980 to 2019. Two scenarios...

Full description

Bibliographic Details
Main Authors: Mohsen Shahhosseini, Guiping Hu, Saeed Khaki, Sotirios V. Archontoulis
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-08-01
Series:Frontiers in Plant Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpls.2021.709008/full
id doaj-3fccf0d1568144478c709b03b4163d3b
record_format Article
spelling doaj-3fccf0d1568144478c709b03b4163d3b2021-08-02T04:38:41ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2021-08-011210.3389/fpls.2021.709008709008Corn Yield Prediction With Ensemble CNN-DNNMohsen Shahhosseini0Guiping Hu1Saeed Khaki2Sotirios V. Archontoulis3Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United StatesDepartment of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United StatesDepartment of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United StatesDepartment of Agronomy, Iowa State University, Ames, IA, United StatesWe investigate the predictive performance of two novel CNN-DNN machine learning ensemble models in predicting county-level corn yields across the US Corn Belt (12 states). The developed data set is a combination of management, environment, and historical corn yields from 1980 to 2019. Two scenarios for ensemble creation are considered: homogenous and heterogenous ensembles. In homogenous ensembles, the base CNN-DNN models are all the same, but they are generated with a bagging procedure to ensure they exhibit a certain level of diversity. Heterogenous ensembles are created from different base CNN-DNN models which share the same architecture but have different hyperparameters. Three types of ensemble creation methods were used to create several ensembles for either of the scenarios: Basic Ensemble Method (BEM), Generalized Ensemble Method (GEM), and stacked generalized ensembles. Results indicated that both designed ensemble types (heterogenous and homogenous) outperform the ensembles created from five individual ML models (linear regression, LASSO, random forest, XGBoost, and LightGBM). Furthermore, by introducing improvements over the heterogenous ensembles, the homogenous ensembles provide the most accurate yield predictions across US Corn Belt states. This model could make 2019 yield predictions with a root mean square error of 866 kg/ha, equivalent to 8.5% relative root mean square and could successfully explain about 77% of the spatio-temporal variation in the corn grain yields. The significant predictive power of this model can be leveraged for designing a reliable tool for corn yield prediction which will in turn assist agronomic decision makers.https://www.frontiersin.org/articles/10.3389/fpls.2021.709008/fullyield predictionCNN-DNNhomogenous ensembleheterogenous ensembleUS Corn Belt
collection DOAJ
language English
format Article
sources DOAJ
author Mohsen Shahhosseini
Guiping Hu
Saeed Khaki
Sotirios V. Archontoulis
spellingShingle Mohsen Shahhosseini
Guiping Hu
Saeed Khaki
Sotirios V. Archontoulis
Corn Yield Prediction With Ensemble CNN-DNN
Frontiers in Plant Science
yield prediction
CNN-DNN
homogenous ensemble
heterogenous ensemble
US Corn Belt
author_facet Mohsen Shahhosseini
Guiping Hu
Saeed Khaki
Sotirios V. Archontoulis
author_sort Mohsen Shahhosseini
title Corn Yield Prediction With Ensemble CNN-DNN
title_short Corn Yield Prediction With Ensemble CNN-DNN
title_full Corn Yield Prediction With Ensemble CNN-DNN
title_fullStr Corn Yield Prediction With Ensemble CNN-DNN
title_full_unstemmed Corn Yield Prediction With Ensemble CNN-DNN
title_sort corn yield prediction with ensemble cnn-dnn
publisher Frontiers Media S.A.
series Frontiers in Plant Science
issn 1664-462X
publishDate 2021-08-01
description We investigate the predictive performance of two novel CNN-DNN machine learning ensemble models in predicting county-level corn yields across the US Corn Belt (12 states). The developed data set is a combination of management, environment, and historical corn yields from 1980 to 2019. Two scenarios for ensemble creation are considered: homogenous and heterogenous ensembles. In homogenous ensembles, the base CNN-DNN models are all the same, but they are generated with a bagging procedure to ensure they exhibit a certain level of diversity. Heterogenous ensembles are created from different base CNN-DNN models which share the same architecture but have different hyperparameters. Three types of ensemble creation methods were used to create several ensembles for either of the scenarios: Basic Ensemble Method (BEM), Generalized Ensemble Method (GEM), and stacked generalized ensembles. Results indicated that both designed ensemble types (heterogenous and homogenous) outperform the ensembles created from five individual ML models (linear regression, LASSO, random forest, XGBoost, and LightGBM). Furthermore, by introducing improvements over the heterogenous ensembles, the homogenous ensembles provide the most accurate yield predictions across US Corn Belt states. This model could make 2019 yield predictions with a root mean square error of 866 kg/ha, equivalent to 8.5% relative root mean square and could successfully explain about 77% of the spatio-temporal variation in the corn grain yields. The significant predictive power of this model can be leveraged for designing a reliable tool for corn yield prediction which will in turn assist agronomic decision makers.
topic yield prediction
CNN-DNN
homogenous ensemble
heterogenous ensemble
US Corn Belt
url https://www.frontiersin.org/articles/10.3389/fpls.2021.709008/full
work_keys_str_mv AT mohsenshahhosseini cornyieldpredictionwithensemblecnndnn
AT guipinghu cornyieldpredictionwithensemblecnndnn
AT saeedkhaki cornyieldpredictionwithensemblecnndnn
AT sotiriosvarchontoulis cornyieldpredictionwithensemblecnndnn
_version_ 1721242134952017920