Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models

Pre-planting factors have been associated with the late-season severity of Stagonospora nodorum blotch (SNB), caused by the fungal pathogen Parastagonospora nodorum, in winter wheat (Triticum aestivum). The relative importance of these factors in the risk of SNB has not been determined and this know...

Full description

Bibliographic Details
Main Authors: Lucky eMehra, Christina eCowger, Kevin eGross, Peter eOjiambo
Format: Article
Language:English
Published: Frontiers Media S.A. 2016-03-01
Series:Frontiers in Plant Science
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fpls.2016.00390/full
id doaj-6ee9b8ecf8aa429bbb3bc70b124d7c9b
record_format Article
spelling doaj-6ee9b8ecf8aa429bbb3bc70b124d7c9b2020-11-24T21:07:23ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2016-03-01710.3389/fpls.2016.00390181436Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning ModelsLucky eMehra0Christina eCowger1Kevin eGross2Peter eOjiambo3North Carolina State UniversityNorth Carolina State UniversityNorth Carolina State UniversityNorth Carolina State UniversityPre-planting factors have been associated with the late-season severity of Stagonospora nodorum blotch (SNB), caused by the fungal pathogen Parastagonospora nodorum, in winter wheat (Triticum aestivum). The relative importance of these factors in the risk of SNB has not been determined and this knowledge can facilitate disease management decisions prior to planting of the wheat crop. In this study, we examined the performance of multiple regression (MR) and three machine learning algorithms namely artificial neural networks, categorical and regression trees, and random forests (RF) in predicting the pre-planting risk of SNB in wheat. Pre-planting factors tested as potential predictor variables were cultivar resistance, latitude, longitude, previous crop, seeding rate, seed treatment, tillage type, and wheat residue. Disease severity assessed at the end of the growing season was used as the response variable. The models were developed using 431 disease cases (unique combinations of predictors) collected from 2012 to 2014 and these cases were randomly divided into training, validation, and test datasets. Models were evaluated based on the regression of observed against predicted severity values of SNB, sensitivity-specificity ROC analysis, and the Kappa statistic. A strong relationship was observed between late-season severity of SNB and specific pre-planting factors in which latitude, longitude, wheat residue, and cultivar resistance were the most important predictors. The MR model explained 33% of variability in the data, while machine learning models explained 47 to 79% of the total variability. Similarly, the MR model correctly classified 74% of the disease cases, while machine learning models correctly classified 81 to 83% of these cases. Results show that the RF algorithm, which explained 79% of the variability within the data, was the most accurate in predicting the risk of SNB, with an accuracy rate of 93%. The RF algorithm could allow early assessment of the risk of SNB, facilitating sound disease management decisions prior to planting of wheat.http://journal.frontiersin.org/Journal/10.3389/fpls.2016.00390/fullmachine learningwheatrandom forestvariable importanceDisease riskStagonospora nodorum blotch
collection DOAJ
language English
format Article
sources DOAJ
author Lucky eMehra
Christina eCowger
Kevin eGross
Peter eOjiambo
spellingShingle Lucky eMehra
Christina eCowger
Kevin eGross
Peter eOjiambo
Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
Frontiers in Plant Science
machine learning
wheat
random forest
variable importance
Disease risk
Stagonospora nodorum blotch
author_facet Lucky eMehra
Christina eCowger
Kevin eGross
Peter eOjiambo
author_sort Lucky eMehra
title Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
title_short Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
title_full Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
title_fullStr Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
title_full_unstemmed Predicting Pre-planting Risk of Stagonospora nodorum blotch in Winter Wheat Using Machine Learning Models
title_sort predicting pre-planting risk of stagonospora nodorum blotch in winter wheat using machine learning models
publisher Frontiers Media S.A.
series Frontiers in Plant Science
issn 1664-462X
publishDate 2016-03-01
description Pre-planting factors have been associated with the late-season severity of Stagonospora nodorum blotch (SNB), caused by the fungal pathogen Parastagonospora nodorum, in winter wheat (Triticum aestivum). The relative importance of these factors in the risk of SNB has not been determined and this knowledge can facilitate disease management decisions prior to planting of the wheat crop. In this study, we examined the performance of multiple regression (MR) and three machine learning algorithms namely artificial neural networks, categorical and regression trees, and random forests (RF) in predicting the pre-planting risk of SNB in wheat. Pre-planting factors tested as potential predictor variables were cultivar resistance, latitude, longitude, previous crop, seeding rate, seed treatment, tillage type, and wheat residue. Disease severity assessed at the end of the growing season was used as the response variable. The models were developed using 431 disease cases (unique combinations of predictors) collected from 2012 to 2014 and these cases were randomly divided into training, validation, and test datasets. Models were evaluated based on the regression of observed against predicted severity values of SNB, sensitivity-specificity ROC analysis, and the Kappa statistic. A strong relationship was observed between late-season severity of SNB and specific pre-planting factors in which latitude, longitude, wheat residue, and cultivar resistance were the most important predictors. The MR model explained 33% of variability in the data, while machine learning models explained 47 to 79% of the total variability. Similarly, the MR model correctly classified 74% of the disease cases, while machine learning models correctly classified 81 to 83% of these cases. Results show that the RF algorithm, which explained 79% of the variability within the data, was the most accurate in predicting the risk of SNB, with an accuracy rate of 93%. The RF algorithm could allow early assessment of the risk of SNB, facilitating sound disease management decisions prior to planting of wheat.
topic machine learning
wheat
random forest
variable importance
Disease risk
Stagonospora nodorum blotch
url http://journal.frontiersin.org/Journal/10.3389/fpls.2016.00390/full
work_keys_str_mv AT luckyemehra predictingpreplantingriskofstagonosporanodorumblotchinwinterwheatusingmachinelearningmodels
AT christinaecowger predictingpreplantingriskofstagonosporanodorumblotchinwinterwheatusingmachinelearningmodels
AT kevinegross predictingpreplantingriskofstagonosporanodorumblotchinwinterwheatusingmachinelearningmodels
AT petereojiambo predictingpreplantingriskofstagonosporanodorumblotchinwinterwheatusingmachinelearningmodels
_version_ 1716763074767093760