Machine learning techniques in disease forecasting: a case study on rice blast prediction

<p>Abstract</p> <p>Background</p> <p>Diverse modeling approaches <it>viz</it>. neural networks and multiple regression have been followed to date for disease prediction in plant populations. However, due to their inability to predict value of unknown data po...

Full description

Bibliographic Details
Main Authors: Kapoor Amar S, Kaundal Rakesh, Raghava Gajendra PS
Format: Article
Language:English
Published: BMC 2006-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/485
id doaj-07af0358c8704213b4f257b461722108
record_format Article
spelling doaj-07af0358c8704213b4f257b4617221082020-11-24T23:24:36ZengBMCBMC Bioinformatics1471-21052006-11-017148510.1186/1471-2105-7-485Machine learning techniques in disease forecasting: a case study on rice blast predictionKapoor Amar SKaundal RakeshRaghava Gajendra PS<p>Abstract</p> <p>Background</p> <p>Diverse modeling approaches <it>viz</it>. neural networks and multiple regression have been followed to date for disease prediction in plant populations. However, due to their inability to predict value of unknown data points and longer training times, there is need for exploiting new prediction softwares for better understanding of plant-pathogen-environment relationships. Further, there is no online tool available which can help the plant researchers or farmers in timely application of control measures. This paper introduces a new prediction approach based on support vector machines for developing weather-based prediction models of plant diseases.</p> <p>Results</p> <p>Six significant weather variables were selected as predictor variables. Two series of models (cross-location and cross-year) were developed and validated using a five-fold cross validation procedure. For cross-year models, the conventional multiple regression (REG) approach achieved an average correlation coefficient (<it>r</it>) of 0.50, which increased to 0.60 and percent mean absolute error (%MAE) decreased from 65.42 to 52.24 when back-propagation neural network (BPNN) was used. With generalized regression neural network (GRNN), the <it>r </it>increased to 0.70 and %MAE also improved to 46.30, which further increased to <it>r </it>= 0.77 and %MAE = 36.66 when support vector machine (SVM) based method was used. Similarly, cross-location validation achieved <it>r </it>= 0.48, 0.56 and 0.66 using REG, BPNN and GRNN respectively, with their corresponding %MAE as 77.54, 66.11 and 58.26. The SVM-based method outperformed all the three approaches by further increasing <it>r </it>to 0.74 with improvement in %MAE to 44.12. Overall, this SVM-based prediction approach will open new vistas in the area of forecasting plant diseases of various crops.</p> <p>Conclusion</p> <p>Our case study demonstrated that SVM is better than existing machine learning techniques and conventional REG approaches in forecasting plant diseases. In this direction, we have also developed a SVM-based web server for rice blast prediction, a first of its kind worldwide, which can help the plant science community and farmers in their decision making process. The server is freely available at <url>http://www.imtech.res.in/raghava/rbpred/</url>.</p> http://www.biomedcentral.com/1471-2105/7/485
collection DOAJ
language English
format Article
sources DOAJ
author Kapoor Amar S
Kaundal Rakesh
Raghava Gajendra PS
spellingShingle Kapoor Amar S
Kaundal Rakesh
Raghava Gajendra PS
Machine learning techniques in disease forecasting: a case study on rice blast prediction
BMC Bioinformatics
author_facet Kapoor Amar S
Kaundal Rakesh
Raghava Gajendra PS
author_sort Kapoor Amar S
title Machine learning techniques in disease forecasting: a case study on rice blast prediction
title_short Machine learning techniques in disease forecasting: a case study on rice blast prediction
title_full Machine learning techniques in disease forecasting: a case study on rice blast prediction
title_fullStr Machine learning techniques in disease forecasting: a case study on rice blast prediction
title_full_unstemmed Machine learning techniques in disease forecasting: a case study on rice blast prediction
title_sort machine learning techniques in disease forecasting: a case study on rice blast prediction
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2006-11-01
description <p>Abstract</p> <p>Background</p> <p>Diverse modeling approaches <it>viz</it>. neural networks and multiple regression have been followed to date for disease prediction in plant populations. However, due to their inability to predict value of unknown data points and longer training times, there is need for exploiting new prediction softwares for better understanding of plant-pathogen-environment relationships. Further, there is no online tool available which can help the plant researchers or farmers in timely application of control measures. This paper introduces a new prediction approach based on support vector machines for developing weather-based prediction models of plant diseases.</p> <p>Results</p> <p>Six significant weather variables were selected as predictor variables. Two series of models (cross-location and cross-year) were developed and validated using a five-fold cross validation procedure. For cross-year models, the conventional multiple regression (REG) approach achieved an average correlation coefficient (<it>r</it>) of 0.50, which increased to 0.60 and percent mean absolute error (%MAE) decreased from 65.42 to 52.24 when back-propagation neural network (BPNN) was used. With generalized regression neural network (GRNN), the <it>r </it>increased to 0.70 and %MAE also improved to 46.30, which further increased to <it>r </it>= 0.77 and %MAE = 36.66 when support vector machine (SVM) based method was used. Similarly, cross-location validation achieved <it>r </it>= 0.48, 0.56 and 0.66 using REG, BPNN and GRNN respectively, with their corresponding %MAE as 77.54, 66.11 and 58.26. The SVM-based method outperformed all the three approaches by further increasing <it>r </it>to 0.74 with improvement in %MAE to 44.12. Overall, this SVM-based prediction approach will open new vistas in the area of forecasting plant diseases of various crops.</p> <p>Conclusion</p> <p>Our case study demonstrated that SVM is better than existing machine learning techniques and conventional REG approaches in forecasting plant diseases. In this direction, we have also developed a SVM-based web server for rice blast prediction, a first of its kind worldwide, which can help the plant science community and farmers in their decision making process. The server is freely available at <url>http://www.imtech.res.in/raghava/rbpred/</url>.</p>
url http://www.biomedcentral.com/1471-2105/7/485
work_keys_str_mv AT kapooramars machinelearningtechniquesindiseaseforecastingacasestudyonriceblastprediction
AT kaundalrakesh machinelearningtechniquesindiseaseforecastingacasestudyonriceblastprediction
AT raghavagajendraps machinelearningtechniquesindiseaseforecastingacasestudyonriceblastprediction
_version_ 1725559873520795648