Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.

The performance of three machine learning methods (support vector regression, random forests and artificial neural network) for estimating the LAI of paddy rice was evaluated in this study. Traditional univariate regression models involving narrowband NDVI with optimized band combinations as well as...

Full description

Bibliographic Details
Main Authors: Li Wang, Qingrui Chang, Jing Yang, Xiaohua Zhang, Fenling Li
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0207624
id doaj-71605f7d071144b995afdb5165cccffd
record_format Article
spelling doaj-71605f7d071144b995afdb5165cccffd2021-03-03T21:04:14ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011312e020762410.1371/journal.pone.0207624Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.Li WangQingrui ChangJing YangXiaohua ZhangFenling LiThe performance of three machine learning methods (support vector regression, random forests and artificial neural network) for estimating the LAI of paddy rice was evaluated in this study. Traditional univariate regression models involving narrowband NDVI with optimized band combinations as well as linear multivariate calibration partial least squares regression models were also evaluated for comparison. A four year field-collected dataset was used to test the robustness of LAI estimation models against temporal variation. The partial least squares regression and three machine learning methods were built on the raw hyperspectral reflectance and the first derivative separately. Two different rules were used to determine the models' key parameters. The results showed that the combination of the red edge and NIR bands (766 nm and 830 nm) as well as the combination of SWIR bands (1114 nm and 1190 nm) were optimal for producing the narrowband NDVI. The models built on the first derivative spectra yielded more accurate results than the corresponding models built on the raw spectra. Properly selected model parameters resulted in comparable accuracy and robustness with the empirical optimal parameter and significantly reduced the model complexity. The machine learning methods were more accurate and robust than the VI methods and partial least squares regression. When validating the calibrated models against the standalone validation dataset, the VI method yielded a validation RMSE value of 1.17 for NDVI(766,830) and 1.01 for NDVI(1114,1190), while the best models for the partial least squares, support vector machine and artificial neural network methods yielded validation RMSE values of 0.84, 0.82, 0.67 and 0.84, respectively. The RF models built on the first derivative spectra with mtry = 10 showed the highest potential for estimating the LAI of paddy rice.https://doi.org/10.1371/journal.pone.0207624
collection DOAJ
language English
format Article
sources DOAJ
author Li Wang
Qingrui Chang
Jing Yang
Xiaohua Zhang
Fenling Li
spellingShingle Li Wang
Qingrui Chang
Jing Yang
Xiaohua Zhang
Fenling Li
Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
PLoS ONE
author_facet Li Wang
Qingrui Chang
Jing Yang
Xiaohua Zhang
Fenling Li
author_sort Li Wang
title Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
title_short Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
title_full Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
title_fullStr Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
title_full_unstemmed Estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
title_sort estimation of paddy rice leaf area index using machine learning methods based on hyperspectral data from multi-year experiments.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description The performance of three machine learning methods (support vector regression, random forests and artificial neural network) for estimating the LAI of paddy rice was evaluated in this study. Traditional univariate regression models involving narrowband NDVI with optimized band combinations as well as linear multivariate calibration partial least squares regression models were also evaluated for comparison. A four year field-collected dataset was used to test the robustness of LAI estimation models against temporal variation. The partial least squares regression and three machine learning methods were built on the raw hyperspectral reflectance and the first derivative separately. Two different rules were used to determine the models' key parameters. The results showed that the combination of the red edge and NIR bands (766 nm and 830 nm) as well as the combination of SWIR bands (1114 nm and 1190 nm) were optimal for producing the narrowband NDVI. The models built on the first derivative spectra yielded more accurate results than the corresponding models built on the raw spectra. Properly selected model parameters resulted in comparable accuracy and robustness with the empirical optimal parameter and significantly reduced the model complexity. The machine learning methods were more accurate and robust than the VI methods and partial least squares regression. When validating the calibrated models against the standalone validation dataset, the VI method yielded a validation RMSE value of 1.17 for NDVI(766,830) and 1.01 for NDVI(1114,1190), while the best models for the partial least squares, support vector machine and artificial neural network methods yielded validation RMSE values of 0.84, 0.82, 0.67 and 0.84, respectively. The RF models built on the first derivative spectra with mtry = 10 showed the highest potential for estimating the LAI of paddy rice.
url https://doi.org/10.1371/journal.pone.0207624
work_keys_str_mv AT liwang estimationofpaddyriceleafareaindexusingmachinelearningmethodsbasedonhyperspectraldatafrommultiyearexperiments
AT qingruichang estimationofpaddyriceleafareaindexusingmachinelearningmethodsbasedonhyperspectraldatafrommultiyearexperiments
AT jingyang estimationofpaddyriceleafareaindexusingmachinelearningmethodsbasedonhyperspectraldatafrommultiyearexperiments
AT xiaohuazhang estimationofpaddyriceleafareaindexusingmachinelearningmethodsbasedonhyperspectraldatafrommultiyearexperiments
AT fenlingli estimationofpaddyriceleafareaindexusingmachinelearningmethodsbasedonhyperspectraldatafrommultiyearexperiments
_version_ 1714818914869837824