Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data

Remote sensing (RS) of biophysical variables plays a vital role in providing the information necessary for understanding spatio-temporal dynamics in ecosystems. The <i>hybrid</i> approach to retrieve biophysical variables from RS by combining Machine Learning (ML) algorithms with surroga...

Full description

Bibliographic Details
Main Authors: Nuno César de Sá, Mitra Baratchi, Leon T. Hauser, Peter van Bodegom
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/4/648
id doaj-6d5f3710a9ec421ca8de8d6a6fda4c8b
record_format Article
spelling doaj-6d5f3710a9ec421ca8de8d6a6fda4c8b2021-02-12T00:00:20ZengMDPI AGRemote Sensing2072-42922021-02-011364864810.3390/rs13040648Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 DataNuno César de Sá0Mitra Baratchi1Leon T. Hauser2Peter van Bodegom3Institute of Environmental Sciences (CML), Leiden University, P. O. Box 9518, 2300 RA Leiden, The NetherlandsLeiden Institute of Advanced Computer Science (LIACS), Leiden University, P.O. Box 9512, 2300 RA Leiden, The NetherlandsInstitute of Environmental Sciences (CML), Leiden University, P. O. Box 9518, 2300 RA Leiden, The NetherlandsInstitute of Environmental Sciences (CML), Leiden University, P. O. Box 9518, 2300 RA Leiden, The NetherlandsRemote sensing (RS) of biophysical variables plays a vital role in providing the information necessary for understanding spatio-temporal dynamics in ecosystems. The <i>hybrid</i> approach to retrieve biophysical variables from RS by combining Machine Learning (ML) algorithms with surrogate data generated by Radiative Transfer Models (RTM). The susceptibility of the ill-posed solutions to noise currently constrains further application of hybrid approaches. Here, we explored how noise affects the performance of ML algorithms for biophysical trait retrieval. We focused on synthetic Sentinel-2 (S2) data generated using the PROSAIL RTM and four commonly applied ML algorithms: Gaussian Processes (GPR), Random Forests (RFR), and Artificial Neural Networks (ANN) and Multi-task Neural Networks (MTN). After identifying which biophysical variables can be retrieved from S2 using a Global Sensitivity Analysis, we evaluated the performance loss of each algorithm using the Mean Absolute Percentage Error (MAPE) with increasing noise levels. We found that, for S2 data, Carotenoid concentrations are uniquely dependent on band 2, Chlorophyll is almost exclusively dependent on the visible ranges, and Leaf Area Index, water, and dry matter contents are mostly dependent on infrared bands. Without added noise, GPR was the best algorithm (<0.05%), followed by the MTN (<3%) and ANN (<5%), with the RFR performing very poorly (<50%). The addition of noise critically affected the performance of all algorithms (>20%) even at low levels of added noise (≈5%). Overall, both neural networks performed significantly better than GPR and RFR when noise was added with the MTN being slightly better when compared to the ANN. Our results imply that the performance of the commonly used algorithms in <i>hybrid-</i>RTM inversion are pervasively sensitive to noise. The implication is that more advanced models or approaches are necessary to minimize the impact of noise to improve near real-time and accurate RS monitoring of biophysical trait retrieval.https://www.mdpi.com/2072-4292/13/4/648radiative transfer modelsPROSAILsensitivity analysisinversionbiophysical variablesmachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Nuno César de Sá
Mitra Baratchi
Leon T. Hauser
Peter van Bodegom
spellingShingle Nuno César de Sá
Mitra Baratchi
Leon T. Hauser
Peter van Bodegom
Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
Remote Sensing
radiative transfer models
PROSAIL
sensitivity analysis
inversion
biophysical variables
machine learning
author_facet Nuno César de Sá
Mitra Baratchi
Leon T. Hauser
Peter van Bodegom
author_sort Nuno César de Sá
title Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
title_short Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
title_full Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
title_fullStr Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
title_full_unstemmed Exploring the Impact of Noise on Hybrid Inversion of PROSAIL RTM on Sentinel-2 Data
title_sort exploring the impact of noise on hybrid inversion of prosail rtm on sentinel-2 data
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2021-02-01
description Remote sensing (RS) of biophysical variables plays a vital role in providing the information necessary for understanding spatio-temporal dynamics in ecosystems. The <i>hybrid</i> approach to retrieve biophysical variables from RS by combining Machine Learning (ML) algorithms with surrogate data generated by Radiative Transfer Models (RTM). The susceptibility of the ill-posed solutions to noise currently constrains further application of hybrid approaches. Here, we explored how noise affects the performance of ML algorithms for biophysical trait retrieval. We focused on synthetic Sentinel-2 (S2) data generated using the PROSAIL RTM and four commonly applied ML algorithms: Gaussian Processes (GPR), Random Forests (RFR), and Artificial Neural Networks (ANN) and Multi-task Neural Networks (MTN). After identifying which biophysical variables can be retrieved from S2 using a Global Sensitivity Analysis, we evaluated the performance loss of each algorithm using the Mean Absolute Percentage Error (MAPE) with increasing noise levels. We found that, for S2 data, Carotenoid concentrations are uniquely dependent on band 2, Chlorophyll is almost exclusively dependent on the visible ranges, and Leaf Area Index, water, and dry matter contents are mostly dependent on infrared bands. Without added noise, GPR was the best algorithm (<0.05%), followed by the MTN (<3%) and ANN (<5%), with the RFR performing very poorly (<50%). The addition of noise critically affected the performance of all algorithms (>20%) even at low levels of added noise (≈5%). Overall, both neural networks performed significantly better than GPR and RFR when noise was added with the MTN being slightly better when compared to the ANN. Our results imply that the performance of the commonly used algorithms in <i>hybrid-</i>RTM inversion are pervasively sensitive to noise. The implication is that more advanced models or approaches are necessary to minimize the impact of noise to improve near real-time and accurate RS monitoring of biophysical trait retrieval.
topic radiative transfer models
PROSAIL
sensitivity analysis
inversion
biophysical variables
machine learning
url https://www.mdpi.com/2072-4292/13/4/648
work_keys_str_mv AT nunocesardesa exploringtheimpactofnoiseonhybridinversionofprosailrtmonsentinel2data
AT mitrabaratchi exploringtheimpactofnoiseonhybridinversionofprosailrtmonsentinel2data
AT leonthauser exploringtheimpactofnoiseonhybridinversionofprosailrtmonsentinel2data
AT petervanbodegom exploringtheimpactofnoiseonhybridinversionofprosailrtmonsentinel2data
_version_ 1724274023329693696