Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems

Five machine learning (ML) algorithms were employed for gap-filling surface fluxes of CO<sub>2</sub>, water vapor, and sensible heat above three different ecosystems: grassland, rice paddy field, and forest. The performance and limitations of these ML models, which are support vector mac...

Full description

Bibliographic Details
Main Authors: I-Hang Huang, Cheng-I Hsieh
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Water
Subjects:
Online Access:https://www.mdpi.com/2073-4441/12/12/3415
id doaj-bc55442ecfe447d79e1d11d3975c901f
record_format Article
spelling doaj-bc55442ecfe447d79e1d11d3975c901f2020-12-05T00:03:51ZengMDPI AGWater2073-44412020-12-01123415341510.3390/w12123415Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various EcosystemsI-Hang Huang0Cheng-I Hsieh1Department of Bioenvironmental Systems Engineering, National Taiwan University, Taipei 10617, TaiwanDepartment of Bioenvironmental Systems Engineering, National Taiwan University, Taipei 10617, TaiwanFive machine learning (ML) algorithms were employed for gap-filling surface fluxes of CO<sub>2</sub>, water vapor, and sensible heat above three different ecosystems: grassland, rice paddy field, and forest. The performance and limitations of these ML models, which are support vector machine, random forest, multi-layer perception, deep neural network, and long short-term memory, were investigated. Firstly, the accuracy of gap-filling to time and hysteresis input factors of ML algorithms for different ecosystems is discussed. Secondly, the optimal ML model selected in the first stage is compared with the classic method—the Penman–Monteith (P–M) equation for water vapor flux gap-filling. Thirdly, with different gap lengths (from one hour to one week), we explored the data length required for an ML model to perform the optimal gap-filling. Our results demonstrate the following: (1) for ecosystems with a strong hysteresis between surface fluxes and net radiation, adding proceeding meteorological data into the model inputs could improve the model performance; (2) the five ML models gave similar gap-filling performance; (3) for gap-filling water vapor flux, the ML model is better than the P–M equation; and (4) for a gap with length of half day, one day, or one week, an ML model with training data length greater than 1300 h would provide a better gap-filling accuracy.https://www.mdpi.com/2073-4441/12/12/3415flux gap-fillingmachine learningPenman–Monteith equationartificial neural network
collection DOAJ
language English
format Article
sources DOAJ
author I-Hang Huang
Cheng-I Hsieh
spellingShingle I-Hang Huang
Cheng-I Hsieh
Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
Water
flux gap-filling
machine learning
Penman–Monteith equation
artificial neural network
author_facet I-Hang Huang
Cheng-I Hsieh
author_sort I-Hang Huang
title Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
title_short Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
title_full Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
title_fullStr Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
title_full_unstemmed Gap-Filling of Surface Fluxes Using Machine Learning Algorithms in Various Ecosystems
title_sort gap-filling of surface fluxes using machine learning algorithms in various ecosystems
publisher MDPI AG
series Water
issn 2073-4441
publishDate 2020-12-01
description Five machine learning (ML) algorithms were employed for gap-filling surface fluxes of CO<sub>2</sub>, water vapor, and sensible heat above three different ecosystems: grassland, rice paddy field, and forest. The performance and limitations of these ML models, which are support vector machine, random forest, multi-layer perception, deep neural network, and long short-term memory, were investigated. Firstly, the accuracy of gap-filling to time and hysteresis input factors of ML algorithms for different ecosystems is discussed. Secondly, the optimal ML model selected in the first stage is compared with the classic method—the Penman–Monteith (P–M) equation for water vapor flux gap-filling. Thirdly, with different gap lengths (from one hour to one week), we explored the data length required for an ML model to perform the optimal gap-filling. Our results demonstrate the following: (1) for ecosystems with a strong hysteresis between surface fluxes and net radiation, adding proceeding meteorological data into the model inputs could improve the model performance; (2) the five ML models gave similar gap-filling performance; (3) for gap-filling water vapor flux, the ML model is better than the P–M equation; and (4) for a gap with length of half day, one day, or one week, an ML model with training data length greater than 1300 h would provide a better gap-filling accuracy.
topic flux gap-filling
machine learning
Penman–Monteith equation
artificial neural network
url https://www.mdpi.com/2073-4441/12/12/3415
work_keys_str_mv AT ihanghuang gapfillingofsurfacefluxesusingmachinelearningalgorithmsinvariousecosystems
AT chengihsieh gapfillingofsurfacefluxesusingmachinelearningalgorithmsinvariousecosystems
_version_ 1724400230873432064