Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction

The establishment of reliable water level prediction models is vital for urban flood control and planning. In this paper, we develop hybrid models (GA-XGBoost and DE-XGBoost) that couple two evolutionary models, a genetic algorithm (GA) and a differential evolution (DE) algorithm, with the extreme g...

Full description

Bibliographic Details
Main Authors: Duc Hai Nguyen, Xuan Hien Le, Jae-Yeong Heo, Deg-Hyo Bae
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9531631/
id doaj-87f19dcee7f84ee7a949288427afc324
record_format Article
spelling doaj-87f19dcee7f84ee7a949288427afc3242021-09-16T23:00:54ZengIEEEIEEE Access2169-35362021-01-01912585312586710.1109/ACCESS.2021.31112879531631Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level PredictionDuc Hai Nguyen0https://orcid.org/0000-0002-0721-5667Xuan Hien Le1https://orcid.org/0000-0002-0947-0805Jae-Yeong Heo2https://orcid.org/0000-0002-8621-8080Deg-Hyo Bae3https://orcid.org/0000-0002-0429-1154Department of Civil and Environmental Engineering, Sejong University, Gwangjin-Gu, Seoul, South KoreaFaculty of Water Resources Engineering, Thuyloi University, Dong Da, Hanoi, VietnamDepartment of Civil and Environmental Engineering, Sejong University, Gwangjin-Gu, Seoul, South KoreaDepartment of Civil and Environmental Engineering, Sejong University, Gwangjin-Gu, Seoul, South KoreaThe establishment of reliable water level prediction models is vital for urban flood control and planning. In this paper, we develop hybrid models (GA-XGBoost and DE-XGBoost) that couple two evolutionary models, a genetic algorithm (GA) and a differential evolution (DE) algorithm, with the extreme gradient boosting (XGBoost) model for hourly water level prediction. The Jungrang urban basin located on the Han River, South Korea, was selected as a case study for the proposed models. Hourly rainfall and water level data were collected between 2003 and 2020 to construct and evaluate the performance of the selected models. To compare the prediction efficiency, two other tree-based models were chosen: classification and registration tree (CART) and random forest (RF) models. A comparison of the results showed that two hybrid models, GA-XGBoost and DE-XGBoost, outperformed RF and CART in the multistep-ahead prediction of water level, and the relative errors of the hybrid model ranged from [2.18%-9.21%], compared to [3.76%-10.41%] and [2.99%-11.88%] for the RF and CART, respectively. Reliable performance was also supported by other measures. In general, the GA-XGBoost and DE-XGBoost models displayed relatively similar performance despite their small differences. The CART model was not preferable for multistep-ahead water level predictions, even though it yielded the lowest Akaike information criterion (AIC) value. This study verifies that despite having some drawbacks when considering long step-ahead prediction and model complexity, hybrid XGBoost models might be superior to many existing models for hourly water level prediction.https://ieeexplore.ieee.org/document/9531631/Extreme gradient boostingevolutionary algorithmswater level predictiontree-based modelurban floods
collection DOAJ
language English
format Article
sources DOAJ
author Duc Hai Nguyen
Xuan Hien Le
Jae-Yeong Heo
Deg-Hyo Bae
spellingShingle Duc Hai Nguyen
Xuan Hien Le
Jae-Yeong Heo
Deg-Hyo Bae
Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
IEEE Access
Extreme gradient boosting
evolutionary algorithms
water level prediction
tree-based model
urban floods
author_facet Duc Hai Nguyen
Xuan Hien Le
Jae-Yeong Heo
Deg-Hyo Bae
author_sort Duc Hai Nguyen
title Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
title_short Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
title_full Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
title_fullStr Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
title_full_unstemmed Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
title_sort development of an extreme gradient boosting model integrated with evolutionary algorithms for hourly water level prediction
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description The establishment of reliable water level prediction models is vital for urban flood control and planning. In this paper, we develop hybrid models (GA-XGBoost and DE-XGBoost) that couple two evolutionary models, a genetic algorithm (GA) and a differential evolution (DE) algorithm, with the extreme gradient boosting (XGBoost) model for hourly water level prediction. The Jungrang urban basin located on the Han River, South Korea, was selected as a case study for the proposed models. Hourly rainfall and water level data were collected between 2003 and 2020 to construct and evaluate the performance of the selected models. To compare the prediction efficiency, two other tree-based models were chosen: classification and registration tree (CART) and random forest (RF) models. A comparison of the results showed that two hybrid models, GA-XGBoost and DE-XGBoost, outperformed RF and CART in the multistep-ahead prediction of water level, and the relative errors of the hybrid model ranged from [2.18%-9.21%], compared to [3.76%-10.41%] and [2.99%-11.88%] for the RF and CART, respectively. Reliable performance was also supported by other measures. In general, the GA-XGBoost and DE-XGBoost models displayed relatively similar performance despite their small differences. The CART model was not preferable for multistep-ahead water level predictions, even though it yielded the lowest Akaike information criterion (AIC) value. This study verifies that despite having some drawbacks when considering long step-ahead prediction and model complexity, hybrid XGBoost models might be superior to many existing models for hourly water level prediction.
topic Extreme gradient boosting
evolutionary algorithms
water level prediction
tree-based model
urban floods
url https://ieeexplore.ieee.org/document/9531631/
work_keys_str_mv AT duchainguyen developmentofanextremegradientboostingmodelintegratedwithevolutionaryalgorithmsforhourlywaterlevelprediction
AT xuanhienle developmentofanextremegradientboostingmodelintegratedwithevolutionaryalgorithmsforhourlywaterlevelprediction
AT jaeyeongheo developmentofanextremegradientboostingmodelintegratedwithevolutionaryalgorithmsforhourlywaterlevelprediction
AT deghyobae developmentofanextremegradientboostingmodelintegratedwithevolutionaryalgorithmsforhourlywaterlevelprediction
_version_ 1717377829932367872