Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China

<p>Machine learning (ML) and data-driven approaches are increasingly used in many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into a state-of-the-art approach for many ML challenges. However, it has rarely been used in simulations of land use...

Full description

Bibliographic Details
Main Authors: Batunacun, R. Wieland, T. Lakes, C. Nendel
Format: Article
Language:English
Published: Copernicus Publications 2021-03-01
Series:Geoscientific Model Development
Online Access:https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf
id doaj-10247cb1b35a4e14b65ca5b1e6ba15a4
record_format Article
spelling doaj-10247cb1b35a4e14b65ca5b1e6ba15a42021-03-16T07:10:12ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032021-03-01141493151010.5194/gmd-14-1493-2021Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, ChinaBatunacun0Batunacun1R. Wieland2T. Lakes3T. Lakes4C. Nendel5C. Nendel6Department of Geography, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyDepartment of Geography, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, GermanyIntegrative Research Institute on Transformations of Human-Environment Systems, Humboldt-Universität zu Berlin, Friedrichstraße 191, 10099 Berlin, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyIntegrative Research Institute on Transformations of Human-Environment Systems, Humboldt-Universität zu Berlin, Friedrichstraße 191, 10099 Berlin, Germany<p>Machine learning (ML) and data-driven approaches are increasingly used in many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into a state-of-the-art approach for many ML challenges. However, it has rarely been used in simulations of land use change so far. Xilingol, a typical region for research on serious grassland degradation and its drivers, was selected as a case study to test whether XGBoost can provide alternative insights that conventional land-use models are unable to generate. A set of 20 drivers was analysed using XGBoost, involving four alternative sampling strategies, and SHAP (Shapley additive explanations) to interpret the results of the purely data-driven approach. The results indicated that, with three of the sampling strategies (over-balanced, balanced, and imbalanced), XGBoost achieved similar and robust simulation results. SHAP values were useful for analysing the complex relationship between the different drivers of grassland degradation. Four drivers accounted for 99 % of the grassland degradation dynamics in Xilingol. These four drivers were spatially allocated, and a risk map of further degradation was produced. The limitations of using XGBoost to predict future land-use change are discussed.</p>https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Batunacun
Batunacun
R. Wieland
T. Lakes
T. Lakes
C. Nendel
C. Nendel
spellingShingle Batunacun
Batunacun
R. Wieland
T. Lakes
T. Lakes
C. Nendel
C. Nendel
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
Geoscientific Model Development
author_facet Batunacun
Batunacun
R. Wieland
T. Lakes
T. Lakes
C. Nendel
C. Nendel
author_sort Batunacun
title Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
title_short Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
title_full Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
title_fullStr Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
title_full_unstemmed Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
title_sort using shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in xilingol, china
publisher Copernicus Publications
series Geoscientific Model Development
issn 1991-959X
1991-9603
publishDate 2021-03-01
description <p>Machine learning (ML) and data-driven approaches are increasingly used in many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into a state-of-the-art approach for many ML challenges. However, it has rarely been used in simulations of land use change so far. Xilingol, a typical region for research on serious grassland degradation and its drivers, was selected as a case study to test whether XGBoost can provide alternative insights that conventional land-use models are unable to generate. A set of 20 drivers was analysed using XGBoost, involving four alternative sampling strategies, and SHAP (Shapley additive explanations) to interpret the results of the purely data-driven approach. The results indicated that, with three of the sampling strategies (over-balanced, balanced, and imbalanced), XGBoost achieved similar and robust simulation results. SHAP values were useful for analysing the complex relationship between the different drivers of grassland degradation. Four drivers accounted for 99 % of the grassland degradation dynamics in Xilingol. These four drivers were spatially allocated, and a risk map of further degradation was produced. The limitations of using XGBoost to predict future land-use change are discussed.</p>
url https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf
work_keys_str_mv AT batunacun usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT batunacun usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT rwieland usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT tlakes usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT tlakes usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT cnendel usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
AT cnendel usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina
_version_ 1724219860994490368