Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
<p>Machine learning (ML) and data-driven approaches are increasingly used in many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into a state-of-the-art approach for many ML challenges. However, it has rarely been used in simulations of land use...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Copernicus Publications
2021-03-01
|
Series: | Geoscientific Model Development |
Online Access: | https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf |
id |
doaj-10247cb1b35a4e14b65ca5b1e6ba15a4 |
---|---|
record_format |
Article |
spelling |
doaj-10247cb1b35a4e14b65ca5b1e6ba15a42021-03-16T07:10:12ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032021-03-01141493151010.5194/gmd-14-1493-2021Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, ChinaBatunacun0Batunacun1R. Wieland2T. Lakes3T. Lakes4C. Nendel5C. Nendel6Department of Geography, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyDepartment of Geography, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, GermanyIntegrative Research Institute on Transformations of Human-Environment Systems, Humboldt-Universität zu Berlin, Friedrichstraße 191, 10099 Berlin, GermanyLeibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Straße 84, 15374 Müncheberg, GermanyIntegrative Research Institute on Transformations of Human-Environment Systems, Humboldt-Universität zu Berlin, Friedrichstraße 191, 10099 Berlin, Germany<p>Machine learning (ML) and data-driven approaches are increasingly used in many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into a state-of-the-art approach for many ML challenges. However, it has rarely been used in simulations of land use change so far. Xilingol, a typical region for research on serious grassland degradation and its drivers, was selected as a case study to test whether XGBoost can provide alternative insights that conventional land-use models are unable to generate. A set of 20 drivers was analysed using XGBoost, involving four alternative sampling strategies, and SHAP (Shapley additive explanations) to interpret the results of the purely data-driven approach. The results indicated that, with three of the sampling strategies (over-balanced, balanced, and imbalanced), XGBoost achieved similar and robust simulation results. SHAP values were useful for analysing the complex relationship between the different drivers of grassland degradation. Four drivers accounted for 99 % of the grassland degradation dynamics in Xilingol. These four drivers were spatially allocated, and a risk map of further degradation was produced. The limitations of using XGBoost to predict future land-use change are discussed.</p>https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Batunacun Batunacun R. Wieland T. Lakes T. Lakes C. Nendel C. Nendel |
spellingShingle |
Batunacun Batunacun R. Wieland T. Lakes T. Lakes C. Nendel C. Nendel Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China Geoscientific Model Development |
author_facet |
Batunacun Batunacun R. Wieland T. Lakes T. Lakes C. Nendel C. Nendel |
author_sort |
Batunacun |
title |
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China |
title_short |
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China |
title_full |
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China |
title_fullStr |
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China |
title_full_unstemmed |
Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China |
title_sort |
using shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in xilingol, china |
publisher |
Copernicus Publications |
series |
Geoscientific Model Development |
issn |
1991-959X 1991-9603 |
publishDate |
2021-03-01 |
description |
<p>Machine learning (ML) and data-driven approaches are increasingly used in
many research areas. Extreme gradient boosting (XGBoost) is a tree boosting method that has evolved into
a state-of-the-art approach for many ML challenges. However, it has rarely
been used in simulations of land use change so far. Xilingol, a typical
region for research on serious grassland degradation and its drivers, was
selected as a case study to test whether XGBoost can provide alternative
insights that conventional land-use models are unable to generate. A set of
20 drivers was analysed using XGBoost, involving four alternative
sampling strategies, and SHAP (Shapley additive explanations) to interpret
the results of the purely data-driven approach. The results indicated that,
with three of the sampling strategies (over-balanced, balanced, and
imbalanced), XGBoost achieved similar and robust simulation results. SHAP
values were useful for analysing the complex relationship between the
different drivers of grassland degradation. Four drivers accounted for
99 % of the grassland degradation dynamics in Xilingol. These four drivers
were spatially allocated, and a risk map of further degradation was
produced. The limitations of using XGBoost to predict future land-use change
are discussed.</p> |
url |
https://gmd.copernicus.org/articles/14/1493/2021/gmd-14-1493-2021.pdf |
work_keys_str_mv |
AT batunacun usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT batunacun usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT rwieland usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT tlakes usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT tlakes usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT cnendel usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina AT cnendel usingshapleyadditiveexplanationstointerpretextremegradientboostingpredictionsofgrasslanddegradationinxilingolchina |
_version_ |
1724219860994490368 |