The large scale digital mapping of soil organic carbon using machine learning algorithms

The results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of...

Full description

Bibliographic Details
Main Authors: A. V. Chinilin, I. Yu. Savin
Format: Article
Language:Russian
Published: V.V. Dokuchaev Soil Science Institute 2018-03-01
Series:Бюллетень Почвенного института им. В.В. Докучаева
Subjects:
Online Access:https://bulletin.esoil.ru/jour/article/view/202
id doaj-2a0f30b530e14621b4876a88ca7ff598
record_format Article
spelling doaj-2a0f30b530e14621b4876a88ca7ff5982021-07-28T13:05:28ZrusV.V. Dokuchaev Soil Science InstituteБюллетень Почвенного института им. В.В. Докучаева0136-16942312-42022018-03-01091466210.19047/0136-1694-2018-91-46-62202The large scale digital mapping of soil organic carbon using machine learning algorithmsA. V. Chinilin0I. Yu. Savin1RSAU-MTAAV.V. Dokuchaev Soil Science InstituteThe results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of soil samplings, applied for the learning and verification of models, and also on several sets of predictor variables. We took also digital elevation model, its derivatives and also remote sensing data of different spatial resolution as predictor variables. Several methods were used to create the spatial variability models for the investigated property based on the decision trees methods: random forest, boosting regression trees and Bayessian regression trees. The assessment of the models obtained accuracy was conducted by a method of cross-validation. As the accuracy indices we used the determination coefficient, mean absolute error and the root mean square error. The modelling results showed that the use of predictor variables presented by digital elevation model, its derivatives and Landsat 8 data we were able to obtain more sustainable models. The determination coefficient varied from 0.6 to 0.7, RMSEcv, i.e., the prognosing error varied from 0.5791 to 0.6520. Whereas, the best model was obtained with the method of Bayessian regression trees; whereas the predictor variables presented by the digital elevation model, its derivatives and Sentinel 2 data determination coefficient varied from 0.47 to 0.55, and the prognosing error varied from 0.7031 to 0.7909. It was revealed that in the described models according to different data sets the most significant were the various predictor variables.https://bulletin.esoil.ru/jour/article/view/202пространственное прогнозированиецифровая модель рельефаметод ансамблей деревьев решенийбустингspatial predictiondigital elevation modelrandom forestboosting
collection DOAJ
language Russian
format Article
sources DOAJ
author A. V. Chinilin
I. Yu. Savin
spellingShingle A. V. Chinilin
I. Yu. Savin
The large scale digital mapping of soil organic carbon using machine learning algorithms
Бюллетень Почвенного института им. В.В. Докучаева
пространственное прогнозирование
цифровая модель рельефа
метод ансамблей деревьев решений
бустинг
spatial prediction
digital elevation model
random forest
boosting
author_facet A. V. Chinilin
I. Yu. Savin
author_sort A. V. Chinilin
title The large scale digital mapping of soil organic carbon using machine learning algorithms
title_short The large scale digital mapping of soil organic carbon using machine learning algorithms
title_full The large scale digital mapping of soil organic carbon using machine learning algorithms
title_fullStr The large scale digital mapping of soil organic carbon using machine learning algorithms
title_full_unstemmed The large scale digital mapping of soil organic carbon using machine learning algorithms
title_sort large scale digital mapping of soil organic carbon using machine learning algorithms
publisher V.V. Dokuchaev Soil Science Institute
series Бюллетень Почвенного института им. В.В. Докучаева
issn 0136-1694
2312-4202
publishDate 2018-03-01
description The results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of soil samplings, applied for the learning and verification of models, and also on several sets of predictor variables. We took also digital elevation model, its derivatives and also remote sensing data of different spatial resolution as predictor variables. Several methods were used to create the spatial variability models for the investigated property based on the decision trees methods: random forest, boosting regression trees and Bayessian regression trees. The assessment of the models obtained accuracy was conducted by a method of cross-validation. As the accuracy indices we used the determination coefficient, mean absolute error and the root mean square error. The modelling results showed that the use of predictor variables presented by digital elevation model, its derivatives and Landsat 8 data we were able to obtain more sustainable models. The determination coefficient varied from 0.6 to 0.7, RMSEcv, i.e., the prognosing error varied from 0.5791 to 0.6520. Whereas, the best model was obtained with the method of Bayessian regression trees; whereas the predictor variables presented by the digital elevation model, its derivatives and Sentinel 2 data determination coefficient varied from 0.47 to 0.55, and the prognosing error varied from 0.7031 to 0.7909. It was revealed that in the described models according to different data sets the most significant were the various predictor variables.
topic пространственное прогнозирование
цифровая модель рельефа
метод ансамблей деревьев решений
бустинг
spatial prediction
digital elevation model
random forest
boosting
url https://bulletin.esoil.ru/jour/article/view/202
work_keys_str_mv AT avchinilin thelargescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms
AT iyusavin thelargescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms
AT avchinilin largescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms
AT iyusavin largescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms
_version_ 1721277064332443648