The large scale digital mapping of soil organic carbon using machine learning algorithms
The results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of...
Main Authors: | , |
---|---|
Format: | Article |
Language: | Russian |
Published: |
V.V. Dokuchaev Soil Science Institute
2018-03-01
|
Series: | Бюллетень Почвенного института им. В.В. Докучаева |
Subjects: | |
Online Access: | https://bulletin.esoil.ru/jour/article/view/202 |
id |
doaj-2a0f30b530e14621b4876a88ca7ff598 |
---|---|
record_format |
Article |
spelling |
doaj-2a0f30b530e14621b4876a88ca7ff5982021-07-28T13:05:28ZrusV.V. Dokuchaev Soil Science InstituteБюллетень Почвенного института им. В.В. Докучаева0136-16942312-42022018-03-01091466210.19047/0136-1694-2018-91-46-62202The large scale digital mapping of soil organic carbon using machine learning algorithmsA. V. Chinilin0I. Yu. Savin1RSAU-MTAAV.V. Dokuchaev Soil Science InstituteThe results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of soil samplings, applied for the learning and verification of models, and also on several sets of predictor variables. We took also digital elevation model, its derivatives and also remote sensing data of different spatial resolution as predictor variables. Several methods were used to create the spatial variability models for the investigated property based on the decision trees methods: random forest, boosting regression trees and Bayessian regression trees. The assessment of the models obtained accuracy was conducted by a method of cross-validation. As the accuracy indices we used the determination coefficient, mean absolute error and the root mean square error. The modelling results showed that the use of predictor variables presented by digital elevation model, its derivatives and Landsat 8 data we were able to obtain more sustainable models. The determination coefficient varied from 0.6 to 0.7, RMSEcv, i.e., the prognosing error varied from 0.5791 to 0.6520. Whereas, the best model was obtained with the method of Bayessian regression trees; whereas the predictor variables presented by the digital elevation model, its derivatives and Sentinel 2 data determination coefficient varied from 0.47 to 0.55, and the prognosing error varied from 0.7031 to 0.7909. It was revealed that in the described models according to different data sets the most significant were the various predictor variables.https://bulletin.esoil.ru/jour/article/view/202пространственное прогнозированиецифровая модель рельефаметод ансамблей деревьев решенийбустингspatial predictiondigital elevation modelrandom forestboosting |
collection |
DOAJ |
language |
Russian |
format |
Article |
sources |
DOAJ |
author |
A. V. Chinilin I. Yu. Savin |
spellingShingle |
A. V. Chinilin I. Yu. Savin The large scale digital mapping of soil organic carbon using machine learning algorithms Бюллетень Почвенного института им. В.В. Докучаева пространственное прогнозирование цифровая модель рельефа метод ансамблей деревьев решений бустинг spatial prediction digital elevation model random forest boosting |
author_facet |
A. V. Chinilin I. Yu. Savin |
author_sort |
A. V. Chinilin |
title |
The large scale digital mapping of soil organic carbon using machine learning algorithms |
title_short |
The large scale digital mapping of soil organic carbon using machine learning algorithms |
title_full |
The large scale digital mapping of soil organic carbon using machine learning algorithms |
title_fullStr |
The large scale digital mapping of soil organic carbon using machine learning algorithms |
title_full_unstemmed |
The large scale digital mapping of soil organic carbon using machine learning algorithms |
title_sort |
large scale digital mapping of soil organic carbon using machine learning algorithms |
publisher |
V.V. Dokuchaev Soil Science Institute |
series |
Бюллетень Почвенного института им. В.В. Докучаева |
issn |
0136-1694 2312-4202 |
publishDate |
2018-03-01 |
description |
The results of digital mapping of organic carbon content within the arable horizons of soils and the assessment of obtained models accuracy with the use of machine learning methods for the area of Central Russian Upland in Voronezh Oblast are presented. The digital mapping was based on 22 points of soil samplings, applied for the learning and verification of models, and also on several sets of predictor variables. We took also digital elevation model, its derivatives and also remote sensing data of different spatial resolution as predictor variables. Several methods were used to create the spatial variability models for the investigated property based on the decision trees methods: random forest, boosting regression trees and Bayessian regression trees. The assessment of the models obtained accuracy was conducted by a method of cross-validation. As the accuracy indices we used the determination coefficient, mean absolute error and the root mean square error. The modelling results showed that the use of predictor variables presented by digital elevation model, its derivatives and Landsat 8 data we were able to obtain more sustainable models. The determination coefficient varied from 0.6 to 0.7, RMSEcv, i.e., the prognosing error varied from 0.5791 to 0.6520. Whereas, the best model was obtained with the method of Bayessian regression trees; whereas the predictor variables presented by the digital elevation model, its derivatives and Sentinel 2 data determination coefficient varied from 0.47 to 0.55, and the prognosing error varied from 0.7031 to 0.7909. It was revealed that in the described models according to different data sets the most significant were the various predictor variables. |
topic |
пространственное прогнозирование цифровая модель рельефа метод ансамблей деревьев решений бустинг spatial prediction digital elevation model random forest boosting |
url |
https://bulletin.esoil.ru/jour/article/view/202 |
work_keys_str_mv |
AT avchinilin thelargescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms AT iyusavin thelargescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms AT avchinilin largescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms AT iyusavin largescaledigitalmappingofsoilorganiccarbonusingmachinelearningalgorithms |
_version_ |
1721277064332443648 |