Towards Optimization of Boosting Models for Formation Lithology Identification

Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation...

Full description

Bibliographic Details
Main Authors: Yunxin Xie, Chenyang Zhu, Yue Lu, Zhengwei Zhu
Format: Article
Language:English
Published: Hindawi Limited 2019-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2019/5309852
id doaj-e86ec1a7035443b38279db42a8669b28
record_format Article
spelling doaj-e86ec1a7035443b38279db42a8669b282020-11-25T01:15:39ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472019-01-01201910.1155/2019/53098525309852Towards Optimization of Boosting Models for Formation Lithology IdentificationYunxin Xie0Chenyang Zhu1Yue Lu2Zhengwei Zhu3School of Petroleum Engineering, Changzhou University, Changzhou 213100, ChinaElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKSchool of Information Science and Engineering, Changzhou University, Changzhou 213100, ChinaLithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes.http://dx.doi.org/10.1155/2019/5309852
collection DOAJ
language English
format Article
sources DOAJ
author Yunxin Xie
Chenyang Zhu
Yue Lu
Zhengwei Zhu
spellingShingle Yunxin Xie
Chenyang Zhu
Yue Lu
Zhengwei Zhu
Towards Optimization of Boosting Models for Formation Lithology Identification
Mathematical Problems in Engineering
author_facet Yunxin Xie
Chenyang Zhu
Yue Lu
Zhengwei Zhu
author_sort Yunxin Xie
title Towards Optimization of Boosting Models for Formation Lithology Identification
title_short Towards Optimization of Boosting Models for Formation Lithology Identification
title_full Towards Optimization of Boosting Models for Formation Lithology Identification
title_fullStr Towards Optimization of Boosting Models for Formation Lithology Identification
title_full_unstemmed Towards Optimization of Boosting Models for Formation Lithology Identification
title_sort towards optimization of boosting models for formation lithology identification
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2019-01-01
description Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes.
url http://dx.doi.org/10.1155/2019/5309852
work_keys_str_mv AT yunxinxie towardsoptimizationofboostingmodelsforformationlithologyidentification
AT chenyangzhu towardsoptimizationofboostingmodelsforformationlithologyidentification
AT yuelu towardsoptimizationofboostingmodelsforformationlithologyidentification
AT zhengweizhu towardsoptimizationofboostingmodelsforformationlithologyidentification
_version_ 1725151901740171264