Towards Optimization of Boosting Models for Formation Lithology Identification

Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation...

Full description

Bibliographic Details
Main Authors:	Yunxin Xie, Chenyang Zhu, Yue Lu, Zhengwei Zhu
Format:	Article
Language:	English
Published:	Hindawi Limited 2019-01-01
Series:	Mathematical Problems in Engineering
Online Access:	http://dx.doi.org/10.1155/2019/5309852

id	doaj-e86ec1a7035443b38279db42a8669b28
record_format	Article
spelling	doaj-e86ec1a7035443b38279db42a8669b282020-11-25T01:15:39ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472019-01-01201910.1155/2019/53098525309852Towards Optimization of Boosting Models for Formation Lithology IdentificationYunxin Xie0Chenyang Zhu1Yue Lu2Zhengwei Zhu3School of Petroleum Engineering, Changzhou University, Changzhou 213100, ChinaElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKSchool of Information Science and Engineering, Changzhou University, Changzhou 213100, ChinaLithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes.http://dx.doi.org/10.1155/2019/5309852
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu
spellingShingle	Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu Towards Optimization of Boosting Models for Formation Lithology Identification Mathematical Problems in Engineering
author_facet	Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu
author_sort	Yunxin Xie
title	Towards Optimization of Boosting Models for Formation Lithology Identification
title_short	Towards Optimization of Boosting Models for Formation Lithology Identification
title_full	Towards Optimization of Boosting Models for Formation Lithology Identification
title_fullStr	Towards Optimization of Boosting Models for Formation Lithology Identification
title_full_unstemmed	Towards Optimization of Boosting Models for Formation Lithology Identification
title_sort	towards optimization of boosting models for formation lithology identification
publisher	Hindawi Limited
series	Mathematical Problems in Engineering
issn	1024-123X 1563-5147
publishDate	2019-01-01
description	Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes.
url	http://dx.doi.org/10.1155/2019/5309852
work_keys_str_mv	AT yunxinxie towardsoptimizationofboostingmodelsforformationlithologyidentification AT chenyangzhu towardsoptimizationofboostingmodelsforformationlithologyidentification AT yuelu towardsoptimizationofboostingmodelsforformationlithologyidentification AT zhengweizhu towardsoptimizationofboostingmodelsforformationlithologyidentification
_version_	1725151901740171264

Towards Optimization of Boosting Models for Formation Lithology Identification

Similar Items