Towards Optimization of Boosting Models for Formation Lithology Identification
Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2019-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2019/5309852 |
id |
doaj-e86ec1a7035443b38279db42a8669b28 |
---|---|
record_format |
Article |
spelling |
doaj-e86ec1a7035443b38279db42a8669b282020-11-25T01:15:39ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472019-01-01201910.1155/2019/53098525309852Towards Optimization of Boosting Models for Formation Lithology IdentificationYunxin Xie0Chenyang Zhu1Yue Lu2Zhengwei Zhu3School of Petroleum Engineering, Changzhou University, Changzhou 213100, ChinaElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKElectronics and Computer Science, University of Southampton, University Road, Southampton SO17 1BJ, UKSchool of Information Science and Engineering, Changzhou University, Changzhou 213100, ChinaLithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes.http://dx.doi.org/10.1155/2019/5309852 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu |
spellingShingle |
Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu Towards Optimization of Boosting Models for Formation Lithology Identification Mathematical Problems in Engineering |
author_facet |
Yunxin Xie Chenyang Zhu Yue Lu Zhengwei Zhu |
author_sort |
Yunxin Xie |
title |
Towards Optimization of Boosting Models for Formation Lithology Identification |
title_short |
Towards Optimization of Boosting Models for Formation Lithology Identification |
title_full |
Towards Optimization of Boosting Models for Formation Lithology Identification |
title_fullStr |
Towards Optimization of Boosting Models for Formation Lithology Identification |
title_full_unstemmed |
Towards Optimization of Boosting Models for Formation Lithology Identification |
title_sort |
towards optimization of boosting models for formation lithology identification |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2019-01-01 |
description |
Lithology identification is an indispensable part in geological research and petroleum engineering study. In recent years, several mathematical approaches have been used to improve the accuracy of lithology classification. Based on our earlier work that assessed machine learning models on formation lithology classification, we optimize the boosting approaches to improve the classification ability of our boosting models with the data collected from the Daniudi gas field and Hangjinqi gas field. Three boosting models, namely, AdaBoost, Gradient Tree Boosting, and eXtreme Gradient Boosting, are evaluated with 5-fold cross validation. Regularization is applied to the Gradient Tree Boosting and eXtreme Gradient Boosting to avoid overfitting. After adapting the hyperparameter tuning approach on each boosting model to optimize the parameter set, we use stacking to combine the three optimized models to improve the classification accuracy. Results suggest that the optimized stacked boosting model has better performance concerning the evaluation matrix such as precision, recall, and f1 score compared with the single optimized boosting model. Confusion matrix also shows that the stacked model has better performance in distinguishing sandstone classes. |
url |
http://dx.doi.org/10.1155/2019/5309852 |
work_keys_str_mv |
AT yunxinxie towardsoptimizationofboostingmodelsforformationlithologyidentification AT chenyangzhu towardsoptimizationofboostingmodelsforformationlithologyidentification AT yuelu towardsoptimizationofboostingmodelsforformationlithologyidentification AT zhengweizhu towardsoptimizationofboostingmodelsforformationlithologyidentification |
_version_ |
1725151901740171264 |