A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
A novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification m...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-01-01
|
Series: | Processes |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-9717/8/1/105 |
id |
doaj-d65fe985cd6748bd859d0cbe68baea89 |
---|---|
record_format |
Article |
spelling |
doaj-d65fe985cd6748bd859d0cbe68baea892020-11-25T01:47:08ZengMDPI AGProcesses2227-97172020-01-018110510.3390/pr8010105pr8010105A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical ProcessesXuqing Jia0Wende Tian1Chuankun Li2Xia Yang3Zhongjun Luo4Hui Wang5College of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaCollege of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaState Key Laboratory of Safety and Control for Chemicals, SINOPEC Qingdao Research Institute of Safety Engineering, Qingdao 266071, ChinaCollege of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaShandong Qiwangda Group Petrochemical CO., LTD, Linzi 255400, ChinaShandong Qiwangda Group Petrochemical CO., LTD, Linzi 255400, ChinaA novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification model is trained based on the obtained key process variables. Thirdly, the pseudo label confidence of identification model is dynamically optimized with an historical, current, and future pseudo label confidence mean. To increase the upper limit of the identification model that is self-learning with high entropy process data, active learning is used to identify process data and diagnosis fault causes by ontology. Finally, a PCA-dynamic active safe semi-supervised support vector machine (PCA-DAS4VM) for fault identification in labeled expensive chemical processes is built. The application in the Tennessee Eastman (TE) process shows that this hybrid technology is able to: (i) eliminate chemical process noise and redundant process variables simultaneously, (ii) combine historical pseudo label confidence with future pseudo label confidence to improve the identification accuracy of abnormal working conditions, (iii) efficiently select and diagnose high entropy unlabeled process data, and (iv) fully utilize unlabeled data to enhance the identification performance.https://www.mdpi.com/2227-9717/8/1/105semi-supervised learningactive learningfeature selectionontologychemical processfault identification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xuqing Jia Wende Tian Chuankun Li Xia Yang Zhongjun Luo Hui Wang |
spellingShingle |
Xuqing Jia Wende Tian Chuankun Li Xia Yang Zhongjun Luo Hui Wang A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes Processes semi-supervised learning active learning feature selection ontology chemical process fault identification |
author_facet |
Xuqing Jia Wende Tian Chuankun Li Xia Yang Zhongjun Luo Hui Wang |
author_sort |
Xuqing Jia |
title |
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes |
title_short |
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes |
title_full |
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes |
title_fullStr |
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes |
title_full_unstemmed |
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes |
title_sort |
dynamic active safe semi-supervised learning framework for fault identification in labeled expensive chemical processes |
publisher |
MDPI AG |
series |
Processes |
issn |
2227-9717 |
publishDate |
2020-01-01 |
description |
A novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification model is trained based on the obtained key process variables. Thirdly, the pseudo label confidence of identification model is dynamically optimized with an historical, current, and future pseudo label confidence mean. To increase the upper limit of the identification model that is self-learning with high entropy process data, active learning is used to identify process data and diagnosis fault causes by ontology. Finally, a PCA-dynamic active safe semi-supervised support vector machine (PCA-DAS4VM) for fault identification in labeled expensive chemical processes is built. The application in the Tennessee Eastman (TE) process shows that this hybrid technology is able to: (i) eliminate chemical process noise and redundant process variables simultaneously, (ii) combine historical pseudo label confidence with future pseudo label confidence to improve the identification accuracy of abnormal working conditions, (iii) efficiently select and diagnose high entropy unlabeled process data, and (iv) fully utilize unlabeled data to enhance the identification performance. |
topic |
semi-supervised learning active learning feature selection ontology chemical process fault identification |
url |
https://www.mdpi.com/2227-9717/8/1/105 |
work_keys_str_mv |
AT xuqingjia adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT wendetian adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT chuankunli adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT xiayang adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT zhongjunluo adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT huiwang adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT xuqingjia dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT wendetian dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT chuankunli dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT xiayang dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT zhongjunluo dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses AT huiwang dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses |
_version_ |
1725016021385871360 |