A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes

A novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification m...

Full description

Bibliographic Details
Main Authors: Xuqing Jia, Wende Tian, Chuankun Li, Xia Yang, Zhongjun Luo, Hui Wang
Format: Article
Language:English
Published: MDPI AG 2020-01-01
Series:Processes
Subjects:
Online Access:https://www.mdpi.com/2227-9717/8/1/105
id doaj-d65fe985cd6748bd859d0cbe68baea89
record_format Article
spelling doaj-d65fe985cd6748bd859d0cbe68baea892020-11-25T01:47:08ZengMDPI AGProcesses2227-97172020-01-018110510.3390/pr8010105pr8010105A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical ProcessesXuqing Jia0Wende Tian1Chuankun Li2Xia Yang3Zhongjun Luo4Hui Wang5College of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaCollege of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaState Key Laboratory of Safety and Control for Chemicals, SINOPEC Qingdao Research Institute of Safety Engineering, Qingdao 266071, ChinaCollege of Chemical Engineering, Qingdao University of Science & Technology, Qingdao 266042, ChinaShandong Qiwangda Group Petrochemical CO., LTD, Linzi 255400, ChinaShandong Qiwangda Group Petrochemical CO., LTD, Linzi 255400, ChinaA novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification model is trained based on the obtained key process variables. Thirdly, the pseudo label confidence of identification model is dynamically optimized with an historical, current, and future pseudo label confidence mean. To increase the upper limit of the identification model that is self-learning with high entropy process data, active learning is used to identify process data and diagnosis fault causes by ontology. Finally, a PCA-dynamic active safe semi-supervised support vector machine (PCA-DAS4VM) for fault identification in labeled expensive chemical processes is built. The application in the Tennessee Eastman (TE) process shows that this hybrid technology is able to: (i) eliminate chemical process noise and redundant process variables simultaneously, (ii) combine historical pseudo label confidence with future pseudo label confidence to improve the identification accuracy of abnormal working conditions, (iii) efficiently select and diagnose high entropy unlabeled process data, and (iv) fully utilize unlabeled data to enhance the identification performance.https://www.mdpi.com/2227-9717/8/1/105semi-supervised learningactive learningfeature selectionontologychemical processfault identification
collection DOAJ
language English
format Article
sources DOAJ
author Xuqing Jia
Wende Tian
Chuankun Li
Xia Yang
Zhongjun Luo
Hui Wang
spellingShingle Xuqing Jia
Wende Tian
Chuankun Li
Xia Yang
Zhongjun Luo
Hui Wang
A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
Processes
semi-supervised learning
active learning
feature selection
ontology
chemical process
fault identification
author_facet Xuqing Jia
Wende Tian
Chuankun Li
Xia Yang
Zhongjun Luo
Hui Wang
author_sort Xuqing Jia
title A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
title_short A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
title_full A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
title_fullStr A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
title_full_unstemmed A Dynamic Active Safe Semi-Supervised Learning Framework for Fault Identification in Labeled Expensive Chemical Processes
title_sort dynamic active safe semi-supervised learning framework for fault identification in labeled expensive chemical processes
publisher MDPI AG
series Processes
issn 2227-9717
publishDate 2020-01-01
description A novel active semi-supervised learning framework using unlabeled data is proposed for fault identification in labeled expensive chemical processes. A principal component analysis (PCA) feature selection strategy is first given to calculate the weight of the variables. Secondly, the identification model is trained based on the obtained key process variables. Thirdly, the pseudo label confidence of identification model is dynamically optimized with an historical, current, and future pseudo label confidence mean. To increase the upper limit of the identification model that is self-learning with high entropy process data, active learning is used to identify process data and diagnosis fault causes by ontology. Finally, a PCA-dynamic active safe semi-supervised support vector machine (PCA-DAS4VM) for fault identification in labeled expensive chemical processes is built. The application in the Tennessee Eastman (TE) process shows that this hybrid technology is able to: (i) eliminate chemical process noise and redundant process variables simultaneously, (ii) combine historical pseudo label confidence with future pseudo label confidence to improve the identification accuracy of abnormal working conditions, (iii) efficiently select and diagnose high entropy unlabeled process data, and (iv) fully utilize unlabeled data to enhance the identification performance.
topic semi-supervised learning
active learning
feature selection
ontology
chemical process
fault identification
url https://www.mdpi.com/2227-9717/8/1/105
work_keys_str_mv AT xuqingjia adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT wendetian adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT chuankunli adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT xiayang adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT zhongjunluo adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT huiwang adynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT xuqingjia dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT wendetian dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT chuankunli dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT xiayang dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT zhongjunluo dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
AT huiwang dynamicactivesafesemisupervisedlearningframeworkforfaultidentificationinlabeledexpensivechemicalprocesses
_version_ 1725016021385871360