A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering

The purpose of this paper is to predict failures based on textual sequence data. The current failure prediction is mainly based on structured data. However, there are many unstructured data in aircraft maintenance. The failure mentioned here refers to failure types, such as transmitter failure and s...

Full description

Bibliographic Details
Main Authors: Wenbing Chang, Zhenzhong Xu, Meng You, Shenghan Zhou, Yiyong Xiao, Yang Cheng
Format: Article
Language:English
Published: MDPI AG 2018-12-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/20/12/923
id doaj-7920e42853094a2286bdc481e6962294
record_format Article
spelling doaj-7920e42853094a2286bdc481e69622942020-11-24T22:57:38ZengMDPI AGEntropy1099-43002018-12-01201292310.3390/e20120923e20120923A Bayesian Failure Prediction Network Based on Text Sequence Mining and ClusteringWenbing Chang0Zhenzhong Xu1Meng You2Shenghan Zhou3Yiyong Xiao4Yang Cheng5School of Reliability and System Engineering, Beihang University, Beijing 100191, ChinaSchool of Reliability and System Engineering, Beihang University, Beijing 100191, ChinaSchool of Reliability and System Engineering, Beihang University, Beijing 100191, ChinaSchool of Reliability and System Engineering, Beihang University, Beijing 100191, ChinaSchool of Reliability and System Engineering, Beihang University, Beijing 100191, ChinaCenter for Industrial Production, Aalborg University, 9220 Aalborg, DenmarkThe purpose of this paper is to predict failures based on textual sequence data. The current failure prediction is mainly based on structured data. However, there are many unstructured data in aircraft maintenance. The failure mentioned here refers to failure types, such as transmitter failure and signal failure, which are classified by the clustering algorithm based on the failure text. For the failure text, this paper uses the natural language processing technology. Firstly, segmentation and the removal of stop words for Chinese failure text data is performed. The study applies the word2vec moving distance model to obtain the failure occurrence sequence for failure texts collected in a fixed period of time. According to the distance, a clustering algorithm is used to obtain a typical number of fault types. Secondly, the failure occurrence sequence is mined using sequence mining algorithms, such as-PrefixSpan. Finally, the above failure sequence is used to train the Bayesian failure network model. The final experimental results show that the Bayesian failure network has higher accuracy for failure prediction.https://www.mdpi.com/1099-4300/20/12/923textual dataword2vecCFSFDPPrefixSpanBayesian failure network
collection DOAJ
language English
format Article
sources DOAJ
author Wenbing Chang
Zhenzhong Xu
Meng You
Shenghan Zhou
Yiyong Xiao
Yang Cheng
spellingShingle Wenbing Chang
Zhenzhong Xu
Meng You
Shenghan Zhou
Yiyong Xiao
Yang Cheng
A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
Entropy
textual data
word2vec
CFSFDP
PrefixSpan
Bayesian failure network
author_facet Wenbing Chang
Zhenzhong Xu
Meng You
Shenghan Zhou
Yiyong Xiao
Yang Cheng
author_sort Wenbing Chang
title A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
title_short A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
title_full A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
title_fullStr A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
title_full_unstemmed A Bayesian Failure Prediction Network Based on Text Sequence Mining and Clustering
title_sort bayesian failure prediction network based on text sequence mining and clustering
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2018-12-01
description The purpose of this paper is to predict failures based on textual sequence data. The current failure prediction is mainly based on structured data. However, there are many unstructured data in aircraft maintenance. The failure mentioned here refers to failure types, such as transmitter failure and signal failure, which are classified by the clustering algorithm based on the failure text. For the failure text, this paper uses the natural language processing technology. Firstly, segmentation and the removal of stop words for Chinese failure text data is performed. The study applies the word2vec moving distance model to obtain the failure occurrence sequence for failure texts collected in a fixed period of time. According to the distance, a clustering algorithm is used to obtain a typical number of fault types. Secondly, the failure occurrence sequence is mined using sequence mining algorithms, such as-PrefixSpan. Finally, the above failure sequence is used to train the Bayesian failure network model. The final experimental results show that the Bayesian failure network has higher accuracy for failure prediction.
topic textual data
word2vec
CFSFDP
PrefixSpan
Bayesian failure network
url https://www.mdpi.com/1099-4300/20/12/923
work_keys_str_mv AT wenbingchang abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT zhenzhongxu abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT mengyou abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT shenghanzhou abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT yiyongxiao abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT yangcheng abayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT wenbingchang bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT zhenzhongxu bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT mengyou bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT shenghanzhou bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT yiyongxiao bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
AT yangcheng bayesianfailurepredictionnetworkbasedontextsequenceminingandclustering
_version_ 1725650002311643136