Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest

Digital twin (DT) is a key technology for realizing the interconnection and intelligent operation of the physical world and the world of information and provides a new paradigm for fault diagnosis. Traditional machine learning algorithms require a balanced dataset. Training and testing sets must hav...

Full description

Bibliographic Details
Main Authors: Kai Guo, Xiang Wan, Lilan Liu, Zenggui Gao, Muchen Yang
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/16/7733
id doaj-567805bc2f684446be5d7f6085f5880c
record_format Article
spelling doaj-567805bc2f684446be5d7f6085f5880c2021-08-26T13:31:11ZengMDPI AGApplied Sciences2076-34172021-08-01117733773310.3390/app11167733Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random ForestKai Guo0Xiang Wan1Lilan Liu2Zenggui Gao3Muchen Yang4Shanghai Key Laboratory of Intelligent Manufacturing and Robotics, Shanghai University, Shanghai 200444, ChinaShanghai Key Laboratory of Intelligent Manufacturing and Robotics, Shanghai University, Shanghai 200444, ChinaShanghai Key Laboratory of Intelligent Manufacturing and Robotics, Shanghai University, Shanghai 200444, ChinaShanghai Key Laboratory of Intelligent Manufacturing and Robotics, Shanghai University, Shanghai 200444, ChinaShanghai Key Laboratory of Intelligent Manufacturing and Robotics, Shanghai University, Shanghai 200444, ChinaDigital twin (DT) is a key technology for realizing the interconnection and intelligent operation of the physical world and the world of information and provides a new paradigm for fault diagnosis. Traditional machine learning algorithms require a balanced dataset. Training and testing sets must have the same distribution. Training a good generalization model is difficult in an actual production line operation process. Fault diagnosis technology based on the digital twin uses its ultrarealistic, multisystem, and high-precision characteristics to simulate fault data that are difficult to obtain in an actual production line to train a reliable fault diagnosis model. In this article, we first propose an improved random forest (IRF) algorithm, which reselects decision trees with high accuracy and large differences through hierarchical clustering and gives them weights. Digital twin technology is used to simulate a large number of balanced datasets to train the model, and the trained model can be transferred to a physical production line through transfer learning for fault diagnosis. Finally, the feasibility of our proposed algorithm is verified through a case study of an automobile rear axle assembly line, for which the accuracy of the proposed algorithm reaches 97.8%. The traditional machine learning plus digital twin fault diagnosis method proposed in this paper involves some generalization, and thus has practical value when extended to other fields.https://www.mdpi.com/2076-3417/11/16/7733fault diagnosisdigital twinrandom foresthierarchical clusteringtransfer learning
collection DOAJ
language English
format Article
sources DOAJ
author Kai Guo
Xiang Wan
Lilan Liu
Zenggui Gao
Muchen Yang
spellingShingle Kai Guo
Xiang Wan
Lilan Liu
Zenggui Gao
Muchen Yang
Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
Applied Sciences
fault diagnosis
digital twin
random forest
hierarchical clustering
transfer learning
author_facet Kai Guo
Xiang Wan
Lilan Liu
Zenggui Gao
Muchen Yang
author_sort Kai Guo
title Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
title_short Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
title_full Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
title_fullStr Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
title_full_unstemmed Fault Diagnosis of Intelligent Production Line Based on Digital Twin and Improved Random Forest
title_sort fault diagnosis of intelligent production line based on digital twin and improved random forest
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-08-01
description Digital twin (DT) is a key technology for realizing the interconnection and intelligent operation of the physical world and the world of information and provides a new paradigm for fault diagnosis. Traditional machine learning algorithms require a balanced dataset. Training and testing sets must have the same distribution. Training a good generalization model is difficult in an actual production line operation process. Fault diagnosis technology based on the digital twin uses its ultrarealistic, multisystem, and high-precision characteristics to simulate fault data that are difficult to obtain in an actual production line to train a reliable fault diagnosis model. In this article, we first propose an improved random forest (IRF) algorithm, which reselects decision trees with high accuracy and large differences through hierarchical clustering and gives them weights. Digital twin technology is used to simulate a large number of balanced datasets to train the model, and the trained model can be transferred to a physical production line through transfer learning for fault diagnosis. Finally, the feasibility of our proposed algorithm is verified through a case study of an automobile rear axle assembly line, for which the accuracy of the proposed algorithm reaches 97.8%. The traditional machine learning plus digital twin fault diagnosis method proposed in this paper involves some generalization, and thus has practical value when extended to other fields.
topic fault diagnosis
digital twin
random forest
hierarchical clustering
transfer learning
url https://www.mdpi.com/2076-3417/11/16/7733
work_keys_str_mv AT kaiguo faultdiagnosisofintelligentproductionlinebasedondigitaltwinandimprovedrandomforest
AT xiangwan faultdiagnosisofintelligentproductionlinebasedondigitaltwinandimprovedrandomforest
AT lilanliu faultdiagnosisofintelligentproductionlinebasedondigitaltwinandimprovedrandomforest
AT zengguigao faultdiagnosisofintelligentproductionlinebasedondigitaltwinandimprovedrandomforest
AT muchenyang faultdiagnosisofintelligentproductionlinebasedondigitaltwinandimprovedrandomforest
_version_ 1721194913162330112