Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
This paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-04-01
|
Series: | Vibration |
Subjects: | |
Online Access: | https://www.mdpi.com/2571-631X/4/2/19 |
id |
doaj-df587532117a4af6a5a3b63917021dc2 |
---|---|
record_format |
Article |
spelling |
doaj-df587532117a4af6a5a3b63917021dc22021-04-03T23:03:38ZengMDPI AGVibration2571-631X2021-04-0141928430910.3390/vibration4020019Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNsJacob Hendriks0Patrick Dumond1Department of Mechanical Engineering, Faculty of Engineering, University of Ottawa, Ottawa, ON K1N 6N5, CanadaDepartment of Mechanical Engineering, Faculty of Engineering, University of Ottawa, Ottawa, ON K1N 6N5, CanadaThis paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of classical convolutional neural networks (CNNs) with three common benchmarking datasets. Raw temporal measurement, Fourier spectrum, envelope spectrum, and spectrogram input types are individually used to train CNNs. Many configurations of CNNs are trained, with variable input sizes, convolutional kernel sizes and stride. The results show that each input type favors different combinations of hyperparameters, and that each of the datasets studied yield different performance characteristics. The input sizes are found to be the most significant determiner of whether overfitting will occur. It is demonstrated that CNNs trained with spectrograms are less dependent on hyperparameter optimization over all three datasets. This paper demonstrates the wide range of performance achieved by CNNs when preprocessing method and hyperparameters are varied as well as their complex interaction, providing researchers with useful background information and a starting place for further optimization.https://www.mdpi.com/2571-631X/4/2/19condition monitoringfault diagnosisconvolutional neural networkshyperparametersdata representations |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jacob Hendriks Patrick Dumond |
spellingShingle |
Jacob Hendriks Patrick Dumond Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs Vibration condition monitoring fault diagnosis convolutional neural networks hyperparameters data representations |
author_facet |
Jacob Hendriks Patrick Dumond |
author_sort |
Jacob Hendriks |
title |
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs |
title_short |
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs |
title_full |
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs |
title_fullStr |
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs |
title_full_unstemmed |
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs |
title_sort |
exploring the relationship between preprocessing and hyperparameter tuning for vibration-based machine fault diagnosis using cnns |
publisher |
MDPI AG |
series |
Vibration |
issn |
2571-631X |
publishDate |
2021-04-01 |
description |
This paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of classical convolutional neural networks (CNNs) with three common benchmarking datasets. Raw temporal measurement, Fourier spectrum, envelope spectrum, and spectrogram input types are individually used to train CNNs. Many configurations of CNNs are trained, with variable input sizes, convolutional kernel sizes and stride. The results show that each input type favors different combinations of hyperparameters, and that each of the datasets studied yield different performance characteristics. The input sizes are found to be the most significant determiner of whether overfitting will occur. It is demonstrated that CNNs trained with spectrograms are less dependent on hyperparameter optimization over all three datasets. This paper demonstrates the wide range of performance achieved by CNNs when preprocessing method and hyperparameters are varied as well as their complex interaction, providing researchers with useful background information and a starting place for further optimization. |
topic |
condition monitoring fault diagnosis convolutional neural networks hyperparameters data representations |
url |
https://www.mdpi.com/2571-631X/4/2/19 |
work_keys_str_mv |
AT jacobhendriks exploringtherelationshipbetweenpreprocessingandhyperparametertuningforvibrationbasedmachinefaultdiagnosisusingcnns AT patrickdumond exploringtherelationshipbetweenpreprocessingandhyperparametertuningforvibrationbasedmachinefaultdiagnosisusingcnns |
_version_ |
1721543385084329984 |