Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs

This paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of...

Full description

Bibliographic Details
Main Authors: Jacob Hendriks, Patrick Dumond
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Vibration
Subjects:
Online Access:https://www.mdpi.com/2571-631X/4/2/19
id doaj-df587532117a4af6a5a3b63917021dc2
record_format Article
spelling doaj-df587532117a4af6a5a3b63917021dc22021-04-03T23:03:38ZengMDPI AGVibration2571-631X2021-04-0141928430910.3390/vibration4020019Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNsJacob Hendriks0Patrick Dumond1Department of Mechanical Engineering, Faculty of Engineering, University of Ottawa, Ottawa, ON K1N 6N5, CanadaDepartment of Mechanical Engineering, Faculty of Engineering, University of Ottawa, Ottawa, ON K1N 6N5, CanadaThis paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of classical convolutional neural networks (CNNs) with three common benchmarking datasets. Raw temporal measurement, Fourier spectrum, envelope spectrum, and spectrogram input types are individually used to train CNNs. Many configurations of CNNs are trained, with variable input sizes, convolutional kernel sizes and stride. The results show that each input type favors different combinations of hyperparameters, and that each of the datasets studied yield different performance characteristics. The input sizes are found to be the most significant determiner of whether overfitting will occur. It is demonstrated that CNNs trained with spectrograms are less dependent on hyperparameter optimization over all three datasets. This paper demonstrates the wide range of performance achieved by CNNs when preprocessing method and hyperparameters are varied as well as their complex interaction, providing researchers with useful background information and a starting place for further optimization.https://www.mdpi.com/2571-631X/4/2/19condition monitoringfault diagnosisconvolutional neural networkshyperparametersdata representations
collection DOAJ
language English
format Article
sources DOAJ
author Jacob Hendriks
Patrick Dumond
spellingShingle Jacob Hendriks
Patrick Dumond
Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
Vibration
condition monitoring
fault diagnosis
convolutional neural networks
hyperparameters
data representations
author_facet Jacob Hendriks
Patrick Dumond
author_sort Jacob Hendriks
title Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
title_short Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
title_full Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
title_fullStr Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
title_full_unstemmed Exploring the Relationship between Preprocessing and Hyperparameter Tuning for Vibration-Based Machine Fault Diagnosis Using CNNs
title_sort exploring the relationship between preprocessing and hyperparameter tuning for vibration-based machine fault diagnosis using cnns
publisher MDPI AG
series Vibration
issn 2571-631X
publishDate 2021-04-01
description This paper demonstrates the differences between popular transformation-based input representations for vibration-based machine fault diagnosis. This paper highlights the dependency of different input representations on hyperparameter selection with the results of training different configurations of classical convolutional neural networks (CNNs) with three common benchmarking datasets. Raw temporal measurement, Fourier spectrum, envelope spectrum, and spectrogram input types are individually used to train CNNs. Many configurations of CNNs are trained, with variable input sizes, convolutional kernel sizes and stride. The results show that each input type favors different combinations of hyperparameters, and that each of the datasets studied yield different performance characteristics. The input sizes are found to be the most significant determiner of whether overfitting will occur. It is demonstrated that CNNs trained with spectrograms are less dependent on hyperparameter optimization over all three datasets. This paper demonstrates the wide range of performance achieved by CNNs when preprocessing method and hyperparameters are varied as well as their complex interaction, providing researchers with useful background information and a starting place for further optimization.
topic condition monitoring
fault diagnosis
convolutional neural networks
hyperparameters
data representations
url https://www.mdpi.com/2571-631X/4/2/19
work_keys_str_mv AT jacobhendriks exploringtherelationshipbetweenpreprocessingandhyperparametertuningforvibrationbasedmachinefaultdiagnosisusingcnns
AT patrickdumond exploringtherelationshipbetweenpreprocessingandhyperparametertuningforvibrationbasedmachinefaultdiagnosisusingcnns
_version_ 1721543385084329984