Hybrid Computerized Method for Environmental Sound Classification
Classification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) cre...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9129696/ |
id |
doaj-2c435412f80c416cb69926af23188bb8 |
---|---|
record_format |
Article |
spelling |
doaj-2c435412f80c416cb69926af23188bb82021-03-30T01:55:20ZengIEEEIEEE Access2169-35362020-01-01812405512406510.1109/ACCESS.2020.30060829129696Hybrid Computerized Method for Environmental Sound ClassificationSilvia Liberata Ullo0https://orcid.org/0000-0001-6294-0581Smith K. Khare1https://orcid.org/0000-0001-8365-1092Varun Bajaj2https://orcid.org/0000-0002-8721-1219G. R. Sinha3https://orcid.org/0000-0003-2384-4591Department of Engineering, Universita Degli Studi Del Sannio, Benevento, ItalyDepartment of Electronics and Communication, PDPM Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, IndiaDepartment of Electronics and Communication, PDPM Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, IndiaMyanmar Institute of Information Technology (MIIT), Mandalay, MyanmarClassification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) creates a need to be urgently satisfied. As a response to this limitation, in this paper, a hybrid model for automatic and accurate classification of environmental sounds is proposed. Optimum allocation sampling (OAS) is used to elicit the informative samples from each class. The representative samples obtained by OAS are turned into the spectrogram containing their time-frequency-amplitude representation by using a short-time Fourier transform (STFT). The spectrogram is then given as an input to pre-trained AlexNet and Visual Geometry Group (VGG)-16 networks. Multiple deep features are extracted using the pre-trained networks and classified by using multiple classification techniques namely decision tree (fine, medium, coarse kernel), k-nearest neighbor (fine, medium, cosine, cubic, coarse and weighted kernel), support vector machine, linear discriminant analysis, bagged tree and softmax classifiers. The ESC-10, a ten-class environmental sound dataset, is used for the evaluation of the methodology. An accuracy of 90.1%, 95.8%, 94.7%, 87.9%, 95.6%, and 92.4% is obtained with a decision tree, k-neared neighbor, support vector machine, linear discriminant analysis, bagged tree and softmax classifier respectively. The proposed method proved to be robust, effective, and promising in comparison with other existing state-of-the-art techniques, using the same dataset.https://ieeexplore.ieee.org/document/9129696/Environmental sound classificationoptimal allocation samplingspectrogramconvolutional neural networkclassification techniques |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Silvia Liberata Ullo Smith K. Khare Varun Bajaj G. R. Sinha |
spellingShingle |
Silvia Liberata Ullo Smith K. Khare Varun Bajaj G. R. Sinha Hybrid Computerized Method for Environmental Sound Classification IEEE Access Environmental sound classification optimal allocation sampling spectrogram convolutional neural network classification techniques |
author_facet |
Silvia Liberata Ullo Smith K. Khare Varun Bajaj G. R. Sinha |
author_sort |
Silvia Liberata Ullo |
title |
Hybrid Computerized Method for Environmental Sound Classification |
title_short |
Hybrid Computerized Method for Environmental Sound Classification |
title_full |
Hybrid Computerized Method for Environmental Sound Classification |
title_fullStr |
Hybrid Computerized Method for Environmental Sound Classification |
title_full_unstemmed |
Hybrid Computerized Method for Environmental Sound Classification |
title_sort |
hybrid computerized method for environmental sound classification |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Classification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) creates a need to be urgently satisfied. As a response to this limitation, in this paper, a hybrid model for automatic and accurate classification of environmental sounds is proposed. Optimum allocation sampling (OAS) is used to elicit the informative samples from each class. The representative samples obtained by OAS are turned into the spectrogram containing their time-frequency-amplitude representation by using a short-time Fourier transform (STFT). The spectrogram is then given as an input to pre-trained AlexNet and Visual Geometry Group (VGG)-16 networks. Multiple deep features are extracted using the pre-trained networks and classified by using multiple classification techniques namely decision tree (fine, medium, coarse kernel), k-nearest neighbor (fine, medium, cosine, cubic, coarse and weighted kernel), support vector machine, linear discriminant analysis, bagged tree and softmax classifiers. The ESC-10, a ten-class environmental sound dataset, is used for the evaluation of the methodology. An accuracy of 90.1%, 95.8%, 94.7%, 87.9%, 95.6%, and 92.4% is obtained with a decision tree, k-neared neighbor, support vector machine, linear discriminant analysis, bagged tree and softmax classifier respectively. The proposed method proved to be robust, effective, and promising in comparison with other existing state-of-the-art techniques, using the same dataset. |
topic |
Environmental sound classification optimal allocation sampling spectrogram convolutional neural network classification techniques |
url |
https://ieeexplore.ieee.org/document/9129696/ |
work_keys_str_mv |
AT silvialiberataullo hybridcomputerizedmethodforenvironmentalsoundclassification AT smithkkhare hybridcomputerizedmethodforenvironmentalsoundclassification AT varunbajaj hybridcomputerizedmethodforenvironmentalsoundclassification AT grsinha hybridcomputerizedmethodforenvironmentalsoundclassification |
_version_ |
1724186196716814336 |