Hybrid Computerized Method for Environmental Sound Classification

Classification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) cre...

Full description

Bibliographic Details
Main Authors: Silvia Liberata Ullo, Smith K. Khare, Varun Bajaj, G. R. Sinha
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9129696/
id doaj-2c435412f80c416cb69926af23188bb8
record_format Article
spelling doaj-2c435412f80c416cb69926af23188bb82021-03-30T01:55:20ZengIEEEIEEE Access2169-35362020-01-01812405512406510.1109/ACCESS.2020.30060829129696Hybrid Computerized Method for Environmental Sound ClassificationSilvia Liberata Ullo0https://orcid.org/0000-0001-6294-0581Smith K. Khare1https://orcid.org/0000-0001-8365-1092Varun Bajaj2https://orcid.org/0000-0002-8721-1219G. R. Sinha3https://orcid.org/0000-0003-2384-4591Department of Engineering, Universita Degli Studi Del Sannio, Benevento, ItalyDepartment of Electronics and Communication, PDPM Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, IndiaDepartment of Electronics and Communication, PDPM Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, IndiaMyanmar Institute of Information Technology (MIIT), Mandalay, MyanmarClassification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) creates a need to be urgently satisfied. As a response to this limitation, in this paper, a hybrid model for automatic and accurate classification of environmental sounds is proposed. Optimum allocation sampling (OAS) is used to elicit the informative samples from each class. The representative samples obtained by OAS are turned into the spectrogram containing their time-frequency-amplitude representation by using a short-time Fourier transform (STFT). The spectrogram is then given as an input to pre-trained AlexNet and Visual Geometry Group (VGG)-16 networks. Multiple deep features are extracted using the pre-trained networks and classified by using multiple classification techniques namely decision tree (fine, medium, coarse kernel), k-nearest neighbor (fine, medium, cosine, cubic, coarse and weighted kernel), support vector machine, linear discriminant analysis, bagged tree and softmax classifiers. The ESC-10, a ten-class environmental sound dataset, is used for the evaluation of the methodology. An accuracy of 90.1%, 95.8%, 94.7%, 87.9%, 95.6%, and 92.4% is obtained with a decision tree, k-neared neighbor, support vector machine, linear discriminant analysis, bagged tree and softmax classifier respectively. The proposed method proved to be robust, effective, and promising in comparison with other existing state-of-the-art techniques, using the same dataset.https://ieeexplore.ieee.org/document/9129696/Environmental sound classificationoptimal allocation samplingspectrogramconvolutional neural networkclassification techniques
collection DOAJ
language English
format Article
sources DOAJ
author Silvia Liberata Ullo
Smith K. Khare
Varun Bajaj
G. R. Sinha
spellingShingle Silvia Liberata Ullo
Smith K. Khare
Varun Bajaj
G. R. Sinha
Hybrid Computerized Method for Environmental Sound Classification
IEEE Access
Environmental sound classification
optimal allocation sampling
spectrogram
convolutional neural network
classification techniques
author_facet Silvia Liberata Ullo
Smith K. Khare
Varun Bajaj
G. R. Sinha
author_sort Silvia Liberata Ullo
title Hybrid Computerized Method for Environmental Sound Classification
title_short Hybrid Computerized Method for Environmental Sound Classification
title_full Hybrid Computerized Method for Environmental Sound Classification
title_fullStr Hybrid Computerized Method for Environmental Sound Classification
title_full_unstemmed Hybrid Computerized Method for Environmental Sound Classification
title_sort hybrid computerized method for environmental sound classification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Classification of environmental sounds plays a key role in security, investigation, robotics since the study of the sounds present in a specific environment can allow to get significant insights. Lack of standardized methods for an automatic and effective environmental sound classification (ESC) creates a need to be urgently satisfied. As a response to this limitation, in this paper, a hybrid model for automatic and accurate classification of environmental sounds is proposed. Optimum allocation sampling (OAS) is used to elicit the informative samples from each class. The representative samples obtained by OAS are turned into the spectrogram containing their time-frequency-amplitude representation by using a short-time Fourier transform (STFT). The spectrogram is then given as an input to pre-trained AlexNet and Visual Geometry Group (VGG)-16 networks. Multiple deep features are extracted using the pre-trained networks and classified by using multiple classification techniques namely decision tree (fine, medium, coarse kernel), k-nearest neighbor (fine, medium, cosine, cubic, coarse and weighted kernel), support vector machine, linear discriminant analysis, bagged tree and softmax classifiers. The ESC-10, a ten-class environmental sound dataset, is used for the evaluation of the methodology. An accuracy of 90.1%, 95.8%, 94.7%, 87.9%, 95.6%, and 92.4% is obtained with a decision tree, k-neared neighbor, support vector machine, linear discriminant analysis, bagged tree and softmax classifier respectively. The proposed method proved to be robust, effective, and promising in comparison with other existing state-of-the-art techniques, using the same dataset.
topic Environmental sound classification
optimal allocation sampling
spectrogram
convolutional neural network
classification techniques
url https://ieeexplore.ieee.org/document/9129696/
work_keys_str_mv AT silvialiberataullo hybridcomputerizedmethodforenvironmentalsoundclassification
AT smithkkhare hybridcomputerizedmethodforenvironmentalsoundclassification
AT varunbajaj hybridcomputerizedmethodforenvironmentalsoundclassification
AT grsinha hybridcomputerizedmethodforenvironmentalsoundclassification
_version_ 1724186196716814336