Development of High Accuracy Classifier for the Speaker Recognition System

Speech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this p...

Full description

Bibliographic Details
Main Authors:	Raghad Tariq Al-Hassani, Dogu Cagdas Atilla, Çağatay Aydin
Format:	Article
Language:	English
Published:	Hindawi Limited 2021-01-01
Series:	Applied Bionics and Biomechanics
Online Access:	http://dx.doi.org/10.1155/2021/5559616

id	doaj-a2547c10308f440baeefab5bbec5aba9
record_format	Article
spelling	doaj-a2547c10308f440baeefab5bbec5aba92021-07-02T19:07:11ZengHindawi LimitedApplied Bionics and Biomechanics1754-21032021-01-01202110.1155/2021/5559616Development of High Accuracy Classifier for the Speaker Recognition SystemRaghad Tariq Al-Hassani0Dogu Cagdas Atilla1Çağatay Aydin2Faculty of EngineeringFaculty of EngineeringFaculty of EngineeringSpeech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this paper, a hybrid speaker identification model for consistent speech features and high recognition accuracy is made. Features using Mel frequency spectrum coefficients (MFCC) have been improved by incorporating a pitch frequency coefficient from speech time domain analysis. In order to enhance noise immunity, we proposed a single hidden layer feed-forward neural network (FFNN) tuned by an optimized particle swarm optimization (OPSO) algorithm. The proposed model is tested using 10-fold cross-validation over different levels of Adaptive White Gaussian Noise (AWGN) (0-50 dB). A recognition accuracy of 97.83% was obtained from the proposed model in clean voice environments. However, a noisy channel is realized with lesser impact on the proposed model as compared with other baseline classifiers such as plain-FFNN, random forest (RF), K-nearest neighbour (KNN), and support vector machine (SVM).http://dx.doi.org/10.1155/2021/5559616
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Raghad Tariq Al-Hassani Dogu Cagdas Atilla Çağatay Aydin
spellingShingle	Raghad Tariq Al-Hassani Dogu Cagdas Atilla Çağatay Aydin Development of High Accuracy Classifier for the Speaker Recognition System Applied Bionics and Biomechanics
author_facet	Raghad Tariq Al-Hassani Dogu Cagdas Atilla Çağatay Aydin
author_sort	Raghad Tariq Al-Hassani
title	Development of High Accuracy Classifier for the Speaker Recognition System
title_short	Development of High Accuracy Classifier for the Speaker Recognition System
title_full	Development of High Accuracy Classifier for the Speaker Recognition System
title_fullStr	Development of High Accuracy Classifier for the Speaker Recognition System
title_full_unstemmed	Development of High Accuracy Classifier for the Speaker Recognition System
title_sort	development of high accuracy classifier for the speaker recognition system
publisher	Hindawi Limited
series	Applied Bionics and Biomechanics
issn	1754-2103
publishDate	2021-01-01
description	Speech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this paper, a hybrid speaker identification model for consistent speech features and high recognition accuracy is made. Features using Mel frequency spectrum coefficients (MFCC) have been improved by incorporating a pitch frequency coefficient from speech time domain analysis. In order to enhance noise immunity, we proposed a single hidden layer feed-forward neural network (FFNN) tuned by an optimized particle swarm optimization (OPSO) algorithm. The proposed model is tested using 10-fold cross-validation over different levels of Adaptive White Gaussian Noise (AWGN) (0-50 dB). A recognition accuracy of 97.83% was obtained from the proposed model in clean voice environments. However, a noisy channel is realized with lesser impact on the proposed model as compared with other baseline classifiers such as plain-FFNN, random forest (RF), K-nearest neighbour (KNN), and support vector machine (SVM).
url	http://dx.doi.org/10.1155/2021/5559616
work_keys_str_mv	AT raghadtariqalhassani developmentofhighaccuracyclassifierforthespeakerrecognitionsystem AT dogucagdasatilla developmentofhighaccuracyclassifierforthespeakerrecognitionsystem AT cagatayaydin developmentofhighaccuracyclassifierforthespeakerrecognitionsystem
_version_	1721324226736029696

Development of High Accuracy Classifier for the Speaker Recognition System

Similar Items