Improving speech recognition using bionic wavelet features
Bionic wavelet transform is a continuous wavelet, based on adaptive time frequency technique. This paper presents a speech recognition system for recognizing isolated words by discretizing the continuous Bionic Wavelet (BW). Conversion from continuous to discrete is achieved by adopting central freq...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2020-07-01
|
Series: | AIMS Electronics and Electrical Engineering |
Subjects: | |
Online Access: | https://www.aimspress.com/article/10.3934/ElectrEng.2020.2.200/fulltext.html |
id |
doaj-eb62c6d0932c4f0b9d4b56566f77499b |
---|---|
record_format |
Article |
spelling |
doaj-eb62c6d0932c4f0b9d4b56566f77499b2020-11-25T02:44:53ZengAIMS PressAIMS Electronics and Electrical Engineering2578-15882020-07-014220021510.3934/ElectrEng.2020.2.200Improving speech recognition using bionic wavelet featuresVani H Y0Anusuya M A11 Department of Information Science & Engg., JSS Science & Technology University, Mysore, Karnataka, India2 Department of Computer Science & Engg., JSS Science & Technology University, Mysore, Karnataka, IndiaBionic wavelet transform is a continuous wavelet, based on adaptive time frequency technique. This paper presents a speech recognition system for recognizing isolated words by discretizing the continuous Bionic Wavelet (BW). Conversion from continuous to discrete is achieved by adopting central frequency and thresholding techniques. The BW features of noisy signal are processed through MFCC to obtain the optimal features of the speech signal. SVM, Artificial Neural Network (ANN) and LSTM techniques are used to improve the recognition rate by enhancing the speech signals. The experiments are conducted on FSDD and Kannada data set. The speech feature vector is calculated using the parameters extracted by Bionic wavelet with different central frequencies of Morlet, Daubechies and Bior3.5, coiflet5 mother wavelets. The obtained Bionic-MFCC optimal features are fed to SVM, ANN and LSTM models for the classification and recognition process. The performance of the models is tabulated for correct recognition that varies from 95% to 96% among these models. The models are tested for various SNRs noise levels like 5 dB, 10 dB, 15 dB and the recognition accuracies of these models are presented for convoluted noisy speech data.https://www.aimspress.com/article/10.3934/ElectrEng.2020.2.200/fulltext.htmlbionic wavelet transform (bwt)speech recognitionbionic-mfccwavelet transform (wt)support vector machine (svm)artificial neural network (ann)long short term memory (lstm)continuous wavelet transform (cwt)discrete waveletmorlet waveletadaptive thresholdingcenter frequencyt-function |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Vani H Y Anusuya M A |
spellingShingle |
Vani H Y Anusuya M A Improving speech recognition using bionic wavelet features AIMS Electronics and Electrical Engineering bionic wavelet transform (bwt) speech recognition bionic-mfcc wavelet transform (wt) support vector machine (svm) artificial neural network (ann) long short term memory (lstm) continuous wavelet transform (cwt) discrete wavelet morlet wavelet adaptive thresholding center frequency t-function |
author_facet |
Vani H Y Anusuya M A |
author_sort |
Vani H Y |
title |
Improving speech recognition using bionic wavelet features |
title_short |
Improving speech recognition using bionic wavelet features |
title_full |
Improving speech recognition using bionic wavelet features |
title_fullStr |
Improving speech recognition using bionic wavelet features |
title_full_unstemmed |
Improving speech recognition using bionic wavelet features |
title_sort |
improving speech recognition using bionic wavelet features |
publisher |
AIMS Press |
series |
AIMS Electronics and Electrical Engineering |
issn |
2578-1588 |
publishDate |
2020-07-01 |
description |
Bionic wavelet transform is a continuous wavelet, based on adaptive time frequency technique. This paper presents a speech recognition system for recognizing isolated words by discretizing the continuous Bionic Wavelet (BW). Conversion from continuous to discrete is achieved by adopting central frequency and thresholding techniques. The BW features of noisy signal are processed through MFCC to obtain the optimal features of the speech signal. SVM, Artificial Neural Network (ANN) and LSTM techniques are used to improve the recognition rate by enhancing the speech signals. The experiments are conducted on FSDD and Kannada data set. The speech feature vector is calculated using the parameters extracted by Bionic wavelet with different central frequencies of Morlet, Daubechies and Bior3.5, coiflet5 mother wavelets. The obtained Bionic-MFCC optimal features are fed to SVM, ANN and LSTM models for the classification and recognition process. The performance of the models is tabulated for correct recognition that varies from 95% to 96% among these models. The models are tested for various SNRs noise levels like 5 dB, 10 dB, 15 dB and the recognition accuracies of these models are presented for convoluted noisy speech data. |
topic |
bionic wavelet transform (bwt) speech recognition bionic-mfcc wavelet transform (wt) support vector machine (svm) artificial neural network (ann) long short term memory (lstm) continuous wavelet transform (cwt) discrete wavelet morlet wavelet adaptive thresholding center frequency t-function |
url |
https://www.aimspress.com/article/10.3934/ElectrEng.2020.2.200/fulltext.html |
work_keys_str_mv |
AT vanihy improvingspeechrecognitionusingbionicwaveletfeatures AT anusuyama improvingspeechrecognitionusingbionicwaveletfeatures |
_version_ |
1724765442686320640 |