A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board

The development of a real-time automatic speech recognition system (ASR) better adapted to environmental variabilities, such as noisy surroundings, speaker variations and accents has become a high priority. Robustness is required, and it can be performed at the feature extraction stage which avoids...

Full description

Bibliographic Details
Main Authors:	A. Mnassri, M. Bennasr, C. Adnane
Format:	Article
Language:	English
Published:	D. G. Pylarinos 2019-04-01
Series:	Engineering, Technology & Applied Science Research
Subjects:	automatic speech recognition discrete wavelet transform Mel frequency cestrum coefficients median filter support vector machines Raspberry Pi
Online Access:	https://etasr.com/index.php/ETASR/article/view/2533

id	doaj-cf5eeadf6afc4d54ba8982fe8d49335e
record_format	Article
spelling	doaj-cf5eeadf6afc4d54ba8982fe8d49335e2020-12-02T18:06:05ZengD. G. PylarinosEngineering, Technology & Applied Science Research2241-44871792-80362019-04-0192747A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 BoardA. Mnassri0M. Bennasr1C. Adnane2Department of Physics, Faculty of Sciences, Tunis - El Manar University, TunisiaDepartment of Physics, Faculty of Sciences, Tunis - El Manar University, TunisiaDepartment of Physics, Faculty of Sciences, Tunis - El Manar University, TunisiaThe development of a real-time automatic speech recognition system (ASR) better adapted to environmental variabilities, such as noisy surroundings, speaker variations and accents has become a high priority. Robustness is required, and it can be performed at the feature extraction stage which avoids the need for other pre-processing steps. In this paper, a new robust feature extraction method for real-time ASR system is presented. A combination of Mel-frequency cepstral coefficients (MFCC) and discrete wavelet transform (DWT) is proposed. This hybrid system can conserve more extracted speech features which tend to be invariant to noise. The main idea is to extract MFCC features by denoising the obtained coefficients in the wavelet domain by using a median filter (MF). The proposed system has been implemented on Raspberry Pi 3 which is a suitable platform for real-time requirements. The experiments showed a high recognition rate (100%) in clean environment and satisfying results (ranging from 80% to 100%) in noisy environments at different signal to noise ratios (SNRs). https://etasr.com/index.php/ETASR/article/view/2533automatic speech recognitiondiscrete wavelet transformMel frequency cestrum coefficientsmedian filtersupport vector machinesRaspberry Pi
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	A. Mnassri M. Bennasr C. Adnane
spellingShingle	A. Mnassri M. Bennasr C. Adnane A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board Engineering, Technology & Applied Science Research automatic speech recognition discrete wavelet transform Mel frequency cestrum coefficients median filter support vector machines Raspberry Pi
author_facet	A. Mnassri M. Bennasr C. Adnane
author_sort	A. Mnassri
title	A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
title_short	A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
title_full	A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
title_fullStr	A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
title_full_unstemmed	A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
title_sort	robust feature extraction method for real-time speech recognition system on a raspberry pi 3 board
publisher	D. G. Pylarinos
series	Engineering, Technology & Applied Science Research
issn	2241-4487 1792-8036
publishDate	2019-04-01
description	The development of a real-time automatic speech recognition system (ASR) better adapted to environmental variabilities, such as noisy surroundings, speaker variations and accents has become a high priority. Robustness is required, and it can be performed at the feature extraction stage which avoids the need for other pre-processing steps. In this paper, a new robust feature extraction method for real-time ASR system is presented. A combination of Mel-frequency cepstral coefficients (MFCC) and discrete wavelet transform (DWT) is proposed. This hybrid system can conserve more extracted speech features which tend to be invariant to noise. The main idea is to extract MFCC features by denoising the obtained coefficients in the wavelet domain by using a median filter (MF). The proposed system has been implemented on Raspberry Pi 3 which is a suitable platform for real-time requirements. The experiments showed a high recognition rate (100%) in clean environment and satisfying results (ranging from 80% to 100%) in noisy environments at different signal to noise ratios (SNRs).
topic	automatic speech recognition discrete wavelet transform Mel frequency cestrum coefficients median filter support vector machines Raspberry Pi
url	https://etasr.com/index.php/ETASR/article/view/2533
work_keys_str_mv	AT amnassri arobustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board AT mbennasr arobustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board AT cadnane arobustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board AT amnassri robustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board AT mbennasr robustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board AT cadnane robustfeatureextractionmethodforrealtimespeechrecognitionsystemonaraspberrypi3board
_version_	1724404431246589952

A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board

Similar Items