A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | D. G. Pylarinos 2019-04-01 |
Series: | Engineering, Technology & Applied Science Research |
Subjects: | |
Online Access: | https://etasr.com/index.php/ETASR/article/view/2533 |
Summary: | Developing a real-time automatic speech recognition (ASR) system that adapts better to environmental variability, such as noisy surroundings, speaker variation, and accents, has become a high priority. Robustness is required, and it can be achieved at the feature extraction stage, which avoids the need for other pre-processing steps. In this paper, a new robust feature extraction method for a real-time ASR system is presented. A combination of Mel-frequency cepstral coefficients (MFCC) and the discrete wavelet transform (DWT) is proposed. This hybrid system can conserve more of the extracted speech features, which tend to be invariant to noise. The main idea is to extract MFCC features by denoising the obtained coefficients in the wavelet domain using a median filter (MF). The proposed system has been implemented on a Raspberry Pi 3, which is a suitable platform for real-time requirements. The experiments showed a high recognition rate (100%) in a clean environment and satisfactory results (ranging from 80% to 100%) in noisy environments at different signal-to-noise ratios (SNRs). |
ISSN: | 2241-4487 1792-8036 |
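
The summary describes denoising coefficients in the wavelet domain with a median filter. The record does not give the wavelet family, decomposition level, filter width, or which sub-band is filtered, so the following is only a minimal sketch under assumed choices: a single-level Haar transform, a width-3 median filter, and filtering of the detail coefficients only. The function names (`haar_dwt`, `haar_idwt`, `median_filter`, `denoise`) are hypothetical and are not taken from the paper; the MFCC stage is not shown.

```python
import math
from statistics import median


def haar_dwt(x):
    """One-level Haar DWT of an even-length sequence -> (approx, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleave the reconstructed sample pairs."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / s)
        out.append((a - d) / s)
    return out


def median_filter(values, width=3):
    """Sliding median with edge replication; width is assumed odd,
    and values is assumed non-empty."""
    half = width // 2
    padded = [values[0]] * half + list(values) + [values[-1]] * half
    return [median(padded[i:i + width]) for i in range(len(values))]


def denoise(x, width=3):
    """Assumed pipeline: Haar DWT, median-filter the detail band, invert."""
    approx, detail = haar_dwt(x)
    return haar_idwt(approx, median_filter(detail, width))
```

For example, `denoise([1, 1, 1, 1, 5, 1, 1, 1, 1, 1])` suppresses the isolated detail coefficient produced by the spike, so the spike is spread over its sample pair rather than kept as a sharp impulse. A real system would more likely rely on an established wavelet library and a multi-level decomposition; the pure-Python version here is chosen only to keep the sketch self-contained.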