Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security)
The aim of this thesis is to investigate speaker recognition in the presence of environmental noise, and to develop a robust speaker recognition method. Recently, Speaker Recognition has been the object of considerable research due to its wide use in various areas. Despite major developments in this...
Main Author: | |
---|---|
Published: |
University of Salford
2017
|
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.736442 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-736442 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-7364422018-06-12T03:24:44ZRobust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security)Al-Noori, A. H. Y.2017The aim of this thesis is to investigate speaker recognition in the presence of environmental noise, and to develop a robust speaker recognition method. Recently, Speaker Recognition has been the object of considerable research due to its wide use in various areas. Despite major developments in this field, there are still many limitations and challenges. Environmental noises and their variations are high up in the list of challenges since it impossible to provide a noise free environment. A novel approach is proposed to address the issue of performance degradation in environmental noise. This approach is based on the estimation of signal-to-noise ratio (SNR) and detection of ambient noise from the recognition signal to re-train the reference model for the claimed speaker and to generate a new adapted noisy model to decrease the noise mismatch with recognition utterances. This approach is termed “Training on the fly” for robustness of speaker recognition under noisy environments. To detect the noise in the recognition signal two different techniques are proposed: the first technique including generating an emulated noise depending on estimated power spectrum of the original noise using 1/3 octave band filter bank and white noise signal. This emulated noise become close enough to original one that includes in the input signal (recognition signal). The second technique deals with extracting the noise from the input signal using one of speech enhancement algorithm with spectral subtraction to find the noise in the signal. Training on the fly approach (using both techniques) has been examined using two feature approaches and two different kinds of artificial clean and noisy speech databases collected in different environments. Furthermore, the speech samples were text independent. The training on the fly approach is a significant improvement in performance when compared with the performance of conventional speaker recognition (based on clean reference models). Moreover, the training on the fly based on noise extraction showed the best results for all types of noisy data.University of Salfordhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.736442http://usir.salford.ac.uk/44604/Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
description |
The aim of this thesis is to investigate speaker recognition in the presence of environmental noise, and to develop a robust speaker recognition method. Recently, Speaker Recognition has been the object of considerable research due to its wide use in various areas. Despite major developments in this field, there are still many limitations and challenges. Environmental noises and their variations are high up in the list of challenges since it impossible to provide a noise free environment. A novel approach is proposed to address the issue of performance degradation in environmental noise. This approach is based on the estimation of signal-to-noise ratio (SNR) and detection of ambient noise from the recognition signal to re-train the reference model for the claimed speaker and to generate a new adapted noisy model to decrease the noise mismatch with recognition utterances. This approach is termed “Training on the fly” for robustness of speaker recognition under noisy environments. To detect the noise in the recognition signal two different techniques are proposed: the first technique including generating an emulated noise depending on estimated power spectrum of the original noise using 1/3 octave band filter bank and white noise signal. This emulated noise become close enough to original one that includes in the input signal (recognition signal). The second technique deals with extracting the noise from the input signal using one of speech enhancement algorithm with spectral subtraction to find the noise in the signal. Training on the fly approach (using both techniques) has been examined using two feature approaches and two different kinds of artificial clean and noisy speech databases collected in different environments. Furthermore, the speech samples were text independent. The training on the fly approach is a significant improvement in performance when compared with the performance of conventional speaker recognition (based on clean reference models). Moreover, the training on the fly based on noise extraction showed the best results for all types of noisy data. |
author |
Al-Noori, A. H. Y. |
spellingShingle |
Al-Noori, A. H. Y. Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
author_facet |
Al-Noori, A. H. Y. |
author_sort |
Al-Noori, A. H. Y. |
title |
Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
title_short |
Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
title_full |
Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
title_fullStr |
Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
title_full_unstemmed |
Robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
title_sort |
robust speaker recognition in presence of non-trivial environmental noise (toward greater biometric security) |
publisher |
University of Salford |
publishDate |
2017 |
url |
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.736442 |
work_keys_str_mv |
AT alnooriahy robustspeakerrecognitioninpresenceofnontrivialenvironmentalnoisetowardgreaterbiometricsecurity |
_version_ |
1718694161361666048 |