Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteris...

Full description

Bibliographic Details
Main Authors:	Grazina Korvel, Krzysztof Kakol, Olga Kurasova, Bozena Kostek
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Lombard speech quality of experience speech modeling techniques
Online Access:	https://ieeexplore.ieee.org/document/9163113/

id	doaj-567a48de021841839a724266c7503e01
record_format	Article
spelling	doaj-567a48de021841839a724266c7503e012021-03-30T03:54:02ZengIEEEIEEE Access2169-35362020-01-01815515615517010.1109/ACCESS.2020.30154219163113Evaluation of Lombard Speech Models in the Context of Speech in Noise EnhancementGrazina Korvel0Krzysztof Kakol1Olga Kurasova2https://orcid.org/0000-0002-0570-1741Bozena Kostek3https://orcid.org/0000-0001-6288-2908Institute of Data Science and Digital Technologies, Vilnius University, Vilnius, LithuaniaAudio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Gdańsk, PolandInstitute of Data Science and Digital Technologies, Vilnius University, Vilnius, LithuaniaAudio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Gdańsk, PolandThe Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech in the context of speech enhancement. For this purpose, 100 utterances of natural speech, and 100 with the Lombard effect induced are used. The goal of this study is to check to what extent speech utterances based on these models are recognizable and at what SNR (Signal-to-Noise Ratio) level threshold a particular model stops working. For this purpose, the synthesized models and Lombard speech are mixed with babble speech and street noise recordings with different SNRs. The quality of these models is measured, employing objective indicators as well as subjective tests. Since there is no standardized measure to apply to enhanced speech, an objective measure of assessing the speech quality of a model synthesizing Lombard speech characteristics, based on a feature vector, is proposed. Our approach is then compared with the standardized metric used in telecommunications as well as with subjective test results. The experimental investigations show the superiority of the source-filter models applied to synthesize Lombard speech over other models utilized. Also, the measure proposed correlates more closely with the results of the subjective evaluation than the outcomes from the ITU-T P.563 recommendation. This was checked with a ANOVA statistical analysis.https://ieeexplore.ieee.org/document/9163113/Lombard speechquality of experiencespeech modeling techniques
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Grazina Korvel Krzysztof Kakol Olga Kurasova Bozena Kostek
spellingShingle	Grazina Korvel Krzysztof Kakol Olga Kurasova Bozena Kostek Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement IEEE Access Lombard speech quality of experience speech modeling techniques
author_facet	Grazina Korvel Krzysztof Kakol Olga Kurasova Bozena Kostek
author_sort	Grazina Korvel
title	Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
title_short	Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
title_full	Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
title_fullStr	Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
title_full_unstemmed	Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
title_sort	evaluation of lombard speech models in the context of speech in noise enhancement
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech in the context of speech enhancement. For this purpose, 100 utterances of natural speech, and 100 with the Lombard effect induced are used. The goal of this study is to check to what extent speech utterances based on these models are recognizable and at what SNR (Signal-to-Noise Ratio) level threshold a particular model stops working. For this purpose, the synthesized models and Lombard speech are mixed with babble speech and street noise recordings with different SNRs. The quality of these models is measured, employing objective indicators as well as subjective tests. Since there is no standardized measure to apply to enhanced speech, an objective measure of assessing the speech quality of a model synthesizing Lombard speech characteristics, based on a feature vector, is proposed. Our approach is then compared with the standardized metric used in telecommunications as well as with subjective test results. The experimental investigations show the superiority of the source-filter models applied to synthesize Lombard speech over other models utilized. Also, the measure proposed correlates more closely with the results of the subjective evaluation than the outcomes from the ITU-T P.563 recommendation. This was checked with a ANOVA statistical analysis.
topic	Lombard speech quality of experience speech modeling techniques
url	https://ieeexplore.ieee.org/document/9163113/
work_keys_str_mv	AT grazinakorvel evaluationoflombardspeechmodelsinthecontextofspeechinnoiseenhancement AT krzysztofkakol evaluationoflombardspeechmodelsinthecontextofspeechinnoiseenhancement AT olgakurasova evaluationoflombardspeechmodelsinthecontextofspeechinnoiseenhancement AT bozenakostek evaluationoflombardspeechmodelsinthecontextofspeechinnoiseenhancement
_version_	1724182622857330688

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

Similar Items