Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions

Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with n...

Full description

Bibliographic Details
Main Author:	J. Rajnoha
Format:	Article
Language:	English
Published:	CTU Central Library 2009-01-01
Series:	Acta Polytechnica
Subjects:	speech recognition environment adaptation spectral subtraction MLLR noisy background
Online Access:	https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105

id	doaj-fd9b0f57bb64479ca68c506866c1432b
record_format	Article
spelling	doaj-fd9b0f57bb64479ca68c506866c1432b2020-11-25T01:07:43ZengCTU Central LibraryActa Polytechnica1210-27091805-23632009-01-014921105Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real ConditionsJ. RajnohaAutomatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively. https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105speech recognitionenvironment adaptationspectral subtractionMLLRnoisy background
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	J. Rajnoha
spellingShingle	J. Rajnoha Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions Acta Polytechnica speech recognition environment adaptation spectral subtraction MLLR noisy background
author_facet	J. Rajnoha
author_sort	J. Rajnoha
title	Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_short	Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_full	Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_fullStr	Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_full_unstemmed	Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_sort	multi-condition training for unknown environment adaptation in robust asr under real conditions
publisher	CTU Central Library
series	Acta Polytechnica
issn	1210-2709 1805-2363
publishDate	2009-01-01
description	Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively.
topic	speech recognition environment adaptation spectral subtraction MLLR noisy background
url	https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105
work_keys_str_mv	AT jrajnoha multiconditiontrainingforunknownenvironmentadaptationinrobustasrunderrealconditions
_version_	1725185693705043968

Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions

Similar Items