Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions

Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with n...

Full description

Bibliographic Details
Main Author: J. Rajnoha
Format: Article
Language:English
Published: CTU Central Library 2009-01-01
Series:Acta Polytechnica
Subjects:
Online Access:https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105
id doaj-fd9b0f57bb64479ca68c506866c1432b
record_format Article
spelling doaj-fd9b0f57bb64479ca68c506866c1432b2020-11-25T01:07:43ZengCTU Central LibraryActa Polytechnica1210-27091805-23632009-01-014921105Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real ConditionsJ. RajnohaAutomatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively. https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105speech recognitionenvironment adaptationspectral subtractionMLLRnoisy background
collection DOAJ
language English
format Article
sources DOAJ
author J. Rajnoha
spellingShingle J. Rajnoha
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
Acta Polytechnica
speech recognition
environment adaptation
spectral subtraction
MLLR
noisy background
author_facet J. Rajnoha
author_sort J. Rajnoha
title Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_short Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_full Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_fullStr Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_full_unstemmed Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
title_sort multi-condition training for unknown environment adaptation in robust asr under real conditions
publisher CTU Central Library
series Acta Polytechnica
issn 1210-2709
1805-2363
publishDate 2009-01-01
description Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively. 
topic speech recognition
environment adaptation
spectral subtraction
MLLR
noisy background
url https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105
work_keys_str_mv AT jrajnoha multiconditiontrainingforunknownenvironmentadaptationinrobustasrunderrealconditions
_version_ 1725185693705043968