Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions
Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with n...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
CTU Central Library
2009-01-01
|
Series: | Acta Polytechnica |
Subjects: | |
Online Access: | https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105 |
id |
doaj-fd9b0f57bb64479ca68c506866c1432b |
---|---|
record_format |
Article |
spelling |
doaj-fd9b0f57bb64479ca68c506866c1432b2020-11-25T01:07:43ZengCTU Central LibraryActa Polytechnica1210-27091805-23632009-01-014921105Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real ConditionsJ. RajnohaAutomatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively. https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105speech recognitionenvironment adaptationspectral subtractionMLLRnoisy background |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
J. Rajnoha |
spellingShingle |
J. Rajnoha Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions Acta Polytechnica speech recognition environment adaptation spectral subtraction MLLR noisy background |
author_facet |
J. Rajnoha |
author_sort |
J. Rajnoha |
title |
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions |
title_short |
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions |
title_full |
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions |
title_fullStr |
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions |
title_full_unstemmed |
Multi-Condition Training for Unknown Environment Adaptation in Robust ASR Under Real Conditions |
title_sort |
multi-condition training for unknown environment adaptation in robust asr under real conditions |
publisher |
CTU Central Library |
series |
Acta Polytechnica |
issn |
1210-2709 1805-2363 |
publishDate |
2009-01-01 |
description |
Automatic speech recognition (ASR) systems frequently work in a noisy environment. As they are often trained on clean speech data, noise reduction or adaptation techniques are applied to decrease the influence of background disturbance even in the case of unknown conditions. Speech data mixed with noise recordings from particular environment are often used for the purposes of model adaptation. This paper analyses the improvement of recognition performance within such adaptation when multi-condition training data from a real environment is used for training initial models. Although the quality of such models can decrease with the presence of noise in the training material, they are assumed to include initial information about noise and consequently support the adaptation procedure. Experimental results show significant improvement of the proposed training method in a robust ASR task under unknown noisy conditions. The decrease by 29 % and 14 % in word error rate in comparison with clean speech training data was achieved for the non-adapted and adapted system, respectively. |
topic |
speech recognition environment adaptation spectral subtraction MLLR noisy background |
url |
https://ojs.cvut.cz/ojs/index.php/ap/article/view/1105 |
work_keys_str_mv |
AT jrajnoha multiconditiontrainingforunknownenvironmentadaptationinrobustasrunderrealconditions |
_version_ |
1725185693705043968 |