MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
The paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP bas...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Spolecnost pro radioelektronicke inzenyrstvi
2004-09-01
|
Series: | Radioengineering |
Online Access: | http://www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf |
id |
doaj-305dd47570324974b0db59ef024213be |
---|---|
record_format |
Article |
spelling |
doaj-305dd47570324974b0db59ef024213be2020-11-24T22:02:21ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122004-09-011334246MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of CzechJ. NouzaP. CervaThe paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP based formulae for the adaptation ofthe HMM (Hidden Markov Model) parameters are described. Severalalternative versions of this method have been implemented andexperimentally verified in two areas, first in the isolated-wordrecognition (IWR) task and later also in the large vocabularycontinuous speech recognition (LVCSR) system, both developed for theCzech language. The results show that the word error rate (WER) can bereduced by more than 20% for a speaker who provides tens of words (incase of IWR) or tens of sentences (in case of LVCSR) for theadaptation. Recently, we have used the described methods in the designof two practical applications: voice dictation to a PC and automatictranscription of radio and TV news.www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
J. Nouza P. Cerva |
spellingShingle |
J. Nouza P. Cerva MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech Radioengineering |
author_facet |
J. Nouza P. Cerva |
author_sort |
J. Nouza |
title |
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech |
title_short |
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech |
title_full |
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech |
title_fullStr |
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech |
title_full_unstemmed |
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech |
title_sort |
map based speaker adaptation in very large vocabulary speech recognition of czech |
publisher |
Spolecnost pro radioelektronicke inzenyrstvi |
series |
Radioengineering |
issn |
1210-2512 |
publishDate |
2004-09-01 |
description |
The paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP based formulae for the adaptation ofthe HMM (Hidden Markov Model) parameters are described. Severalalternative versions of this method have been implemented andexperimentally verified in two areas, first in the isolated-wordrecognition (IWR) task and later also in the large vocabularycontinuous speech recognition (LVCSR) system, both developed for theCzech language. The results show that the word error rate (WER) can bereduced by more than 20% for a speaker who provides tens of words (incase of IWR) or tens of sentences (in case of LVCSR) for theadaptation. Recently, we have used the described methods in the designof two practical applications: voice dictation to a PC and automatictranscription of radio and TV news. |
url |
http://www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf |
work_keys_str_mv |
AT jnouza mapbasedspeakeradaptationinverylargevocabularyspeechrecognitionofczech AT pcerva mapbasedspeakeradaptationinverylargevocabularyspeechrecognitionofczech |
_version_ |
1725836242526928896 |