MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech

The paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP bas...

Full description

Bibliographic Details
Main Authors: J. Nouza, P. Cerva
Format: Article
Language:English
Published: Spolecnost pro radioelektronicke inzenyrstvi 2004-09-01
Series:Radioengineering
Online Access:http://www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf
id doaj-305dd47570324974b0db59ef024213be
record_format Article
spelling doaj-305dd47570324974b0db59ef024213be2020-11-24T22:02:21ZengSpolecnost pro radioelektronicke inzenyrstviRadioengineering1210-25122004-09-011334246MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of CzechJ. NouzaP. CervaThe paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP based formulae for the adaptation ofthe HMM (Hidden Markov Model) parameters are described. Severalalternative versions of this method have been implemented andexperimentally verified in two areas, first in the isolated-wordrecognition (IWR) task and later also in the large vocabularycontinuous speech recognition (LVCSR) system, both developed for theCzech language. The results show that the word error rate (WER) can bereduced by more than 20% for a speaker who provides tens of words (incase of IWR) or tens of sentences (in case of LVCSR) for theadaptation. Recently, we have used the described methods in the designof two practical applications: voice dictation to a PC and automatictranscription of radio and TV news.www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf
collection DOAJ
language English
format Article
sources DOAJ
author J. Nouza
P. Cerva
spellingShingle J. Nouza
P. Cerva
MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
Radioengineering
author_facet J. Nouza
P. Cerva
author_sort J. Nouza
title MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
title_short MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
title_full MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
title_fullStr MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
title_full_unstemmed MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech
title_sort map based speaker adaptation in very large vocabulary speech recognition of czech
publisher Spolecnost pro radioelektronicke inzenyrstvi
series Radioengineering
issn 1210-2512
publishDate 2004-09-01
description The paper deals with the problem of efficient adaptation of speechrecognition systems to individual users. The goal is to achieve betterperformance in specific applications where one known speaker isexpected. In our approach we adopt the MAP (Maximum A Posteriori)method for this purpose. The MAP based formulae for the adaptation ofthe HMM (Hidden Markov Model) parameters are described. Severalalternative versions of this method have been implemented andexperimentally verified in two areas, first in the isolated-wordrecognition (IWR) task and later also in the large vocabularycontinuous speech recognition (LVCSR) system, both developed for theCzech language. The results show that the word error rate (WER) can bereduced by more than 20% for a speaker who provides tens of words (incase of IWR) or tens of sentences (in case of LVCSR) for theadaptation. Recently, we have used the described methods in the designof two practical applications: voice dictation to a PC and automatictranscription of radio and TV news.
url http://www.radioeng.cz/fulltexts/2004/04_03_42_46.pdf
work_keys_str_mv AT jnouza mapbasedspeakeradaptationinverylargevocabularyspeechrecognitionofczech
AT pcerva mapbasedspeakeradaptationinverylargevocabularyspeechrecognitionofczech
_version_ 1725836242526928896