Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition

<p>Abstract</p> <p>We are developing a method of Web-based unsupervised language model adaptation for recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is th...

Full description

Bibliographic Details
Main Authors: Suzuki Motoyuki, Ito Akinori, Kajiura Yasutomo, Makino Shozo
Format: Article
Language:English
Published: SpringerOpen 2009-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://asmp.eurasipjournals.com/content/2009/140575
id doaj-aa770c3fe1f6451791074b8f3bc40444
record_format Article
spelling doaj-aa770c3fe1f6451791074b8f3bc404442020-11-25T00:28:37ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222009-01-0120091140575Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech RecognitionSuzuki MotoyukiIto AkinoriKajiura YasutomoMakino Shozo<p>Abstract</p> <p>We are developing a method of Web-based unsupervised language model adaptation for recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is that the selected keywords tend to contain misrecognized words. The proposed method introduces two new ideas for avoiding the effects of keywords derived from misrecognized words. The first idea is to compose multiple queries from selected keyword candidates so that the misrecognized words and correct words do not fall into one query. The second idea is that the number of Web documents downloaded for each query is determined according to the "query relevance." Combining these two ideas, we can alleviate bad effect of misrecognized keywords by decreasing the number of downloaded Web documents from queries that contain misrecognized keywords. Finally, we examine a method of determining the number of iterative adaptations based on the recognition likelihood. Experiments have shown that the proposed stopping criterion can determine almost the optimum number of iterations. In the final experiment, the word accuracy without adaptation (55.29%) was improved to 60.38%, which was 1.13 point better than the result of the conventional unsupervised adaptation method (59.25%).</p>http://asmp.eurasipjournals.com/content/2009/140575
collection DOAJ
language English
format Article
sources DOAJ
author Suzuki Motoyuki
Ito Akinori
Kajiura Yasutomo
Makino Shozo
spellingShingle Suzuki Motoyuki
Ito Akinori
Kajiura Yasutomo
Makino Shozo
Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
EURASIP Journal on Audio, Speech, and Music Processing
author_facet Suzuki Motoyuki
Ito Akinori
Kajiura Yasutomo
Makino Shozo
author_sort Suzuki Motoyuki
title Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
title_short Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
title_full Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
title_fullStr Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
title_full_unstemmed Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
title_sort automatic query generation and query relevance measurement for unsupervised language model adaptation of speech recognition
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2009-01-01
description <p>Abstract</p> <p>We are developing a method of Web-based unsupervised language model adaptation for recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is that the selected keywords tend to contain misrecognized words. The proposed method introduces two new ideas for avoiding the effects of keywords derived from misrecognized words. The first idea is to compose multiple queries from selected keyword candidates so that the misrecognized words and correct words do not fall into one query. The second idea is that the number of Web documents downloaded for each query is determined according to the "query relevance." Combining these two ideas, we can alleviate bad effect of misrecognized keywords by decreasing the number of downloaded Web documents from queries that contain misrecognized keywords. Finally, we examine a method of determining the number of iterative adaptations based on the recognition likelihood. Experiments have shown that the proposed stopping criterion can determine almost the optimum number of iterations. In the final experiment, the word accuracy without adaptation (55.29%) was improved to 60.38%, which was 1.13 point better than the result of the conventional unsupervised adaptation method (59.25%).</p>
url http://asmp.eurasipjournals.com/content/2009/140575
work_keys_str_mv AT suzukimotoyuki automaticquerygenerationandqueryrelevancemeasurementforunsupervisedlanguagemodeladaptationofspeechrecognition
AT itoakinori automaticquerygenerationandqueryrelevancemeasurementforunsupervisedlanguagemodeladaptationofspeechrecognition
AT kajiurayasutomo automaticquerygenerationandqueryrelevancemeasurementforunsupervisedlanguagemodeladaptationofspeechrecognition
AT makinoshozo automaticquerygenerationandqueryrelevancemeasurementforunsupervisedlanguagemodeladaptationofspeechrecognition
_version_ 1725335303331250176