Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
Master's === 國立成功大學 === 資訊工程學系碩博士班 === 93 === In a traditional speech recognition system, the acoustic and linguistic information sources are assumed to be independent. The parameters of the acoustic hidden Markov model (HMM) and the linguistic n-gram model are estimated separately and then combined to build a...
Main Authors: | To-Chang Chien, 錢鐸樟 |
---|---|
Other Authors: | Jen-Tzung Chien, 簡仁宗 |
Format: | Others |
Language: | zh-TW |
Published: | 2005 |
Online Access: | http://ndltd.ncl.edu.tw/handle/24325293971312481529 |
id |
ndltd-TW-093NCKU5392025 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NCKU5392025 2017-08-27T04:29:43Z http://ndltd.ncl.edu.tw/handle/24325293971312481529 Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition 以最大熵準則結合語音及語言特徵於語音辨識之研究 To-Chang Chien 錢鐸樟 Master's 國立成功大學 資訊工程學系碩博士班 93 In a traditional speech recognition system, the acoustic and linguistic information sources are assumed to be independent. The parameters of the acoustic hidden Markov model (HMM) and the linguistic n-gram model are estimated separately and then combined to build a plug-in maximum a posteriori (MAP) classification rule. However, the acoustic model and the language model are inherently correlated, so the independence assumption should be relaxed to improve speech recognition performance. In this study, we propose an integrated approach based on the maximum entropy (ME) principle, in which acoustic and linguistic features are optimally combined in a unified framework. Using this approach, the associations between acoustic and linguistic features are explored and merged in the integrated models. On the issue of discriminative training, we also establish the relationship between ME and discriminative maximum mutual information (MMI) models. In addition, the ME integrated model is general enough that semantic topics and long-distance association patterns can be further incorporated. In the experiments, we apply the proposed ME model to broadcast news transcription using the MATBN database. In preliminary experimental results, we obtain an improvement over a conventional speech recognition system based on the plug-in MAP classification rule. Jen-Tzung Chien 簡仁宗 2005 學位論文 ; thesis 92 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === 國立成功大學 === 資訊工程學系碩博士班 === 93 === In a traditional speech recognition system, the acoustic and linguistic information sources are assumed to be independent. The parameters of the acoustic hidden Markov model (HMM) and the linguistic n-gram model are estimated separately and then combined to build a plug-in maximum a posteriori (MAP) classification rule. However, the acoustic model and the language model are inherently correlated, so the independence assumption should be relaxed to improve speech recognition performance. In this study, we propose an integrated approach based on the maximum entropy (ME) principle, in which acoustic and linguistic features are optimally combined in a unified framework. Using this approach, the associations between acoustic and linguistic features are explored and merged in the integrated models. On the issue of discriminative training, we also establish the relationship between ME and discriminative maximum mutual information (MMI) models. In addition, the ME integrated model is general enough that semantic topics and long-distance association patterns can be further incorporated. In the experiments, we apply the proposed ME model to broadcast news transcription using the MATBN database. In preliminary experimental results, we obtain an improvement over a conventional speech recognition system based on the plug-in MAP classification rule. |
author2 |
Jen-Tzung Chien |
author_facet |
Jen-Tzung Chien To-Chang Chien 錢鐸樟 |
author |
To-Chang Chien 錢鐸樟 |
spellingShingle |
To-Chang Chien 錢鐸樟 Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition |
author_sort |
To-Chang Chien |
title |
Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition |
title_sort |
integration of acoustic and linguistic features for maximum entropy speech recognition |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/24325293971312481529 |
work_keys_str_mv |
AT tochangchien integrationofacousticandlinguisticfeaturesformaximumentropyspeechrecognition AT qiánduózhāng integrationofacousticandlinguisticfeaturesformaximumentropyspeechrecognition AT tochangchien yǐzuìdàshāngzhǔnzéjiéhéyǔyīnjíyǔyántèzhēngyúyǔyīnbiànshízhīyánjiū AT qiánduózhāng yǐzuìdàshāngzhǔnzéjiéhéyǔyīnjíyǔyántèzhēngyúyǔyīnbiànshízhīyánjiū |
_version_ |
1718518479214084096 |