Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition

Master's thesis === National Cheng Kung University === Department of Computer Science and Information Engineering (Master's and Doctoral Program) === 93 === In a traditional speech recognition system, acoustic and linguistic information sources are assumed to be independent. The parameters of the acoustic hidden Markov model (HMM) and the linguistic n-gram model are estimated individually and then combined to build a...

Bibliographic Details
Main Authors: To-Chang Chien, 錢鐸樟
Other Authors: Jen-Tzung Chien
Format: Others
Language: zh-TW
Published: 2005
Online Access: http://ndltd.ncl.edu.tw/handle/24325293971312481529
id ndltd-TW-093NCKU5392025
record_format oai_dc
spelling ndltd-TW-093NCKU5392025 2017-08-27T04:29:43Z http://ndltd.ncl.edu.tw/handle/24325293971312481529 Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition 以最大熵準則結合語音及語言特徵於語音辨識之研究 To-Chang Chien 錢鐸樟 Master's thesis, National Cheng Kung University, Department of Computer Science and Information Engineering (Master's and Doctoral Program), 93 Jen-Tzung Chien 簡仁宗 2005 thesis 92 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's thesis === National Cheng Kung University === Department of Computer Science and Information Engineering (Master's and Doctoral Program) === 93 === In a traditional speech recognition system, acoustic and linguistic information sources are assumed to be independent. The parameters of the acoustic hidden Markov model (HMM) and the linguistic n-gram model are estimated individually and then combined to build a plug-in maximum a posteriori (MAP) classification rule. However, the acoustic model and the language model are inherently correlated, so the independence assumption should be relaxed to improve speech recognition performance. In this study, we propose an integrated approach based on the maximum entropy (ME) principle, in which acoustic and linguistic features are optimally combined in a unified framework. Using this approach, the associations between acoustic and linguistic features are explored and merged into the integrated models. On the issue of discriminative training, we also establish the relationship between ME and discriminative maximum mutual information (MMI) models. In addition, the ME integrated model is general enough that semantic topics and long-distance association patterns can be further incorporated. In the experiments, we apply the proposed ME model to broadcast news transcription using the MATBN database. Preliminary experimental results show an improvement over a conventional speech recognition system based on the plug-in MAP classification rule.
author2 Jen-Tzung Chien
author_facet Jen-Tzung Chien
To-Chang Chien
錢鐸樟
author To-Chang Chien
錢鐸樟
author_sort To-Chang Chien
title Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/24325293971312481529