Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition

碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 93 === In traditional speech recognition system, we assume that acoustic and linguistic information sources are independent. Parameters of acoustic hidden Markov model (HMM) and linguistic n-gram model are estimated individually and then combined together to build a...

Full description

Bibliographic Details
Main Authors:	To-Chang Chien, 錢鐸樟
Other Authors:	Jen-Tzung Chien
Format:	Others
Language:	zh-TW
Published:	2005
Online Access:	http://ndltd.ncl.edu.tw/handle/24325293971312481529

id	ndltd-TW-093NCKU5392025
record_format	oai_dc
spelling	ndltd-TW-093NCKU53920252017-08-27T04:29:43Z http://ndltd.ncl.edu.tw/handle/24325293971312481529 Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition 以最大熵準則結合語音及語言特徵於語音辨識之研究 To-Chang Chien 錢鐸樟碩士國立成功大學資訊工程學系碩博士班 93 In traditional speech recognition system, we assume that acoustic and linguistic information sources are independent. Parameters of acoustic hidden Markov model (HMM) and linguistic n-gram model are estimated individually and then combined together to build a plug-in maximum a posteriori (MAP) classification rule. However, the acoustic model and language model are correlated in essence. We should relax the independence assumption so as to improve speech recognition performance. In this study, we propose an integrated approach based on maximum entropy (ME) principle where acoustic and linguistic features are optimally combined in an unified framework. Using this approach, the associations between acoustic and linguistic features are explored and merged in the integrated models. On the issue of discriminative training, we also establish the relationship between ME and discriminative maximum mutual information (MMI) models. In addition, this ME integrated model is general so that the semantic topics and long distance association patterns can be further combined. In the experiments, we carry out the proposed ME model for broadcast news transcription using MATBN database. In preliminary experimental results, we obtain improvement compared to conventional speech recognition system based on plug-in MAP classification rule. Jen-Tzung Chien 簡仁宗 2005 學位論文 ; thesis 92 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 93 === In traditional speech recognition system, we assume that acoustic and linguistic information sources are independent. Parameters of acoustic hidden Markov model (HMM) and linguistic n-gram model are estimated individually and then combined together to build a plug-in maximum a posteriori (MAP) classification rule. However, the acoustic model and language model are correlated in essence. We should relax the independence assumption so as to improve speech recognition performance. In this study, we propose an integrated approach based on maximum entropy (ME) principle where acoustic and linguistic features are optimally combined in an unified framework. Using this approach, the associations between acoustic and linguistic features are explored and merged in the integrated models. On the issue of discriminative training, we also establish the relationship between ME and discriminative maximum mutual information (MMI) models. In addition, this ME integrated model is general so that the semantic topics and long distance association patterns can be further combined. In the experiments, we carry out the proposed ME model for broadcast news transcription using MATBN database. In preliminary experimental results, we obtain improvement compared to conventional speech recognition system based on plug-in MAP classification rule.
author2	Jen-Tzung Chien
author_facet	Jen-Tzung Chien To-Chang Chien 錢鐸樟
author	To-Chang Chien 錢鐸樟
spellingShingle	To-Chang Chien 錢鐸樟 Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
author_sort	To-Chang Chien
title	Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
title_short	Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
title_full	Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
title_fullStr	Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
title_full_unstemmed	Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition
title_sort	integration of acoustic and linguistic features for maximum entropy speech recognition
publishDate	2005
url	http://ndltd.ncl.edu.tw/handle/24325293971312481529
work_keys_str_mv	AT tochangchien integrationofacousticandlinguisticfeaturesformaximumentropyspeechrecognition AT qiánduózhāng integrationofacousticandlinguisticfeaturesformaximumentropyspeechrecognition AT tochangchien yǐzuìdàshāngzhǔnzéjiéhéyǔyīnjíyǔyántèzhēngyúyǔyīnbiànshízhīyánjiū AT qiánduózhāng yǐzuìdàshāngzhǔnzéjiéhéyǔyīnjíyǔyántèzhēngyúyǔyīnbiànshízhīyánjiū
_version_	1718518479214084096

Integration of Acoustic and Linguistic Features for Maximum Entropy Speech Recognition

Similar Items