An improvement of initial-final based Mandarin continuous speech recognition

碩士 === 國立交通大學 === 電信研究所 === 83 === In this thesis, several techniques to improve the initial-final based HMM method for continuous Mandarin speech recognition are proposed. The baseline system uses 100 right-context-dependent initial HMM models and 39 con...

Full description

Bibliographic Details
Main Authors: S. M. Chiang, 蔣松茂
Other Authors: S. H. Chen
Format: Others
Language:zh-TW
Published: 1995
Online Access:http://ndltd.ncl.edu.tw/handle/61597883163258818852
id ndltd-TW-083NCTU0436031
record_format oai_dc
spelling ndltd-TW-083NCTU04360312015-10-13T12:53:40Z http://ndltd.ncl.edu.tw/handle/61597883163258818852 An improvement of initial-final based Mandarin continuous speech recognition 以聲韻母為基礎之國語連續音辨認之改進 S. M. Chiang 蔣松茂 碩士 國立交通大學 電信研究所 83 In this thesis, several techniques to improve the initial-final based HMM method for continuous Mandarin speech recognition are proposed. The baseline system uses 100 right-context-dependent initial HMM models and 39 context-independent final HMM models. First, the technique of bounded state duration is employed to model the temporal structure of speech signals and incorporated into the recognition process. The technique of syllable penalty is then used to relieve the suffering of high insertion errors. We then employ the technique of signal normalization to improve the system. The performance of the recognizer is then further improved by using gender-dependent HMM models. Effectiveness of the above proposals was confirmed by simulations on a speaker- independent speech recognition task to recognize continuous Mandarin speech through telephone channel. Syllable recognition rate was raised from 30.86% to 42.14%. Finally, an RNN-based finite state machine is proposed to pre-segment the input signal into 4 states including initial, final, silence, and transient states. State-dependent Constraints are then set to restrict the search of optimal path for relieving the computation load of the one-stage recognition process. Experimental results showed that about half of the computations can be saved with a very minor loss on the recognition rate. S. H. Chen 陳信宏 1995 學位論文 ; thesis 73 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信研究所 === 83 === In this thesis, several techniques to improve the initial-final based HMM method for continuous Mandarin speech recognition are proposed. The baseline system uses 100 right-context-dependent initial HMM models and 39 context-independent final HMM models. First, the technique of bounded state duration is employed to model the temporal structure of speech signals and incorporated into the recognition process. The technique of syllable penalty is then used to relieve the suffering of high insertion errors. We then employ the technique of signal normalization to improve the system. The performance of the recognizer is then further improved by using gender-dependent HMM models. Effectiveness of the above proposals was confirmed by simulations on a speaker- independent speech recognition task to recognize continuous Mandarin speech through telephone channel. Syllable recognition rate was raised from 30.86% to 42.14%. Finally, an RNN-based finite state machine is proposed to pre-segment the input signal into 4 states including initial, final, silence, and transient states. State-dependent Constraints are then set to restrict the search of optimal path for relieving the computation load of the one-stage recognition process. Experimental results showed that about half of the computations can be saved with a very minor loss on the recognition rate.
author2 S. H. Chen
author_facet S. H. Chen
S. M. Chiang
蔣松茂
author S. M. Chiang
蔣松茂
spellingShingle S. M. Chiang
蔣松茂
An improvement of initial-final based Mandarin continuous speech recognition
author_sort S. M. Chiang
title An improvement of initial-final based Mandarin continuous speech recognition
title_short An improvement of initial-final based Mandarin continuous speech recognition
title_full An improvement of initial-final based Mandarin continuous speech recognition
title_fullStr An improvement of initial-final based Mandarin continuous speech recognition
title_full_unstemmed An improvement of initial-final based Mandarin continuous speech recognition
title_sort improvement of initial-final based mandarin continuous speech recognition
publishDate 1995
url http://ndltd.ncl.edu.tw/handle/61597883163258818852
work_keys_str_mv AT smchiang animprovementofinitialfinalbasedmandarincontinuousspeechrecognition
AT jiǎngsōngmào animprovementofinitialfinalbasedmandarincontinuousspeechrecognition
AT smchiang yǐshēngyùnmǔwèijīchǔzhīguóyǔliánxùyīnbiànrènzhīgǎijìn
AT jiǎngsōngmào yǐshēngyùnmǔwèijīchǔzhīguóyǔliánxùyīnbiànrènzhīgǎijìn
AT smchiang improvementofinitialfinalbasedmandarincontinuousspeechrecognition
AT jiǎngsōngmào improvementofinitialfinalbasedmandarincontinuousspeechrecognition
_version_ 1716868807340851200