An improvement of initial-final based Mandarin continuous speech recognition
碩士 === 國立交通大學 === 電信研究所 === 83 === In this thesis, several techniques to improve the initial-final based HMM method for continuous Mandarin speech recognition are proposed. The baseline system uses 100 right-context-dependent initial HMM models and 39 con...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
1995
|
Online Access: | http://ndltd.ncl.edu.tw/handle/61597883163258818852 |
id |
ndltd-TW-083NCTU0436031 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-083NCTU04360312015-10-13T12:53:40Z http://ndltd.ncl.edu.tw/handle/61597883163258818852 An improvement of initial-final based Mandarin continuous speech recognition 以聲韻母為基礎之國語連續音辨認之改進 S. M. Chiang 蔣松茂 碩士 國立交通大學 電信研究所 83 In this thesis, several techniques to improve the initial-final based HMM method for continuous Mandarin speech recognition are proposed. The baseline system uses 100 right-context-dependent initial HMM models and 39 context-independent final HMM models. First, the technique of bounded state duration is employed to model the temporal structure of speech signals and incorporated into the recognition process. The technique of syllable penalty is then used to relieve the suffering of high insertion errors. We then employ the technique of signal normalization to improve the system. The performance of the recognizer is then further improved by using gender-dependent HMM models. Effectiveness of the above proposals was confirmed by simulations on a speaker- independent speech recognition task to recognize continuous Mandarin speech through telephone channel. Syllable recognition rate was raised from 30.86% to 42.14%. Finally, an RNN-based finite state machine is proposed to pre-segment the input signal into 4 states including initial, final, silence, and transient states. State-dependent Constraints are then set to restrict the search of optimal path for relieving the computation load of the one-stage recognition process. Experimental results showed that about half of the computations can be saved with a very minor loss on the recognition rate. S. H. Chen 陳信宏 1995 學位論文 ; thesis 73 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電信研究所 === 83 === In this thesis, several techniques to improve the initial-final
based HMM method for continuous Mandarin speech recognition are
proposed. The baseline system uses 100 right-context-dependent
initial HMM models and 39 context-independent final HMM models.
First, the technique of bounded state duration is employed to
model the temporal structure of speech signals and incorporated
into the recognition process. The technique of syllable penalty
is then used to relieve the suffering of high insertion errors.
We then employ the technique of signal normalization to improve
the system. The performance of the recognizer is then further
improved by using gender-dependent HMM models. Effectiveness of
the above proposals was confirmed by simulations on a speaker-
independent speech recognition task to recognize continuous
Mandarin speech through telephone channel. Syllable recognition
rate was raised from 30.86% to 42.14%. Finally, an RNN-based
finite state machine is proposed to pre-segment the input
signal into 4 states including initial, final, silence, and
transient states. State-dependent Constraints are then set to
restrict the search of optimal path for relieving the
computation load of the one-stage recognition process.
Experimental results showed that about half of the computations
can be saved with a very minor loss on the recognition rate.
|
author2 |
S. H. Chen |
author_facet |
S. H. Chen S. M. Chiang 蔣松茂 |
author |
S. M. Chiang 蔣松茂 |
spellingShingle |
S. M. Chiang 蔣松茂 An improvement of initial-final based Mandarin continuous speech recognition |
author_sort |
S. M. Chiang |
title |
An improvement of initial-final based Mandarin continuous speech recognition |
title_short |
An improvement of initial-final based Mandarin continuous speech recognition |
title_full |
An improvement of initial-final based Mandarin continuous speech recognition |
title_fullStr |
An improvement of initial-final based Mandarin continuous speech recognition |
title_full_unstemmed |
An improvement of initial-final based Mandarin continuous speech recognition |
title_sort |
improvement of initial-final based mandarin continuous speech recognition |
publishDate |
1995 |
url |
http://ndltd.ncl.edu.tw/handle/61597883163258818852 |
work_keys_str_mv |
AT smchiang animprovementofinitialfinalbasedmandarincontinuousspeechrecognition AT jiǎngsōngmào animprovementofinitialfinalbasedmandarincontinuousspeechrecognition AT smchiang yǐshēngyùnmǔwèijīchǔzhīguóyǔliánxùyīnbiànrènzhīgǎijìn AT jiǎngsōngmào yǐshēngyùnmǔwèijīchǔzhīguóyǔliánxùyīnbiànrènzhīgǎijìn AT smchiang improvementofinitialfinalbasedmandarincontinuousspeechrecognition AT jiǎngsōngmào improvementofinitialfinalbasedmandarincontinuousspeechrecognition |
_version_ |
1716868807340851200 |