Intelligent Language Modeling and Processing in Mandarin Speech Recognition
碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and exp...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
1993
|
Online Access: | http://ndltd.ncl.edu.tw/handle/67737403331273368411 |
id |
ndltd-TW-081NTU00392017 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-081NTU003920172016-02-10T04:09:01Z http://ndltd.ncl.edu.tw/handle/67737403331273368411 Intelligent Language Modeling and Processing in Mandarin Speech Recognition 國語語音辨認中智慧型語言模型與處理之研究 Yen-Ju Yang 楊燕珠 碩士 國立臺灣大學 資訊工程研究所 81 The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and experience from tested sentence. In this thesis, the recognition method is based on bigram Markov model under various smoothing techniques and various methods to incorporate unigram Markov model. In order to improve recognition rate, speed and the reliability, the special feature of this thesis is to propose some improve methods : syllable filter, learning, dynamic training and cache. They can make the system more and more intelligent. A language modeling and processing system has been developed on SUN workstation. It has tested some news on newspaper, articles in magazine and short story. It has following advantages : . high correct rate The character recognition system has average recognition rate 87.83%. When the whole text has filled into cache, we test the text second time, a very high recognition rate 98.87% can be obtained under the cache effect. Then we test the text third time someday later, the average recognition rate is 92.12% under learning effect. The word recognition system with very large vocabulary dictionary has average recognition rate 91.31%, 99.71% and 94.27% (calculated by character unit). . high speed The character recognition system needs 0.002 second on average to recognize a character. While the word recognition system needs 0.03 second on averge to recognize a word. . flexible data structure The system can switch between different models easily..module design The system can use different algorithm in every module easily. Lin-Shan Lee;Keh-Jiann Chen 李琳山;陳克健 1993 學位論文 ; thesis 114 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent
language modeling and processing system in Mandarin speech
recognition, so that to get high recognition rate, high speed
of processing time and experience from tested sentence. In this
thesis, the recognition method is based on bigram Markov model
under various smoothing techniques and various methods to
incorporate unigram Markov model. In order to improve
recognition rate, speed and the reliability, the special
feature of this thesis is to propose some improve methods :
syllable filter, learning, dynamic training and cache. They
can make the system more and more intelligent. A language
modeling and processing system has been developed on SUN
workstation. It has tested some news on newspaper, articles in
magazine and short story. It has following advantages : . high
correct rate The character recognition system has average
recognition rate 87.83%. When the whole text has filled into
cache, we test the text second time, a very high recognition
rate 98.87% can be obtained under the cache effect. Then we
test the text third time someday later, the average recognition
rate is 92.12% under learning effect. The word recognition
system with very large vocabulary dictionary has average
recognition rate 91.31%, 99.71% and 94.27% (calculated by
character unit). . high speed The character recognition system
needs 0.002 second on average to recognize a character. While
the word recognition system needs 0.03 second on averge to
recognize a word. . flexible data structure The system can
switch between different models easily..module design The
system can use different algorithm in every module easily.
|
author2 |
Lin-Shan Lee;Keh-Jiann Chen |
author_facet |
Lin-Shan Lee;Keh-Jiann Chen Yen-Ju Yang 楊燕珠 |
author |
Yen-Ju Yang 楊燕珠 |
spellingShingle |
Yen-Ju Yang 楊燕珠 Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
author_sort |
Yen-Ju Yang |
title |
Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
title_short |
Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
title_full |
Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
title_fullStr |
Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
title_full_unstemmed |
Intelligent Language Modeling and Processing in Mandarin Speech Recognition |
title_sort |
intelligent language modeling and processing in mandarin speech recognition |
publishDate |
1993 |
url |
http://ndltd.ncl.edu.tw/handle/67737403331273368411 |
work_keys_str_mv |
AT yenjuyang intelligentlanguagemodelingandprocessinginmandarinspeechrecognition AT yángyànzhū intelligentlanguagemodelingandprocessinginmandarinspeechrecognition AT yenjuyang guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū AT yángyànzhū guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū |
_version_ |
1718185269807546368 |