Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer

碩士 === 國立交通大學 === 電信工程研究所 === 100 === This thesis presents an ASR system based on Weighted Finite-State Transducer(WFST). In the first we will introduce some algorithms that used to construct WFST graph, and how we express different models using WFST format. Then, we described a hierarchical languag...

Full description

Bibliographic Details
Main Authors: Hsu, Yu-Chao, 許昱超
Other Authors: Chen, Sin-Horng
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/55880410912229987343
id ndltd-TW-100NCTU5435003
record_format oai_dc
spelling ndltd-TW-100NCTU54350032015-10-13T20:37:27Z http://ndltd.ncl.edu.tw/handle/55880410912229987343 Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer 以加權有限狀態轉換器實現中文連續語音辨認 Hsu, Yu-Chao 許昱超 碩士 國立交通大學 電信工程研究所 100 This thesis presents an ASR system based on Weighted Finite-State Transducer(WFST). In the first we will introduce some algorithms that used to construct WFST graph, and how we express different models using WFST format. Then, we described a hierarchical language model training by NER labels to deal with the problem of chinese person name, which often detected as OOV words. We incorporating the hierachical langugage model into one-stage and two-stage ASR system. In the one-stage ASR system, an on-the-fly replace algoritm was implemented to reduce the memory’s allocation, so we can use a complex hierachical model to calculate the probability of chinese person name. We evaluate our approach on the TCC-300 corpus, which consists of long paragraphic utterances, obtained a 0.38% absolute improvement in word error rate in one-stage ASR system; and at most 2.91% when using two-stage ASR system. Chen, Sin-Horng 陳信宏 2011 學位論文 ; thesis 50 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程研究所 === 100 === This thesis presents an ASR system based on Weighted Finite-State Transducer(WFST). In the first we will introduce some algorithms that used to construct WFST graph, and how we express different models using WFST format. Then, we described a hierarchical language model training by NER labels to deal with the problem of chinese person name, which often detected as OOV words. We incorporating the hierachical langugage model into one-stage and two-stage ASR system. In the one-stage ASR system, an on-the-fly replace algoritm was implemented to reduce the memory’s allocation, so we can use a complex hierachical model to calculate the probability of chinese person name. We evaluate our approach on the TCC-300 corpus, which consists of long paragraphic utterances, obtained a 0.38% absolute improvement in word error rate in one-stage ASR system; and at most 2.91% when using two-stage ASR system.
author2 Chen, Sin-Horng
author_facet Chen, Sin-Horng
Hsu, Yu-Chao
許昱超
author Hsu, Yu-Chao
許昱超
spellingShingle Hsu, Yu-Chao
許昱超
Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
author_sort Hsu, Yu-Chao
title Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
title_short Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
title_full Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
title_fullStr Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
title_full_unstemmed Large Vocabulary Continuous Mandarin Speech Recognition Using Weighted Finite-State Transducer
title_sort large vocabulary continuous mandarin speech recognition using weighted finite-state transducer
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/55880410912229987343
work_keys_str_mv AT hsuyuchao largevocabularycontinuousmandarinspeechrecognitionusingweightedfinitestatetransducer
AT xǔyùchāo largevocabularycontinuousmandarinspeechrecognitionusingweightedfinitestatetransducer
AT hsuyuchao yǐjiāquányǒuxiànzhuàngtàizhuǎnhuànqìshíxiànzhōngwénliánxùyǔyīnbiànrèn
AT xǔyùchāo yǐjiāquányǒuxiànzhuàngtàizhuǎnhuànqìshíxiànzhōngwénliánxùyǔyīnbiànrèn
_version_ 1718050003180584960