A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
Master's === National Tsing Hua University === Department of Computer Science === 91 === Mobile phones and personal digital devices are becoming smaller while offering more functions, which makes traditional typed input inconvenient. Voice input is a promising solution. In this thesis,...
Main Authors: | Hsieh Chia-yang, 謝佳揚 |
---|---|
Other Authors: | Jyh-Shing Roger Jang |
Format: | Others |
Language: | zh-TW |
Published: | 2003 |
Online Access: | http://ndltd.ncl.edu.tw/handle/15283721042529571140 |
id |
ndltd-TW-091NTHU0392032 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-091NTHU0392032 2016-06-22T04:26:24Z http://ndltd.ncl.edu.tw/handle/15283721042529571140 A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition 樣型比對技術應用於中文數字語音辨認之研究 Hsieh Chia-yang 謝佳揚 Master's National Tsing Hua University Department of Computer Science 91 Mobile phones and personal digital devices are becoming smaller while offering more functions, which makes traditional typed input inconvenient. Voice input is a promising solution. In this thesis, we develop a speaker-independent Mandarin digit speech recognition system based on discrete hidden Markov models (DHMMs), aiming for simple models, low computation, and a high recognition rate. We use several techniques to improve accuracy: feature extraction, feature vector quantization, classifier combination, and corrective training. In feature extraction, we use nonuniform frame shifting (NUFS) to increase the weight of the beginning parts of the utterance. In feature vector quantization, we use a separate codebook for each digit. The distance between a feature vector and the codebook centers can itself serve as a very good classifier for digit recognition. Furthermore, combining this distance with the DHMM log probability can also increase accuracy. We also apply corrective training, which corrects a model that is classified incorrectly or whose log probability is close to that of the target model. Additionally, we use the segmental probability model (SPM) to reduce computation time. Jyh-Shing Roger Jang 張智星 2003 thesis 38 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Tsing Hua University === Department of Computer Science === 91 === Mobile phones and personal digital devices are becoming smaller while offering more functions, which makes traditional typed input inconvenient. Voice input is a promising solution.
In this thesis, we develop a speaker-independent Mandarin digit speech recognition system based on discrete hidden Markov models (DHMMs), aiming for simple models, low computation, and a high recognition rate. We use several techniques to improve accuracy: feature extraction, feature vector quantization, classifier combination, and corrective training. In feature extraction, we use nonuniform frame shifting (NUFS) to increase the weight of the beginning parts of the utterance. In feature vector quantization, we use a separate codebook for each digit. The distance between a feature vector and the codebook centers can itself serve as a very good classifier for digit recognition. Furthermore, combining this distance with the DHMM log probability can also increase accuracy.
We also apply corrective training, which corrects a model that is classified incorrectly or whose log probability is close to that of the target model. Additionally, we use the segmental probability model (SPM) to reduce computation time.
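The per-digit codebook idea in the abstract can be illustrated with a small sketch. This is not the thesis's implementation: the function names, the use of squared Euclidean distance, the averaging over frames, the linear combination with a `weight` parameter, and the toy two-dimensional data are all assumptions made for illustration. The record does not say how the distance and the DHMM log probability are actually combined.

```python
# Hypothetical sketch: classify an utterance by how well each digit's own
# codebook quantizes it, optionally combined with a per-digit log probability.

def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def average_distortion(frames, codebook):
    """Mean distance from each frame to its nearest codeword in one codebook."""
    nearest = [min(squared_distance(f, c) for c in codebook) for f in frames]
    return sum(nearest) / len(nearest)

def classify_by_distortion(frames, codebooks):
    """Return the digit whose codebook gives the smallest average distortion."""
    return min(codebooks, key=lambda d: average_distortion(frames, codebooks[d]))

def classify_combined(frames, codebooks, log_probs, weight=1.0):
    """Combine a given log probability with the distortion (assumed linear rule).

    log_probs maps digit -> log probability from some other model (for example
    a DHMM); here the values are simply supplied by the caller.
    """
    scores = {
        d: log_probs[d] - weight * average_distortion(frames, codebooks[d])
        for d in codebooks
    }
    return max(scores, key=scores.get)

# Toy data (made up): two digits, two codewords each, 2-D feature vectors.
codebooks = {
    0: [(0.0, 0.0), (1.0, 0.0)],
    1: [(5.0, 5.0), (6.0, 5.0)],
}
frames = [(0.1, 0.0), (0.9, 0.1)]
log_probs = {0: -12.0, 1: -10.0}  # placeholder scores, not real DHMM output

print(classify_by_distortion(frames, codebooks))
print(classify_combined(frames, codebooks, log_probs, weight=1.0))
```

With `weight=0.0` the combined rule reduces to picking the highest log probability alone, so the weight controls how much the codebook distance influences the decision.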
|
author2 |
Jyh-Shing Roger Jang |
author_facet |
Jyh-Shing Roger Jang Hsieh Chia-yang 謝佳揚 |
author |
Hsieh Chia-yang 謝佳揚 |
spellingShingle |
Hsieh Chia-yang 謝佳揚 A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition |
author_sort |
Hsieh Chia-yang |
title |
A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition |
publishDate |
2003 |
url |
http://ndltd.ncl.edu.tw/handle/15283721042529571140 |