Study of Speech Command Recognition

碩士 === 國立彰化師範大學 === 電機工程學系 === 97 === In this paper, Mel-frequency cepstral coefficients(MFCC) is as the feature parameter of speech., and Hidden Markov Models as the acoustics model. Generally, speech recognition system is divided into language and acoustics parts. This paper is intended to investi...

Full description

Bibliographic Details
Main Author: 江智堯
Other Authors: 王朝興
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/66990029281793057214
id ndltd-TW-097NCUE5442022
record_format oai_dc
spelling ndltd-TW-097NCUE54420222015-10-13T12:05:45Z http://ndltd.ncl.edu.tw/handle/66990029281793057214 Study of Speech Command Recognition 語音命令辨識系統之研究 江智堯 碩士 國立彰化師範大學 電機工程學系 97 In this paper, Mel-frequency cepstral coefficients(MFCC) is as the feature parameter of speech., and Hidden Markov Models as the acoustics model. Generally, speech recognition system is divided into language and acoustics parts. This paper is intended to investigate Speech Command Recognition. This paper adopts three methods. First method uses the model structure of the whole word into long, middle and short state to look for the best model. The second method revises the whole word model to the initial and final model. It aims to look for the best state number and to modify the mixture number that is promoted for recognition. The third method uses the phoneme to acoustics unit to look for the best state and to modify the mixture number that is promoted for recognition. The third method uses the phoneme to acoustics unit to look for the best state and modify mixture number that is promoted recognizes. There are experimental results that can know the recognize rate of second of method comparatively high. we have contructed a real-time voice command recognition system that can be used to control a robot. 王朝興 2009 學位論文 ; thesis 58 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立彰化師範大學 === 電機工程學系 === 97 === In this paper, Mel-frequency cepstral coefficients(MFCC) is as the feature parameter of speech., and Hidden Markov Models as the acoustics model. Generally, speech recognition system is divided into language and acoustics parts. This paper is intended to investigate Speech Command Recognition. This paper adopts three methods. First method uses the model structure of the whole word into long, middle and short state to look for the best model. The second method revises the whole word model to the initial and final model. It aims to look for the best state number and to modify the mixture number that is promoted for recognition. The third method uses the phoneme to acoustics unit to look for the best state and to modify the mixture number that is promoted for recognition. The third method uses the phoneme to acoustics unit to look for the best state and modify mixture number that is promoted recognizes. There are experimental results that can know the recognize rate of second of method comparatively high. we have contructed a real-time voice command recognition system that can be used to control a robot.
author2 王朝興
author_facet 王朝興
江智堯
author 江智堯
spellingShingle 江智堯
Study of Speech Command Recognition
author_sort 江智堯
title Study of Speech Command Recognition
title_short Study of Speech Command Recognition
title_full Study of Speech Command Recognition
title_fullStr Study of Speech Command Recognition
title_full_unstemmed Study of Speech Command Recognition
title_sort study of speech command recognition
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/66990029281793057214
work_keys_str_mv AT jiāngzhìyáo studyofspeechcommandrecognition
AT jiāngzhìyáo yǔyīnmìnglìngbiànshíxìtǒngzhīyánjiū
_version_ 1716853438816452608