Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx

Master's === National Sun Yat-sen University === Department of Computer Science and Engineering === 101 === In this paper, we introduce a system that provides digit speech recognition services. The system is deployed on the internet, so users can easily access it over the network. In addition to the speech recognition service, we also provide...


Bibliographic Details
Main Authors: Tsung-peng Yen, 顏宗芃
Other Authors: Chia-Ping Chen
Format: Others
Language: zh-TW
Published: 2013
Online Access: http://ndltd.ncl.edu.tw/handle/58419670260226264357
id ndltd-TW-101NSYS5392069
record_format oai_dc
spelling ndltd-TW-101NSYS5392069 2015-10-13T22:40:49Z http://ndltd.ncl.edu.tw/handle/58419670260226264357 Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx 基於 Sphinx 可快速個人化行動語音辨識系統 Tsung-peng Yen 顏宗芃 Master's National Sun Yat-sen University Department of Computer Science and Engineering 101 Chia-Ping Chen 陳嘉平 2013 thesis 49 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Sun Yat-sen University === Department of Computer Science and Engineering === 101 === In this paper, we introduce a system that provides digit speech recognition services. The system is deployed on the internet, so users can easily access it over the network. In addition to the speech recognition service, we provide an adaptation function that improves noise robustness across different environments. For English digit recognition, the system can achieve 80% accuracy for a specific speaker with only a small amount of adaptation. The system can also be extended to build programs and other related applications. We use Sphinx-4 as the speech recognition kernel. Because Sphinx-4 was designed primarily for researchers, it is a flexible, modular, and pluggable framework. Thanks to this pluggability, we can easily replace the dictionary, grammar, and acoustic model by editing the configuration files. To guide the choice of acoustic model, training data, and adaptation data, we report experimental results on AURORA2, EAT, and a corpus recorded on Android devices for reference.
author2 Chia-Ping Chen
author_facet Chia-Ping Chen
Tsung-peng Yen
顏宗芃
author Tsung-peng Yen
顏宗芃
title Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/58419670260226264357
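The abstract notes that the dictionary, grammar, and acoustic model can be swapped by editing Sphinx-4 configuration files. The following is a minimal sketch of that idea, not the thesis's implementation: it rewrites property values in an XML configuration using only the Python standard library. The sample XML is loosely modelled on Sphinx-4 sample configurations; the component names, property names, file paths, and the helper functions `set_property` and `personalize` are all assumptions made for illustration and may not match the thesis or any particular Sphinx-4 release.

```python
import xml.etree.ElementTree as ET

# Hypothetical Sphinx-4-style configuration. Component names, property names
# and paths are illustrative assumptions, not values taken from the thesis.
SAMPLE_CONFIG = """<config>
  <component name="dictionary" type="edu.cmu.sphinx.linguist.dictionary.FastDictionary">
    <property name="dictionaryPath" value="models/default/dict/default.dict"/>
  </component>
  <component name="jsgfGrammar" type="edu.cmu.sphinx.jsgf.JSGFGrammar">
    <property name="grammarName" value="hello"/>
  </component>
  <component name="acousticModelLoader" type="edu.cmu.sphinx.linguist.acoustic.tiedstate.Sphinx3Loader">
    <property name="location" value="models/default"/>
  </component>
</config>"""


def set_property(root, component, prop, value):
    """Set the value of one property in one named component.

    Returns True if the property was found and updated, False otherwise.
    """
    for comp in root.iter("component"):
        if comp.get("name") != component:
            continue
        for p in comp.iter("property"):
            if p.get("name") == prop:
                p.set("value", value)
                return True
    return False


def personalize(config_xml, overrides):
    """Apply {(component, property): value} overrides and return the new XML."""
    root = ET.fromstring(config_xml)
    for (component, prop), value in overrides.items():
        if not set_property(root, component, prop, value):
            raise KeyError(f"{component}.{prop} not found in configuration")
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Point the recognizer at a (hypothetical) digit dictionary and a
    # speaker-adapted acoustic model directory.
    print(personalize(SAMPLE_CONFIG, {
        ("dictionary", "dictionaryPath"): "models/digits/digits.dict",
        ("acousticModelLoader", "location"): "models/speaker_a",
    }))
```

Keeping the overrides in a single mapping means a per-speaker profile reduces to a small table of paths, while the rest of the configuration stays untouched.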