Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx
Master's === National Sun Yat-sen University === Department of Computer Science and Engineering === 101 === In this thesis, we introduce a system that provides digit speech recognition services. The system is hosted on the Internet, so users can easily access it over the network. Besides the speech recognition service, we also provide...
Main Authors: | Tsung-peng Yen 顏宗芃 |
---|---|
Other Authors: | Chia-Ping Chen 陳嘉平 |
Format: | Others |
Language: | zh-TW |
Published: | 2013 |
Online Access: | http://ndltd.ncl.edu.tw/handle/58419670260226264357 |
id |
ndltd-TW-101NSYS5392069 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NSYS53920692015-10-13T22:40:49Z http://ndltd.ncl.edu.tw/handle/58419670260226264357 Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx 基於 Sphinx 可快速個人化行動語音辨識系統 Tsung-peng Yen 顏宗芃 Master's National Sun Yat-sen University Department of Computer Science and Engineering 101 In this thesis, we introduce a system that provides digit speech recognition services. The system is hosted on the Internet, so users can easily access it over the network. Besides the speech recognition service, we also provide an adaptation function that improves noise robustness across different environments. For English digit recognition, the system can achieve 80% accuracy for a specific speaker using only a small amount of adaptation data. The system can also be extended to build programs and other related applications. We use Sphinx-4 as the speech recognition kernel of our system. Because Sphinx-4 was designed specifically for researchers, it is a flexible, modular, and pluggable framework. Thanks to this pluggability, we can easily replace the dictionary, grammar, and acoustic model by editing the configuration files. To guide the choice of acoustic model, training data, and adaptation data, we provide for reference our experimental results on AURORA2, EAT, and corpus recordings made on an Android device. Chia-Ping Chen 陳嘉平 2013 thesis 49 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Sun Yat-sen University === Department of Computer Science and Engineering === 101 === In this thesis, we introduce a system that provides digit speech recognition
services. The system is hosted on the Internet, so users can easily access it over the
network. Besides the speech recognition service, we also provide an adaptation
function that improves noise robustness across different environments. For English
digit recognition, the system can achieve 80% accuracy for a specific
speaker using only a small amount of adaptation data. The system can also be extended to build programs
and other related applications. We use Sphinx-4 as the speech recognition kernel of our system.
Because Sphinx-4 was designed specifically for researchers, it is a flexible, modular,
and pluggable framework. Thanks to this pluggability, we can easily replace the
dictionary, grammar, and acoustic model by editing the configuration files. To guide
the choice of acoustic model, training data, and adaptation data, we provide
for reference our experimental results on AURORA2, EAT, and corpus recordings made on an
Android device.
|
author2 |
Chia-Ping Chen |
author_facet |
Chia-Ping Chen Tsung-peng Yen 顏宗芃 |
author |
Tsung-peng Yen 顏宗芃 |
spellingShingle |
Tsung-peng Yen 顏宗芃 Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
author_sort |
Tsung-peng Yen |
title |
Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
title_short |
Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
title_full |
Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
title_fullStr |
Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
title_full_unstemmed |
Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx |
title_sort |
quickly personalizable digit mobile speech recognition system based on sphinx |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/58419670260226264357 |
work_keys_str_mv |
AT tsungpengyen quicklypersonalizabledigitmobilespeechrecognitionsystembasedonsphinx AT yánzōngpéng quicklypersonalizabledigitmobilespeechrecognitionsystembasedonsphinx AT tsungpengyen jīyúsphinxkěkuàisùgèrénhuàxíngdòngyǔyīnbiànshíxìtǒng AT yánzōngpéng jīyúsphinxkěkuàisùgèrénhuàxíngdòngyǔyīnbiànshíxìtǒng |
_version_ |
1718079642097680384 |