LVCSR Search AlgorithmUsing Reliable Change Point Detection
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 === Basically, the state-of-the-art automatic speech recognition (ASR) systems are based on techniques of dynamic programming and hidden Markov model. There are several crucial issues happening in building desirable ASR performance. Among them, how to reliably det...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/34998477586500190840 |
id |
ndltd-TW-094NCKU5392097 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094NCKU53920972015-12-16T04:32:12Z http://ndltd.ncl.edu.tw/handle/34998477586500190840 LVCSR Search AlgorithmUsing Reliable Change Point Detection 具信賴性斷點偵測之語音搜尋演算法 Tzu-Hsien Chao 趙子賢 碩士 國立成功大學 資訊工程學系碩博士班 94 Basically, the state-of-the-art automatic speech recognition (ASR) systems are based on techniques of dynamic programming and hidden Markov model. There are several crucial issues happening in building desirable ASR performance. Among them, how to reliably detect change points of continuous speech in presence of high co-articulation effect and distortion environments plays a critical role. In the literature, likelihood-ratio (LR) based confidence measure was developed to improve detection performance. This likelihood ratio (LR) criterion could be used to decide the acceptance or rejection for the alignment between speech frames and acoustic models/units. However, in case of spontaneous-style speech, the probabilistic scores in some intervals turn out to be vibrating and confusing. This causes unreliable alignment during search processing for large vocabulary continuous speech recognition (LVCSR). Previously, some methods were presented to detect change points in HMM state level. But, these works should specify empirical detection threshold and were not considered as a direct solution to overcome vibration problems in boundaries of speech units. In this thesis, we present the run test approach to test the randomness of the states of decision probabilistic scores in observation speech sequence. The non-parametric statistics is calculated and used to determine the optimal change point with the best randomness for the states before and after the change point. Through combining this principle and LR criterion, we can sequentially detect change points for building desirable LVCSR search algorithm. In the experiments, we implement and evaluate this approach using TDT2 Mandarin broadcast news corpus. Jen-Tzung Chien 簡仁宗 2006 學位論文 ; thesis 95 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 === Basically, the state-of-the-art automatic speech recognition (ASR) systems are based on techniques of dynamic programming and hidden Markov model. There are several crucial issues happening in building desirable ASR performance. Among them, how to reliably detect change points of continuous speech in presence of high co-articulation effect and distortion environments plays a critical role. In the literature, likelihood-ratio (LR) based confidence measure was developed to improve detection performance. This likelihood ratio (LR) criterion could be used to decide the acceptance or rejection for the alignment between speech frames and acoustic models/units. However, in case of spontaneous-style speech, the probabilistic scores in some intervals turn out to be vibrating and confusing. This causes unreliable alignment during search processing for large vocabulary continuous speech recognition (LVCSR). Previously, some methods were presented to detect change points in HMM state level. But, these works should specify empirical detection threshold and were not considered as a direct solution to overcome vibration problems in boundaries of speech units.
In this thesis, we present the run test approach to test the randomness of the states of decision probabilistic scores in observation speech sequence. The non-parametric statistics is calculated and used to determine the optimal change point with the best randomness for the states before and after the change point. Through combining this principle and LR criterion, we can sequentially detect change points for building desirable LVCSR search algorithm. In the experiments, we implement and evaluate this approach using TDT2 Mandarin broadcast news corpus.
|
author2 |
Jen-Tzung Chien |
author_facet |
Jen-Tzung Chien Tzu-Hsien Chao 趙子賢 |
author |
Tzu-Hsien Chao 趙子賢 |
spellingShingle |
Tzu-Hsien Chao 趙子賢 LVCSR Search AlgorithmUsing Reliable Change Point Detection |
author_sort |
Tzu-Hsien Chao |
title |
LVCSR Search AlgorithmUsing Reliable Change Point Detection |
title_short |
LVCSR Search AlgorithmUsing Reliable Change Point Detection |
title_full |
LVCSR Search AlgorithmUsing Reliable Change Point Detection |
title_fullStr |
LVCSR Search AlgorithmUsing Reliable Change Point Detection |
title_full_unstemmed |
LVCSR Search AlgorithmUsing Reliable Change Point Detection |
title_sort |
lvcsr search algorithmusing reliable change point detection |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/34998477586500190840 |
work_keys_str_mv |
AT tzuhsienchao lvcsrsearchalgorithmusingreliablechangepointdetection AT zhàozixián lvcsrsearchalgorithmusingreliablechangepointdetection AT tzuhsienchao jùxìnlàixìngduàndiǎnzhēncèzhīyǔyīnsōuxúnyǎnsuànfǎ AT zhàozixián jùxìnlàixìngduàndiǎnzhēncèzhīyǔyīnsōuxúnyǎnsuànfǎ |
_version_ |
1718149095107854336 |