Speech Evaluation
Master's === National Tsing Hua University === Department of Computer Science === 90 === This thesis discusses several methods of speech evaluation, the study of computer-based evaluation of speech content, fluency, and intonation. It requires techniques from audio signal processing and speech recognition. In order to develop an appropriate and c...
Main Authors: | Chun-Yi Lee, 李俊毅 |
---|---|
Other Authors: | Jyh-Shing Roger Jang |
Format: | Others |
Language: | zh-TW |
Published: | 2002 |
Online Access: | http://ndltd.ncl.edu.tw/handle/94181476992573057078 |
id |
ndltd-TW-090NTHU0392038 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-090NTHU0392038 2015-10-13T10:34:06Z http://ndltd.ncl.edu.tw/handle/94181476992573057078 Speech Evaluation 語音評分 Chun-Yi Lee 李俊毅 Master's National Tsing Hua University Department of Computer Science 90 Jyh-Shing Roger Jang 張智星 2002 thesis 55 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Tsing Hua University === Department of Computer Science === 90 === This thesis discusses several methods of speech evaluation, the study of computer-based evaluation of speech content, fluency, and intonation. It requires techniques from audio signal processing and speech recognition. To develop an appropriate and consistent speech evaluation system, we define several useful speech features and perform several experiments on feature matching methods. The thesis has two parts: “Evaluation using standard speech” and “Evaluation using HMM and pitch contour”.
“Evaluation using standard speech” evaluates the similarity between a test utterance and the corresponding standard utterance. We use various approaches for speech feature extraction, pattern matching, and similarity computation. In particular, we use the magnitude contour, the pitch contour, and mel-frequency cepstral coefficients (MFCCs) as the features that generate a similarity score. Magnitude contours represent variations in volume. Pitch contours represent variations in pitch. MFCCs represent the content of the speech.
“Evaluation using HMM and pitch contour” is another speech evaluation paradigm, one that does not require a standard utterance. Instead, we evaluate a test utterance based on its similarity to hidden Markov models (HMMs) and tone models. Viterbi decoding is used to segment each character in a continuous sentence. The score of each character is then computed from the ranking among the 411 possible syllables and from a tone recognition system.
|
author2 |
Jyh-Shing Roger Jang |
author_facet |
Jyh-Shing Roger Jang Chun-Yi Lee 李俊毅 |
author |
Chun-Yi Lee 李俊毅 |
spellingShingle |
Chun-Yi Lee 李俊毅 Speech Evaluation |
author_sort |
Chun-Yi Lee |
title |
Speech Evaluation |
title_short |
Speech Evaluation |
title_full |
Speech Evaluation |
title_fullStr |
Speech Evaluation |
title_full_unstemmed |
Speech Evaluation |
title_sort |
speech evaluation |
publishDate |
2002 |
url |
http://ndltd.ncl.edu.tw/handle/94181476992573057078 |
work_keys_str_mv |
AT chunyilee speechevaluation AT lǐjùnyì speechevaluation AT chunyilee yǔyīnpíngfēn AT lǐjùnyì yǔyīnpíngfēn |
_version_ |
1716829133134102529 |
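
The abstract's first part, “Evaluation using standard speech”, compares feature sequences (magnitude contour, pitch contour, MFCCs) of a test utterance against those of a standard utterance. The record does not say which pattern matching method the thesis uses, so the sketch below assumes dynamic time warping (DTW), a common choice for aligning sequences of different lengths. The function names, the Euclidean frame distance, the length normalization, and the mapping from distance to a 0-100 score are all illustrative assumptions, not the thesis's formulas.

```python
import numpy as np


def dtw_distance(test, standard):
    """Length-normalized DTW distance between two non-empty feature sequences.

    Each sequence is either a 1-D contour (pitch or magnitude per frame)
    or a 2-D array of per-frame vectors (for example MFCC frames).
    """
    a = np.asarray(test, dtype=float)
    b = np.asarray(standard, dtype=float)
    a = a.reshape(len(a), -1)  # (frames, dims); 1-D contours become (n, 1)
    b = b.reshape(len(b), -1)
    n, m = len(a), len(b)

    # cost[i, j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])  # Euclidean frame distance
            cost[i, j] = d + min(cost[i - 1, j],      # test frame repeated
                                 cost[i, j - 1],      # standard frame repeated
                                 cost[i - 1, j - 1])  # both advance
    return cost[n, m] / (n + m)


def similarity_score(test, standard, scale=1.0):
    """Map a DTW distance to a score in (0, 100]; `scale` is a hypothetical tuning knob."""
    return 100.0 / (1.0 + dtw_distance(test, standard) / scale)
```

For example, `similarity_score([1, 2, 3], [1, 2, 2, 3])` gives 100.0 because the two contours differ only in timing, which DTW absorbs. A full system would compute one such score per feature type and combine them; how the thesis weights the three features is not stated in this record.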
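
The abstract's second part, “Evaluation using HMM and pitch contour”, scores each character by how the intended syllable ranks among 411 candidate syllables. The record does not give the scoring formula, so the following is only a minimal sketch under stated assumptions: each candidate syllable model is assumed to have produced a log-likelihood for the already segmented character, and the rank is mapped linearly to a 0-100 score. The names `syllable_rank` and `rank_based_score`, the dictionary input, and the linear mapping are hypothetical. Viterbi segmentation and the tone recognition part are omitted.

```python
def syllable_rank(log_likelihoods, target):
    """Return the 1-based rank of `target` among all candidate syllables.

    `log_likelihoods` maps each candidate syllable to the log-likelihood
    its model assigned to the segment (assumed to be computed elsewhere).
    """
    target_ll = log_likelihoods[target]
    # Rank = 1 + number of candidates that fit the segment strictly better.
    return 1 + sum(1 for ll in log_likelihoods.values() if ll > target_ll)


def rank_based_score(log_likelihoods, target):
    """Linearly map the rank to [0, 100]: rank 1 gives 100, last rank gives 0."""
    total = len(log_likelihoods)
    if total < 2:
        return 100.0  # a single candidate is trivially ranked first
    rank = syllable_rank(log_likelihoods, target)
    return 100.0 * (1.0 - (rank - 1) / (total - 1))


# Toy example with three candidates instead of the 411 in the abstract.
example = {"ba": -10.0, "pa": -12.0, "ma": -15.0}
print(rank_based_score(example, "pa"))
```

In a setup like the one the abstract describes, the dictionary would hold one entry per candidate syllable, and the resulting value would be combined with the output of the tone recognizer.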