A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification

碩士 === 國立交通大學 === 電信工程系 === 87 === Previous work on automatic Chinese-dialect identification using an acoustic-phonotactic model allows the system to differentiate three dialects from each other in a multi-speaker (MS) environment. However, as we extend the task to the speaker-independent (SI) mode,...

Full description

Bibliographic Details
Main Authors: Wan-Hsing Yang, 楊萬興
Other Authors: Wen-Whei Chang
Format: Others
Language:zh-TW
Published: 1999
Online Access:http://ndltd.ncl.edu.tw/handle/84778598399974096028
id ndltd-TW-087NCTU0435104
record_format oai_dc
spelling ndltd-TW-087NCTU04351042016-07-11T04:13:49Z http://ndltd.ncl.edu.tw/handle/84778598399974096028 A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification 語者調適在台灣方言辨識之研究 Wan-Hsing Yang 楊萬興 碩士 國立交通大學 電信工程系 87 Previous work on automatic Chinese-dialect identification using an acoustic-phonotactic model allows the system to differentiate three dialects from each other in a multi-speaker (MS) environment. However, as we extend the task to the speaker-independent (SI) mode, the well-trained identifier suffers from serious degradation due to the mismatch between the training and the testing conditions. In order to overcome this problem, several well-developed solutions such as CMS, spectral transform, MAP, and MLLR were used. However, the experimental results indicate that such speaker compensation schemes developed for speech recognition are less successful. We speculate that the use of speaker compensation may destroy the discriminability of acoustic-phonotactic model. Recognizing this, an acoustic-based VQ-distortion identifier together with codebook adaptation is developed to alleviate the speaker mismatch problem. Simulation results indicate that a VQ-distortion identifier can easily extend to SI system with little degradation. Wen-Whei Chang 張文輝 1999 學位論文 ; thesis 72 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程系 === 87 === Previous work on automatic Chinese-dialect identification using an acoustic-phonotactic model allows the system to differentiate three dialects from each other in a multi-speaker (MS) environment. However, as we extend the task to the speaker-independent (SI) mode, the well-trained identifier suffers from serious degradation due to the mismatch between the training and the testing conditions. In order to overcome this problem, several well-developed solutions such as CMS, spectral transform, MAP, and MLLR were used. However, the experimental results indicate that such speaker compensation schemes developed for speech recognition are less successful. We speculate that the use of speaker compensation may destroy the discriminability of acoustic-phonotactic model. Recognizing this, an acoustic-based VQ-distortion identifier together with codebook adaptation is developed to alleviate the speaker mismatch problem. Simulation results indicate that a VQ-distortion identifier can easily extend to SI system with little degradation.
author2 Wen-Whei Chang
author_facet Wen-Whei Chang
Wan-Hsing Yang
楊萬興
author Wan-Hsing Yang
楊萬興
spellingShingle Wan-Hsing Yang
楊萬興
A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
author_sort Wan-Hsing Yang
title A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
title_short A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
title_full A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
title_fullStr A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
title_full_unstemmed A Study of Speaker Adaptation on Automatic Taiwanese Dialect Identification
title_sort study of speaker adaptation on automatic taiwanese dialect identification
publishDate 1999
url http://ndltd.ncl.edu.tw/handle/84778598399974096028
work_keys_str_mv AT wanhsingyang astudyofspeakeradaptationonautomatictaiwanesedialectidentification
AT yángwànxìng astudyofspeakeradaptationonautomatictaiwanesedialectidentification
AT wanhsingyang yǔzhědiàoshìzàitáiwānfāngyánbiànshízhīyánjiū
AT yángwànxìng yǔzhědiàoshìzàitáiwānfāngyánbiànshízhīyánjiū
AT wanhsingyang studyofspeakeradaptationonautomatictaiwanesedialectidentification
AT yángwànxìng studyofspeakeradaptationonautomatictaiwanesedialectidentification
_version_ 1718343614926422016