Refining Segmental Boundaries by Cross-Database Boundary Model

碩士 === 國立交通大學 === 電信工程研究所 === 105 === This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We...

Full description

Bibliographic Details
Main Authors: Lai, Jia-Hong, 賴佳鴻
Other Authors: Chen, Sin-Horng
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/fy2kj3
id ndltd-TW-105NCTU5435035
record_format oai_dc
spelling ndltd-TW-105NCTU54350352019-05-15T23:32:20Z http://ndltd.ncl.edu.tw/handle/fy2kj3 Refining Segmental Boundaries by Cross-Database Boundary Model 跨語料庫之邊界模型對自動化切割的改善 Lai, Jia-Hong 賴佳鴻 碩士 國立交通大學 電信工程研究所 105 This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We got the initial syllable-level boundaries information by HMM-based forced alignment at the first stage, and then introduce boundary model to do post-refinement upon each boundary within a local range at second stage. A small number of utterances were treated as adaptation data for speaker adaptive training of boundary model so that the statistics of model parameters can match that of the test data, which would enhance the segmental refinement. In the experiment, lecture videos and captions from National Chiao Tung University Open Course Website (NCTU OCW) were choosen as the source of target database, while TCC300 training set was used for training GMM-HMM baseline model; Fast brodacast read speech database and part of the OCW training set was used for boundary model training, including background and speaker adaptation. By this, we would develop a highly-automatic syllable-level segmental boundary labeling system. Chen, Sin-Horng 陳信宏 2017 學位論文 ; thesis 62 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程研究所 === 105 === This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We got the initial syllable-level boundaries information by HMM-based forced alignment at the first stage, and then introduce boundary model to do post-refinement upon each boundary within a local range at second stage. A small number of utterances were treated as adaptation data for speaker adaptive training of boundary model so that the statistics of model parameters can match that of the test data, which would enhance the segmental refinement. In the experiment, lecture videos and captions from National Chiao Tung University Open Course Website (NCTU OCW) were choosen as the source of target database, while TCC300 training set was used for training GMM-HMM baseline model; Fast brodacast read speech database and part of the OCW training set was used for boundary model training, including background and speaker adaptation. By this, we would develop a highly-automatic syllable-level segmental boundary labeling system.
author2 Chen, Sin-Horng
author_facet Chen, Sin-Horng
Lai, Jia-Hong
賴佳鴻
author Lai, Jia-Hong
賴佳鴻
spellingShingle Lai, Jia-Hong
賴佳鴻
Refining Segmental Boundaries by Cross-Database Boundary Model
author_sort Lai, Jia-Hong
title Refining Segmental Boundaries by Cross-Database Boundary Model
title_short Refining Segmental Boundaries by Cross-Database Boundary Model
title_full Refining Segmental Boundaries by Cross-Database Boundary Model
title_fullStr Refining Segmental Boundaries by Cross-Database Boundary Model
title_full_unstemmed Refining Segmental Boundaries by Cross-Database Boundary Model
title_sort refining segmental boundaries by cross-database boundary model
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/fy2kj3
work_keys_str_mv AT laijiahong refiningsegmentalboundariesbycrossdatabaseboundarymodel
AT làijiāhóng refiningsegmentalboundariesbycrossdatabaseboundarymodel
AT laijiahong kuàyǔliàokùzhībiānjièmóxíngduìzìdònghuàqiègēdegǎishàn
AT làijiāhóng kuàyǔliàokùzhībiānjièmóxíngduìzìdònghuàqiègēdegǎishàn
_version_ 1719149587766181888