Refining Segmental Boundaries by Cross-Database Boundary Model
碩士 === 國立交通大學 === 電信工程研究所 === 105 === This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/fy2kj3 |
id |
ndltd-TW-105NCTU5435035 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NCTU54350352019-05-15T23:32:20Z http://ndltd.ncl.edu.tw/handle/fy2kj3 Refining Segmental Boundaries by Cross-Database Boundary Model 跨語料庫之邊界模型對自動化切割的改善 Lai, Jia-Hong 賴佳鴻 碩士 國立交通大學 電信工程研究所 105 This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We got the initial syllable-level boundaries information by HMM-based forced alignment at the first stage, and then introduce boundary model to do post-refinement upon each boundary within a local range at second stage. A small number of utterances were treated as adaptation data for speaker adaptive training of boundary model so that the statistics of model parameters can match that of the test data, which would enhance the segmental refinement. In the experiment, lecture videos and captions from National Chiao Tung University Open Course Website (NCTU OCW) were choosen as the source of target database, while TCC300 training set was used for training GMM-HMM baseline model; Fast brodacast read speech database and part of the OCW training set was used for boundary model training, including background and speaker adaptation. By this, we would develop a highly-automatic syllable-level segmental boundary labeling system. Chen, Sin-Horng 陳信宏 2017 學位論文 ; thesis 62 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電信工程研究所 === 105 === This thesis proposed a 2-stage automatic segmentation method, using database available to train traditional GMM-HMM acoustics model and GMM-based boundary model, aimed for processing syllable-level segmental boundaries of a new target database automatically. We got the initial syllable-level boundaries information by HMM-based forced alignment at the first stage, and then introduce boundary model to do post-refinement upon each boundary within a local range at second stage. A small number of utterances were treated as adaptation data for speaker adaptive training of boundary model so that the statistics of model parameters can match that of the test data, which would enhance the segmental refinement. In the experiment, lecture videos and captions from National Chiao Tung University Open Course Website (NCTU OCW) were choosen as the source of target database, while TCC300 training set was used for training GMM-HMM baseline model; Fast brodacast read speech database and part of the OCW training set was used for boundary model training, including background and speaker adaptation. By this, we would develop a highly-automatic syllable-level segmental boundary labeling system.
|
author2 |
Chen, Sin-Horng |
author_facet |
Chen, Sin-Horng Lai, Jia-Hong 賴佳鴻 |
author |
Lai, Jia-Hong 賴佳鴻 |
spellingShingle |
Lai, Jia-Hong 賴佳鴻 Refining Segmental Boundaries by Cross-Database Boundary Model |
author_sort |
Lai, Jia-Hong |
title |
Refining Segmental Boundaries by Cross-Database Boundary Model |
title_short |
Refining Segmental Boundaries by Cross-Database Boundary Model |
title_full |
Refining Segmental Boundaries by Cross-Database Boundary Model |
title_fullStr |
Refining Segmental Boundaries by Cross-Database Boundary Model |
title_full_unstemmed |
Refining Segmental Boundaries by Cross-Database Boundary Model |
title_sort |
refining segmental boundaries by cross-database boundary model |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/fy2kj3 |
work_keys_str_mv |
AT laijiahong refiningsegmentalboundariesbycrossdatabaseboundarymodel AT làijiāhóng refiningsegmentalboundariesbycrossdatabaseboundarymodel AT laijiahong kuàyǔliàokùzhībiānjièmóxíngduìzìdònghuàqiègēdegǎishàn AT làijiāhóng kuàyǔliàokùzhībiānjièmóxíngduìzìdònghuàqiègēdegǎishàn |
_version_ |
1719149587766181888 |