Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 107 === This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessme...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/2a8u7x |
id |
ndltd-TW-107NTU05392124 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NTU053921242019-11-16T05:28:01Z http://ndltd.ncl.edu.tw/handle/2a8u7x Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning 利用多任務學習模型建立發音特徵來改善華語錯誤發音偵測與診斷之回饋 Xuan-Bo Chen 陳宣伯 碩士 國立臺灣大學 資訊工程學研究所 107 This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessment models to improve mispronunciation detection and return precise articulation feedback to learners. We train a discriminative articulatory model based on time-delay neural networks (TDNNs) with the multi-task learning strategy to give the articulatory score and a TDNN-based acoustic model to give the phonetic score. In testing, the system detects mispronunciations and returns precise articulation feedback based on both the phonetic and articulatory scores. The results of experiments conducted on the MATBN Mandarin Chinese broadcast news corpus show that the proposed models outperform the Gaussian mixture model (GMM)-based and deep neural network (DNN)-based baselines in terms of equal error rate (EER) and diagnostic accuracy (DA). Furthermore, our mispronunciation detection system should work in any language, although the current system focuses on Mandarin. Jyh-Shing Roger Jang 張智星 2019 學位論文 ; thesis 61 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 107 === This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessment models to improve mispronunciation detection and return precise articulation feedback to learners. We train a discriminative articulatory model based on time-delay neural networks (TDNNs) with the multi-task learning strategy to give the articulatory score and a TDNN-based acoustic model to give the phonetic score. In testing, the system detects mispronunciations and returns precise articulation feedback based on both the phonetic and articulatory scores. The results of experiments conducted on the MATBN Mandarin Chinese broadcast news corpus show that the proposed models outperform the Gaussian mixture model (GMM)-based and deep neural network (DNN)-based baselines in terms of equal error rate (EER) and diagnostic accuracy (DA). Furthermore, our mispronunciation detection system should work in any language, although the current system focuses on Mandarin.
|
author2 |
Jyh-Shing Roger Jang |
author_facet |
Jyh-Shing Roger Jang Xuan-Bo Chen 陳宣伯 |
author |
Xuan-Bo Chen 陳宣伯 |
spellingShingle |
Xuan-Bo Chen 陳宣伯 Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
author_sort |
Xuan-Bo Chen |
title |
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
title_short |
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
title_full |
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
title_fullStr |
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
title_full_unstemmed |
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning |
title_sort |
mandarin mispronunciation detection and diagnosis feedback using articulatory attributes based multi-task learning |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/2a8u7x |
work_keys_str_mv |
AT xuanbochen mandarinmispronunciationdetectionanddiagnosisfeedbackusingarticulatoryattributesbasedmultitasklearning AT chénxuānbó mandarinmispronunciationdetectionanddiagnosisfeedbackusingarticulatoryattributesbasedmultitasklearning AT xuanbochen lìyòngduōrènwùxuéxímóxíngjiànlìfāyīntèzhēngláigǎishànhuáyǔcuòwùfāyīnzhēncèyǔzhěnduànzhīhuíkuì AT chénxuānbó lìyòngduōrènwùxuéxímóxíngjiànlìfāyīntèzhēngláigǎishànhuáyǔcuòwùfāyīnzhēncèyǔzhěnduànzhīhuíkuì |
_version_ |
1719292681717284864 |