Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 107 === This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessme...

Full description

Bibliographic Details
Main Authors: Xuan-Bo Chen, 陳宣伯
Other Authors: Jyh-Shing Roger Jang
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/2a8u7x
id ndltd-TW-107NTU05392124
record_format oai_dc
spelling ndltd-TW-107NTU053921242019-11-16T05:28:01Z http://ndltd.ncl.edu.tw/handle/2a8u7x Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning 利用多任務學習模型建立發音特徵來改善華語錯誤發音偵測與診斷之回饋 Xuan-Bo Chen 陳宣伯 碩士 國立臺灣大學 資訊工程學研究所 107 This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessment models to improve mispronunciation detection and return precise articulation feedback to learners. We train a discriminative articulatory model based on time-delay neural networks (TDNNs) with the multi-task learning strategy to give the articulatory score and a TDNN-based acoustic model to give the phonetic score. In testing, the system detects mispronunciations and returns precise articulation feedback based on both the phonetic and articulatory scores. The results of experiments conducted on the MATBN Mandarin Chinese broadcast news corpus show that the proposed models outperform the Gaussian mixture model (GMM)-based and deep neural network (DNN)-based baselines in terms of equal error rate (EER) and diagnostic accuracy (DA). Furthermore, our mispronunciation detection system should work in any language, although the current system focuses on Mandarin. Jyh-Shing Roger Jang 張智星 2019 學位論文 ; thesis 61 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 107 === This paper presents our research on computer assisted pronunciation training (CAPT). We focus on mispronunciation detection and articulation feedback. We propose taking into account the speech attributes, namely place and manner of articulation, in the assessment models to improve mispronunciation detection and return precise articulation feedback to learners. We train a discriminative articulatory model based on time-delay neural networks (TDNNs) with the multi-task learning strategy to give the articulatory score and a TDNN-based acoustic model to give the phonetic score. In testing, the system detects mispronunciations and returns precise articulation feedback based on both the phonetic and articulatory scores. The results of experiments conducted on the MATBN Mandarin Chinese broadcast news corpus show that the proposed models outperform the Gaussian mixture model (GMM)-based and deep neural network (DNN)-based baselines in terms of equal error rate (EER) and diagnostic accuracy (DA). Furthermore, our mispronunciation detection system should work in any language, although the current system focuses on Mandarin.
author2 Jyh-Shing Roger Jang
author_facet Jyh-Shing Roger Jang
Xuan-Bo Chen
陳宣伯
author Xuan-Bo Chen
陳宣伯
spellingShingle Xuan-Bo Chen
陳宣伯
Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
author_sort Xuan-Bo Chen
title Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
title_short Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
title_full Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
title_fullStr Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
title_full_unstemmed Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning
title_sort mandarin mispronunciation detection and diagnosis feedback using articulatory attributes based multi-task learning
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/2a8u7x
work_keys_str_mv AT xuanbochen mandarinmispronunciationdetectionanddiagnosisfeedbackusingarticulatoryattributesbasedmultitasklearning
AT chénxuānbó mandarinmispronunciationdetectionanddiagnosisfeedbackusingarticulatoryattributesbasedmultitasklearning
AT xuanbochen lìyòngduōrènwùxuéxímóxíngjiànlìfāyīntèzhēngláigǎishànhuáyǔcuòwùfāyīnzhēncèyǔzhěnduànzhīhuíkuì
AT chénxuānbó lìyòngduōrènwùxuéxímóxíngjiànlìfāyīntèzhēngláigǎishànhuáyǔcuòwùfāyīnzhēncèyǔzhěnduànzhīhuíkuì
_version_ 1719292681717284864