Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification

碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a l...

Full description

Bibliographic Details
Main Authors: Zhi-Ren Zeng, 曾志仁
Other Authors: 廖元甫
Format: Others
Language:zh-TW
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/spx574
id ndltd-TW-094TIT05652002
record_format oai_dc
spelling ndltd-TW-094TIT056520022019-06-27T05:08:50Z http://ndltd.ncl.edu.tw/handle/spx574 Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification 結合韻律與聲學特徵分析之強健性語者驗證 Zhi-Ren Zeng 曾志仁 碩士 國立臺北科技大學 電腦與通訊研究所 94 Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems. 廖元甫 2006 學位論文 ; thesis 71 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems.
author2 廖元甫
author_facet 廖元甫
Zhi-Ren Zeng
曾志仁
author Zhi-Ren Zeng
曾志仁
spellingShingle Zhi-Ren Zeng
曾志仁
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
author_sort Zhi-Ren Zeng
title Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_short Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_full Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_fullStr Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_full_unstemmed Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_sort integration of latent prosody analysis and latent acoustics analysis for robust speaker verification
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/spx574
work_keys_str_mv AT zhirenzeng integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification
AT céngzhìrén integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification
AT zhirenzeng jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng
AT céngzhìrén jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng
_version_ 1719209894872088576