Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification

碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a l...

Full description

Bibliographic Details
Main Authors:	Zhi-Ren Zeng, 曾志仁
Other Authors:	廖元甫
Format:	Others
Language:	zh-TW
Published:	2006
Online Access:	http://ndltd.ncl.edu.tw/handle/spx574

id	ndltd-TW-094TIT05652002
record_format	oai_dc
spelling	ndltd-TW-094TIT056520022019-06-27T05:08:50Z http://ndltd.ncl.edu.tw/handle/spx574 Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification 結合韻律與聲學特徵分析之強健性語者驗證 Zhi-Ren Zeng 曾志仁碩士國立臺北科技大學電腦與通訊研究所 94 Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems. 廖元甫 2006 學位論文 ; thesis 71 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems.
author2	廖元甫
author_facet	廖元甫 Zhi-Ren Zeng 曾志仁
author	Zhi-Ren Zeng 曾志仁
spellingShingle	Zhi-Ren Zeng 曾志仁 Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
author_sort	Zhi-Ren Zeng
title	Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_short	Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_full	Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_fullStr	Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_full_unstemmed	Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
title_sort	integration of latent prosody analysis and latent acoustics analysis for robust speaker verification
publishDate	2006
url	http://ndltd.ncl.edu.tw/handle/spx574
work_keys_str_mv	AT zhirenzeng integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification AT céngzhìrén integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification AT zhirenzeng jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng AT céngzhìrén jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng
_version_	1719209894872088576

Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification

Similar Items