Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification
碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a l...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/spx574 |
id |
ndltd-TW-094TIT05652002 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094TIT056520022019-06-27T05:08:50Z http://ndltd.ncl.edu.tw/handle/spx574 Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification 結合韻律與聲學特徵分析之強健性語者驗證 Zhi-Ren Zeng 曾志仁 碩士 國立臺北科技大學 電腦與通訊研究所 94 Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems. 廖元甫 2006 學位論文 ; thesis 71 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 94 === Channel and handset mismatch is the major source of performance degradation for speaker verification in telecommunication environment. This thesis discusses the problem of compensating the channel and handset mismatch with few available train or test data, a latent prosody analyses (LPA) and a latent acoustics analysis (LAA) approaches were proposed and fused together for robust speaker verification. The proposed methods are evaluated on the standard one speaker detection task of the 2001 NIST Speaker Recognition Evaluation Corpus where only one 2-minute training and 30-second trial speech (in average) are available. Experimental results have shown that the proposed LPA+LAA fusion approach could improve the equal error rates (EERs) of MAP-GMMs+MVA+T-norm approaches from 8.9% to 6.6%. Therefore, the proposed approaches are promising and worthy further studying for real-life speaker verification systems.
|
author2 |
廖元甫 |
author_facet |
廖元甫 Zhi-Ren Zeng 曾志仁 |
author |
Zhi-Ren Zeng 曾志仁 |
spellingShingle |
Zhi-Ren Zeng 曾志仁 Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
author_sort |
Zhi-Ren Zeng |
title |
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
title_short |
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
title_full |
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
title_fullStr |
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
title_full_unstemmed |
Integration of Latent Prosody Analysis and Latent Acoustics Analysis for Robust Speaker Verification |
title_sort |
integration of latent prosody analysis and latent acoustics analysis for robust speaker verification |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/spx574 |
work_keys_str_mv |
AT zhirenzeng integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification AT céngzhìrén integrationoflatentprosodyanalysisandlatentacousticsanalysisforrobustspeakerverification AT zhirenzeng jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng AT céngzhìrén jiéhéyùnlǜyǔshēngxuétèzhēngfēnxīzhīqiángjiànxìngyǔzhěyànzhèng |
_version_ |
1719209894872088576 |