Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification

This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utter...

Full description

Bibliographic Details
Main Authors: Walid Hussein, Sarah Akram Essmat, Nestor Yoma, Fernando Huenupán
Format: Article
Language:English
Published: International Association of Online Engineering (IAOE) 2016-12-01
Series:International Journal of Recent Contributions from Engineering, Science & IT
Online Access:http://online-journals.org/index.php/i-jes/article/view/6544
id doaj-3535bceafa0441c7b025a23aac0c8071
record_format Article
spelling doaj-3535bceafa0441c7b025a23aac0c80712021-09-02T01:51:07ZengInternational Association of Online Engineering (IAOE)International Journal of Recent Contributions from Engineering, Science & IT2197-85812016-12-0144414410.3991/ijes.v4i4.65443005Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker VerificationWalid Hussein0Sarah Akram Essmat1Nestor Yoma2Fernando Huenupán3Faculty of Informatics and Computer Science, The British University in EgyptFaculty of Informatics and Computer Science, The British University in EgyptUniversidad de ChileUniversidad de La Frontera, ChileThis paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utterances such as JFA and i-Vectors. In contrast, VTLN is a speaker compensation scheme that can lead to significant improvements in speech recognition accuracy with just a few seconds of speech samples. A novel scheme to generate new classifiers is employed by incorporating the observation vector sequence compensated with VTLN. The modified sequence of feature vectors and the corresponding warping factors are used to generate classifiers whose scores are combined by a Support Vector Machine (SVM) based SV system. The proposed scheme can provide an average reduction in EER equal to 14% when compared with the baseline system based on the likelihood of observation vectors.http://online-journals.org/index.php/i-jes/article/view/6544
collection DOAJ
language English
format Article
sources DOAJ
author Walid Hussein
Sarah Akram Essmat
Nestor Yoma
Fernando Huenupán
spellingShingle Walid Hussein
Sarah Akram Essmat
Nestor Yoma
Fernando Huenupán
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
International Journal of Recent Contributions from Engineering, Science & IT
author_facet Walid Hussein
Sarah Akram Essmat
Nestor Yoma
Fernando Huenupán
author_sort Walid Hussein
title Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
title_short Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
title_full Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
title_fullStr Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
title_full_unstemmed Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
title_sort evaluation of the vocal tract length normalization based classifiers for speaker verification
publisher International Association of Online Engineering (IAOE)
series International Journal of Recent Contributions from Engineering, Science & IT
issn 2197-8581
publishDate 2016-12-01
description This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utterances such as JFA and i-Vectors. In contrast, VTLN is a speaker compensation scheme that can lead to significant improvements in speech recognition accuracy with just a few seconds of speech samples. A novel scheme to generate new classifiers is employed by incorporating the observation vector sequence compensated with VTLN. The modified sequence of feature vectors and the corresponding warping factors are used to generate classifiers whose scores are combined by a Support Vector Machine (SVM) based SV system. The proposed scheme can provide an average reduction in EER equal to 14% when compared with the baseline system based on the likelihood of observation vectors.
url http://online-journals.org/index.php/i-jes/article/view/6544
work_keys_str_mv AT walidhussein evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification
AT sarahakramessmat evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification
AT nestoryoma evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification
AT fernandohuenupan evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification
_version_ 1721181665245528064