Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification
This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utter...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
International Association of Online Engineering (IAOE)
2016-12-01
|
Series: | International Journal of Recent Contributions from Engineering, Science & IT |
Online Access: | http://online-journals.org/index.php/i-jes/article/view/6544 |
id |
doaj-3535bceafa0441c7b025a23aac0c8071 |
---|---|
record_format |
Article |
spelling |
doaj-3535bceafa0441c7b025a23aac0c80712021-09-02T01:51:07ZengInternational Association of Online Engineering (IAOE)International Journal of Recent Contributions from Engineering, Science & IT2197-85812016-12-0144414410.3991/ijes.v4i4.65443005Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker VerificationWalid Hussein0Sarah Akram Essmat1Nestor Yoma2Fernando Huenupán3Faculty of Informatics and Computer Science, The British University in EgyptFaculty of Informatics and Computer Science, The British University in EgyptUniversidad de ChileUniversidad de La Frontera, ChileThis paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utterances such as JFA and i-Vectors. In contrast, VTLN is a speaker compensation scheme that can lead to significant improvements in speech recognition accuracy with just a few seconds of speech samples. A novel scheme to generate new classifiers is employed by incorporating the observation vector sequence compensated with VTLN. The modified sequence of feature vectors and the corresponding warping factors are used to generate classifiers whose scores are combined by a Support Vector Machine (SVM) based SV system. The proposed scheme can provide an average reduction in EER equal to 14% when compared with the baseline system based on the likelihood of observation vectors.http://online-journals.org/index.php/i-jes/article/view/6544 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Walid Hussein Sarah Akram Essmat Nestor Yoma Fernando Huenupán |
spellingShingle |
Walid Hussein Sarah Akram Essmat Nestor Yoma Fernando Huenupán Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification International Journal of Recent Contributions from Engineering, Science & IT |
author_facet |
Walid Hussein Sarah Akram Essmat Nestor Yoma Fernando Huenupán |
author_sort |
Walid Hussein |
title |
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification |
title_short |
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification |
title_full |
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification |
title_fullStr |
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification |
title_full_unstemmed |
Evaluation of the Vocal Tract Length Normalization Based Classifiers for Speaker Verification |
title_sort |
evaluation of the vocal tract length normalization based classifiers for speaker verification |
publisher |
International Association of Online Engineering (IAOE) |
series |
International Journal of Recent Contributions from Engineering, Science & IT |
issn |
2197-8581 |
publishDate |
2016-12-01 |
description |
This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a text-dependent speaker verification (SV) task with short testing utterances. This type of tasks is important in commercial applications and is not easily addressed with methods designed for long utterances such as JFA and i-Vectors. In contrast, VTLN is a speaker compensation scheme that can lead to significant improvements in speech recognition accuracy with just a few seconds of speech samples. A novel scheme to generate new classifiers is employed by incorporating the observation vector sequence compensated with VTLN. The modified sequence of feature vectors and the corresponding warping factors are used to generate classifiers whose scores are combined by a Support Vector Machine (SVM) based SV system. The proposed scheme can provide an average reduction in EER equal to 14% when compared with the baseline system based on the likelihood of observation vectors. |
url |
http://online-journals.org/index.php/i-jes/article/view/6544 |
work_keys_str_mv |
AT walidhussein evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification AT sarahakramessmat evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification AT nestoryoma evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification AT fernandohuenupan evaluationofthevocaltractlengthnormalizationbasedclassifiersforspeakerverification |
_version_ |
1721181665245528064 |