A Tutorial on Text-Independent Speaker Verification

This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely,...

Full description

Bibliographic Details
Main Authors: Frédéric Bimbot, Jean-François Bonastre, Corinne Fredouille, Guillaume Gravier, Ivan Magrin-Chagnolleau, Sylvain Meignier, Teva Merlin, Javier Ortega-García, Dijana Petrovska-Delacrétaz, Douglas A. Reynolds
Format: Article
Language:English
Published: SpringerOpen 2004-04-01
Series:EURASIP Journal on Advances in Signal Processing
Subjects:
Online Access:http://dx.doi.org/10.1155/S1110865704310024
id doaj-25ca6e24b24746e2bf1765c3bf65355b
record_format Article
spelling doaj-25ca6e24b24746e2bf1765c3bf65355b2020-11-24T23:29:34ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802004-04-012004443045110.1155/S1687617204310024A Tutorial on Text-Independent Speaker VerificationFrédéric BimbotJean-François BonastreCorinne FredouilleGuillaume GravierIvan Magrin-ChagnolleauSylvain MeignierTeva MerlinJavier Ortega-GarcíaDijana Petrovska-DelacrétazDouglas A. ReynoldsThis paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step to deal with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET) curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications relative to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.http://dx.doi.org/10.1155/S1110865704310024speaker verificationtext-independentcepstral analysisGaussian mixture modeling.
collection DOAJ
language English
format Article
sources DOAJ
author Frédéric Bimbot
Jean-François Bonastre
Corinne Fredouille
Guillaume Gravier
Ivan Magrin-Chagnolleau
Sylvain Meignier
Teva Merlin
Javier Ortega-García
Dijana Petrovska-Delacrétaz
Douglas A. Reynolds
spellingShingle Frédéric Bimbot
Jean-François Bonastre
Corinne Fredouille
Guillaume Gravier
Ivan Magrin-Chagnolleau
Sylvain Meignier
Teva Merlin
Javier Ortega-García
Dijana Petrovska-Delacrétaz
Douglas A. Reynolds
A Tutorial on Text-Independent Speaker Verification
EURASIP Journal on Advances in Signal Processing
speaker verification
text-independent
cepstral analysis
Gaussian mixture modeling.
author_facet Frédéric Bimbot
Jean-François Bonastre
Corinne Fredouille
Guillaume Gravier
Ivan Magrin-Chagnolleau
Sylvain Meignier
Teva Merlin
Javier Ortega-García
Dijana Petrovska-Delacrétaz
Douglas A. Reynolds
author_sort Frédéric Bimbot
title A Tutorial on Text-Independent Speaker Verification
title_short A Tutorial on Text-Independent Speaker Verification
title_full A Tutorial on Text-Independent Speaker Verification
title_fullStr A Tutorial on Text-Independent Speaker Verification
title_full_unstemmed A Tutorial on Text-Independent Speaker Verification
title_sort tutorial on text-independent speaker verification
publisher SpringerOpen
series EURASIP Journal on Advances in Signal Processing
issn 1687-6172
1687-6180
publishDate 2004-04-01
description This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step to deal with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET) curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications relative to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.
topic speaker verification
text-independent
cepstral analysis
Gaussian mixture modeling.
url http://dx.doi.org/10.1155/S1110865704310024
work_keys_str_mv AT fredericbimbot atutorialontextindependentspeakerverification
AT jeanfrancoisbonastre atutorialontextindependentspeakerverification
AT corinnefredouille atutorialontextindependentspeakerverification
AT guillaumegravier atutorialontextindependentspeakerverification
AT ivanmagrinchagnolleau atutorialontextindependentspeakerverification
AT sylvainmeignier atutorialontextindependentspeakerverification
AT tevamerlin atutorialontextindependentspeakerverification
AT javierortegagarcia atutorialontextindependentspeakerverification
AT dijanapetrovskadelacretaz atutorialontextindependentspeakerverification
AT douglasareynolds atutorialontextindependentspeakerverification
AT fredericbimbot tutorialontextindependentspeakerverification
AT jeanfrancoisbonastre tutorialontextindependentspeakerverification
AT corinnefredouille tutorialontextindependentspeakerverification
AT guillaumegravier tutorialontextindependentspeakerverification
AT ivanmagrinchagnolleau tutorialontextindependentspeakerverification
AT sylvainmeignier tutorialontextindependentspeakerverification
AT tevamerlin tutorialontextindependentspeakerverification
AT javierortegagarcia tutorialontextindependentspeakerverification
AT dijanapetrovskadelacretaz tutorialontextindependentspeakerverification
AT douglasareynolds tutorialontextindependentspeakerverification
_version_ 1725545005562462208