Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization

The Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and...

Full description

Bibliographic Details
Main Author: Yangjie Wei
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8995577/
id doaj-aaaff5dbd8df498ea6cefde427e47c23
record_format Article
spelling doaj-aaaff5dbd8df498ea6cefde427e47c232021-03-30T02:05:10ZengIEEEIEEE Access2169-35362020-01-018349423494810.1109/ACCESS.2020.29725118995577Adaptive Speaker Recognition Based on Hidden Markov Model Parameter OptimizationYangjie Wei0https://orcid.org/0000-0001-6615-5484College of Computer Science and Engineering, Northeastern University, Shenyang, ChinaThe Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and related quantitative methods are rarely used for adaptively acquiring them. In this paper, a quantitative method for adaptively selecting the optimal composite order and the optimal number of re-evaluations is proposed based on theoretical analysis and experimental results. First, the standard deviation (SD) is introduced to calculate the recognition rate considering its relationship with Mel frequency cepstrum coefficients (MFCCs) dimension, then the composite order is optimized according to its relationship curve with the SD. Second, the composited measurement probability with different number of re-evaluations is calculated and the number of re-evaluations is optimized when a convergence condition is satisfied. Experiments show that the recognition rate with the optimal composite order obtained in this paper is 97.02%, and the recognition rate with the optimal number of re-evaluations is 98.9%.https://ieeexplore.ieee.org/document/8995577/Speaker recognitionGaussian composite orderre-evaluationparameter optimization
collection DOAJ
language English
format Article
sources DOAJ
author Yangjie Wei
spellingShingle Yangjie Wei
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
IEEE Access
Speaker recognition
Gaussian composite order
re-evaluation
parameter optimization
author_facet Yangjie Wei
author_sort Yangjie Wei
title Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
title_short Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
title_full Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
title_fullStr Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
title_full_unstemmed Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
title_sort adaptive speaker recognition based on hidden markov model parameter optimization
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and related quantitative methods are rarely used for adaptively acquiring them. In this paper, a quantitative method for adaptively selecting the optimal composite order and the optimal number of re-evaluations is proposed based on theoretical analysis and experimental results. First, the standard deviation (SD) is introduced to calculate the recognition rate considering its relationship with Mel frequency cepstrum coefficients (MFCCs) dimension, then the composite order is optimized according to its relationship curve with the SD. Second, the composited measurement probability with different number of re-evaluations is calculated and the number of re-evaluations is optimized when a convergence condition is satisfied. Experiments show that the recognition rate with the optimal composite order obtained in this paper is 97.02%, and the recognition rate with the optimal number of re-evaluations is 98.9%.
topic Speaker recognition
Gaussian composite order
re-evaluation
parameter optimization
url https://ieeexplore.ieee.org/document/8995577/
work_keys_str_mv AT yangjiewei adaptivespeakerrecognitionbasedonhiddenmarkovmodelparameteroptimization
_version_ 1724185882608533504