Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization
The Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8995577/ |
id |
doaj-aaaff5dbd8df498ea6cefde427e47c23 |
---|---|
record_format |
Article |
spelling |
doaj-aaaff5dbd8df498ea6cefde427e47c232021-03-30T02:05:10ZengIEEEIEEE Access2169-35362020-01-018349423494810.1109/ACCESS.2020.29725118995577Adaptive Speaker Recognition Based on Hidden Markov Model Parameter OptimizationYangjie Wei0https://orcid.org/0000-0001-6615-5484College of Computer Science and Engineering, Northeastern University, Shenyang, ChinaThe Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and related quantitative methods are rarely used for adaptively acquiring them. In this paper, a quantitative method for adaptively selecting the optimal composite order and the optimal number of re-evaluations is proposed based on theoretical analysis and experimental results. First, the standard deviation (SD) is introduced to calculate the recognition rate considering its relationship with Mel frequency cepstrum coefficients (MFCCs) dimension, then the composite order is optimized according to its relationship curve with the SD. Second, the composited measurement probability with different number of re-evaluations is calculated and the number of re-evaluations is optimized when a convergence condition is satisfied. Experiments show that the recognition rate with the optimal composite order obtained in this paper is 97.02%, and the recognition rate with the optimal number of re-evaluations is 98.9%.https://ieeexplore.ieee.org/document/8995577/Speaker recognitionGaussian composite orderre-evaluationparameter optimization |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yangjie Wei |
spellingShingle |
Yangjie Wei Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization IEEE Access Speaker recognition Gaussian composite order re-evaluation parameter optimization |
author_facet |
Yangjie Wei |
author_sort |
Yangjie Wei |
title |
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization |
title_short |
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization |
title_full |
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization |
title_fullStr |
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization |
title_full_unstemmed |
Adaptive Speaker Recognition Based on Hidden Markov Model Parameter Optimization |
title_sort |
adaptive speaker recognition based on hidden markov model parameter optimization |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The Hidden Markov Model (HMM) is a widely used method for speaker recognition. During its training, the composite order of the measurement probability matrix and the number of re-evaluations of the initial model affect the speed and accuracy of a recognition system. However, theoretical analysis and related quantitative methods are rarely used for adaptively acquiring them. In this paper, a quantitative method for adaptively selecting the optimal composite order and the optimal number of re-evaluations is proposed based on theoretical analysis and experimental results. First, the standard deviation (SD) is introduced to calculate the recognition rate considering its relationship with Mel frequency cepstrum coefficients (MFCCs) dimension, then the composite order is optimized according to its relationship curve with the SD. Second, the composited measurement probability with different number of re-evaluations is calculated and the number of re-evaluations is optimized when a convergence condition is satisfied. Experiments show that the recognition rate with the optimal composite order obtained in this paper is 97.02%, and the recognition rate with the optimal number of re-evaluations is 98.9%. |
topic |
Speaker recognition Gaussian composite order re-evaluation parameter optimization |
url |
https://ieeexplore.ieee.org/document/8995577/ |
work_keys_str_mv |
AT yangjiewei adaptivespeakerrecognitionbasedonhiddenmarkovmodelparameteroptimization |
_version_ |
1724185882608533504 |