An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition

Although the field of automatic speaker or speech recognition has been extensively studied over the past decades, the lack of robustness has remained a major challenge. Feature warping is a promising approach and its effectiveness significantly depends on the relative positions of each of the featur...

Full description

Bibliographic Details
Main Authors:	Furong Yan, Aidong Men, Bo Yang, Zhuqing Jiang
Format:	Article
Language:	English
Published:	IEEE 2016-01-01
Series:	IEEE Access
Subjects:	Robustness feature warping missing data method ranking feature autocorrelation rank correlation
Online Access:	https://ieeexplore.ieee.org/document/7565581/

id	doaj-eff57210ea394a2b8a41e2769386c185
record_format	Article
spelling	doaj-eff57210ea394a2b8a41e2769386c1852021-03-29T19:45:03ZengIEEEIEEE Access2169-35362016-01-0145258526710.1109/ACCESS.2016.26077787565581An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker RecognitionFurong Yan0https://orcid.org/0000-0002-7969-7204Aidong Men1Bo Yang2Zhuqing Jiang3School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaAlthough the field of automatic speaker or speech recognition has been extensively studied over the past decades, the lack of robustness has remained a major challenge. Feature warping is a promising approach and its effectiveness significantly depends on the relative positions of each of the features in a sliding window. However, the relative positions are changed due to the non-linear effect of noise. Aiming at the problem, this paper takes the advantage of ranking feature, which is obtained directly by sorting a feature sequence in descending order, to propose a method. It first labels the central frame in a sliding window as speech or noise dominant (“reliable” or “unreliable”). In the unreliable case, the ranking of the central frame is estimated. Subsequently, the estimated ranking is mapped to a warped feature using a desired target distribution for recognition experiments. Through the theoretical analysis and experimental results, it is found that autocorrelation of a ranking sequence is larger than that of the corresponding feature sequence. What is more, rank correlation is not easily influenced by abnormal data or data that are highly variable. Thus, this paper deals with a ranking sequence rather than a feature sequence. The proposed feature enhancement approach is evaluated in an open-set speaker recognition system. The experimental results show that it outperforms missing data method based on linear interpolation and feature warping in terms of recognition performance in all noise conditions. Furthermore, the method proposed here is a feature-based method, which may be combined with other technologies, such as model-based, scores-based, to enhance the robustness of speaker or speech recognition system.https://ieeexplore.ieee.org/document/7565581/Robustnessfeature warpingmissing data methodranking featureautocorrelationrank correlation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Furong Yan Aidong Men Bo Yang Zhuqing Jiang
spellingShingle	Furong Yan Aidong Men Bo Yang Zhuqing Jiang An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition IEEE Access Robustness feature warping missing data method ranking feature autocorrelation rank correlation
author_facet	Furong Yan Aidong Men Bo Yang Zhuqing Jiang
author_sort	Furong Yan
title	An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
title_short	An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
title_full	An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
title_fullStr	An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
title_full_unstemmed	An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
title_sort	improved ranking-based feature enhancement approach for robust speaker recognition
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2016-01-01
description	Although the field of automatic speaker or speech recognition has been extensively studied over the past decades, the lack of robustness has remained a major challenge. Feature warping is a promising approach and its effectiveness significantly depends on the relative positions of each of the features in a sliding window. However, the relative positions are changed due to the non-linear effect of noise. Aiming at the problem, this paper takes the advantage of ranking feature, which is obtained directly by sorting a feature sequence in descending order, to propose a method. It first labels the central frame in a sliding window as speech or noise dominant (“reliable” or “unreliable”). In the unreliable case, the ranking of the central frame is estimated. Subsequently, the estimated ranking is mapped to a warped feature using a desired target distribution for recognition experiments. Through the theoretical analysis and experimental results, it is found that autocorrelation of a ranking sequence is larger than that of the corresponding feature sequence. What is more, rank correlation is not easily influenced by abnormal data or data that are highly variable. Thus, this paper deals with a ranking sequence rather than a feature sequence. The proposed feature enhancement approach is evaluated in an open-set speaker recognition system. The experimental results show that it outperforms missing data method based on linear interpolation and feature warping in terms of recognition performance in all noise conditions. Furthermore, the method proposed here is a feature-based method, which may be combined with other technologies, such as model-based, scores-based, to enhance the robustness of speaker or speech recognition system.
topic	Robustness feature warping missing data method ranking feature autocorrelation rank correlation
url	https://ieeexplore.ieee.org/document/7565581/
work_keys_str_mv	AT furongyan animprovedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT aidongmen animprovedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT boyang animprovedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT zhuqingjiang animprovedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT furongyan improvedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT aidongmen improvedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT boyang improvedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition AT zhuqingjiang improvedrankingbasedfeatureenhancementapproachforrobustspeakerrecognition
_version_	1724195781867470848

An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition

Similar Items