Permutation Correction in the Frequency Domain in Blind Separation of Speech Mixtures

<p/> <p>This paper presents a method for blind separation of convolutive mixtures of speech signals, based on the joint diagonalization of the time varying spectral matrices of the observation records. The main and still largely open problem in a frequency domain approach is permutation...

Full description

Bibliographic Details
Main Authors:	Pham DT, Servière Ch
Format:	Article
Language:	English
Published:	SpringerOpen 2006-01-01
Series:	EURASIP Journal on Advances in Signal Processing
Online Access:	http://dx.doi.org/10.1155/ASP/2006/75206

Description
Summary:	<p/> <p>This paper presents a method for blind separation of convolutive mixtures of speech signals, based on the joint diagonalization of the time varying spectral matrices of the observation records. The main and still largely open problem in a frequency domain approach is permutation ambiguity. In an earlier paper of the authors, the continuity of the frequency response of the unmixing filters is exploited, but it leaves some frequency permutation jumps. This paper therefore proposes a new method based on two assumptions. The frequency continuity of the unmixing filters is still used in the initialization of the diagonalization algorithm. Then, the paper introduces a new method based on the time-frequency representations of the sources. They are assumed to vary smoothly with frequency. This hypothesis of the continuity of the time variation of the source energy is exploited on a sliding frequency bandwidth. It allows us to detect the remaining frequency permutation jumps. The method is compared with other approaches and results on real world recordings demonstrate superior performances of the proposed algorithm.</p>
ISSN:	1687-6172 1687-6180

Permutation Correction in the Frequency Domain in Blind Separation of Speech Mixtures

Similar Items