Summary: | The accuracy performance of traditional direction of arrival (DOA) estimation algorithms is seriously affected by the reverberation. Considering the advantage of the sparse characteristic of speech signal in time-frequency (T-F) domain, this paper presents a new blind DOA estimation method based on integrated deep learning and convolutional non-negative matrix factorization (NMF). Firstly, mathematic models of microphone array and room impulse response are built. In addition, we extracted blindly initialization parameters of 2-D convolutional NMF using k-means clustering algorithm and singular value decomposition algorithm, which can be used to accurately estimate the main components of desired sound source in the reverberation environment of multi-path propagation. Moreover, the feedback mechanism is introduced into deep 2-D convolutional NMF and correlation coefficient between the signal decomposed by NMF and the signal to be decomposed is used to select the best separated signal for DOA estimation, which make the separation algorithm simpler and more efficient. Finally, test of orthogonality of projected subspaces (TOPS) algorithm is used to validate the DOA estimation capability of this algorithm. Compared with the unprocessed reverberation speech, the estimation error is reduced, which shows that the proposed algorithm can effectively improve the estimation accuracy of DOA estimation when the received signals are in a reverberant environment.
|