Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies
碩士 === 國立清華大學 === 電機工程學系 === 103 === In a real environment, sound sources are coupled to the microphones by convolution with room responses. It is difficult and time-consuming to deal with source separation in the time domain. Existing approaches deal with source separation by converting the mixed s...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/83802685580596926982 |
id |
ndltd-TW-103NTHU5442105 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTHU54421052016-08-15T04:17:34Z http://ndltd.ncl.edu.tw/handle/83802685580596926982 Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies 基於頻率間包絡之相關性解決頻域上聲源分離之排列問題 Li, Huang-Yi 李皇儀 碩士 國立清華大學 電機工程學系 103 In a real environment, sound sources are coupled to the microphones by convolution with room responses. It is difficult and time-consuming to deal with source separation in the time domain. Existing approaches deal with source separation by converting the mixed signals to the time-frequency domain by short-time Fourier transform (STFT). Then, Independent Component Analysis (ICA) is applied in each frequency bin to separate the sources, however, the drawbacks for this particular method were the scaling problem and the permutation problem. Among these two problems, the permutation problem is much more difficult to resolve and it is also the focus of this thesis. Based on the assumption that the correlations should be high between the temporal envelopes of neighboring frequencies from the same sound source, we have developed an algorithm to solve the permutation problem. After solving the scaling problem and the permutation problem, the separated signals are converted to the time domain by inverse short-time Fourier transform (ISTFT) to complete the separation. In experiment 1 to 4, the sound sources were obtained by recording in the room, and by using the steps above to acquire the separated signals. The effectiveness of the algorithm were assessed by subjective and objective measures. From the results of experiment 1 to 4, the sound sources which is labeled as 1-4 are rated by the participants with an average score higher than 4.18 out of 5. In experiment 5, we compared the method from our thesis to the methods from [23] and [25], and the present method improves the source to interferences ratio (SIR) by 3.1 dB. The results of experiments have shown that the method of our thesis was able to effectively solve the permutation problem, and improve the separation performance. Liu, Yi-Wen 劉奕汶 2015 學位論文 ; thesis 68 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 電機工程學系 === 103 === In a real environment, sound sources are coupled to the microphones by convolution with room responses. It is difficult and time-consuming to deal with source separation in the time domain. Existing approaches deal with source separation by converting the mixed signals to the time-frequency domain by short-time Fourier transform (STFT). Then, Independent Component Analysis (ICA) is applied in each frequency bin to separate the sources, however, the drawbacks for this particular method were the scaling problem and the permutation problem. Among these two problems, the permutation problem is much more difficult to resolve and it is also the focus of this thesis. Based on the assumption that the correlations should be high between the temporal envelopes of neighboring frequencies from the same sound source, we have developed an algorithm to solve the permutation problem. After solving the scaling problem and the permutation problem, the separated signals are converted to the time domain by inverse short-time Fourier transform (ISTFT) to complete the separation. In experiment 1 to 4, the sound sources were obtained by recording in the room, and by using the steps above to acquire the separated signals. The effectiveness of the algorithm were assessed by subjective and objective measures. From the results of experiment 1 to 4, the sound sources which is labeled as 1-4 are rated by the participants with an average score higher than 4.18 out of 5. In experiment 5, we compared the method from our thesis to the methods from [23] and [25], and the present method improves the source to interferences ratio (SIR) by 3.1 dB. The results of experiments have shown that the method of our thesis was able to effectively solve the permutation problem, and improve the separation performance.
|
author2 |
Liu, Yi-Wen |
author_facet |
Liu, Yi-Wen Li, Huang-Yi 李皇儀 |
author |
Li, Huang-Yi 李皇儀 |
spellingShingle |
Li, Huang-Yi 李皇儀 Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
author_sort |
Li, Huang-Yi |
title |
Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
title_short |
Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
title_full |
Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
title_fullStr |
Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
title_full_unstemmed |
Solving the Permutation Problem in Frequency Domain Source Separation Based on the Correlation of Envelopes between Frequencies |
title_sort |
solving the permutation problem in frequency domain source separation based on the correlation of envelopes between frequencies |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/83802685580596926982 |
work_keys_str_mv |
AT lihuangyi solvingthepermutationprobleminfrequencydomainsourceseparationbasedonthecorrelationofenvelopesbetweenfrequencies AT lǐhuángyí solvingthepermutationprobleminfrequencydomainsourceseparationbasedonthecorrelationofenvelopesbetweenfrequencies AT lihuangyi jīyúpínlǜjiānbāoluòzhīxiāngguānxìngjiějuépínyùshàngshēngyuánfēnlízhīpáilièwèntí AT lǐhuángyí jīyúpínlǜjiānbāoluòzhīxiāngguānxìngjiějuépínyùshàngshēngyuánfēnlízhīpáilièwèntí |
_version_ |
1718376313187729408 |