Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation

Blind Source Separation techniques are widely used in the field of wireless communication for a very long time to extract signals of interest from a set of multiple signals without training data. In this paper, we investigate the problem of separation of the human voice from a mixture of human voice...

Full description

Bibliographic Details
Main Authors:	Ning Yang, Muhammad Usman, Xiangjian He, Mian Ahmad Jan, Liming Zhang
Format:	Article
Language:	English
Published:	IEEE 2017-01-01
Series:	IEEE Access
Subjects:	Blind Source Separation Short Time Fourier Transform OverLap-Add SIR SDR
Online Access:	https://ieeexplore.ieee.org/document/8063868/

id	doaj-6a85c8273354403e87fe695dcec53bcd
record_format	Article
spelling	doaj-6a85c8273354403e87fe695dcec53bcd2021-03-29T20:19:08ZengIEEEIEEE Access2169-35362017-01-015271142712510.1109/ACCESS.2017.27617418063868Time-Frequency Filter Bank: A Simple Approach for Audio and Music SeparationNing Yang0Muhammad Usman1https://orcid.org/0000-0003-2165-4575Xiangjian He2Mian Ahmad Jan3https://orcid.org/0000-0002-5298-1328Liming Zhang4School of Automation, Northwestern Polytechnical University, Xi’an, ChinaSchool of Electrical and Data Engineering, University of Technology Sydney, Ultimo, NSW, AustraliaSchool of Electrical and Data Engineering, University of Technology Sydney, Ultimo, NSW, AustraliaDepartment of Computer Science, Abdul Wali Khan University, Mardan, PakistanSchool of Computer Science, University of Macau, Zhuhai, ChinaBlind Source Separation techniques are widely used in the field of wireless communication for a very long time to extract signals of interest from a set of multiple signals without training data. In this paper, we investigate the problem of separation of the human voice from a mixture of human voice and sounds from different musical instruments. The human voice may be a singing voice in a song or may be a part of some news, broadcast by a channel with background music. This paper proposes a generalized Short Time Fourier Transform (STFT)-based technique, combined with filter bank to extract vocals from background music. The main purpose is to design a filter bank and to eliminate background aliasing errors with best reconstruction conditions, having approximated scaling factors. Stereo signals in time-frequency domain are used in experiments. The input stereo signals are processed in the form of frames and passed through the proposed STFT-based technique. The output of the STFT-based technique is passed through the filter bank to minimize the background aliasing errors. For reconstruction, first an inverse STFT is applied and then the signals are reconstructed by the OverLap-Add method to get the final output, containing vocals only. The experiments show that the proposed approach performs better than the other state-of-the-art approaches, in terms of Signal-to-Interference Ratio (SIR) and Signal-to-Distortion Ratio (SDR), respectively.https://ieeexplore.ieee.org/document/8063868/Blind Source SeparationShort Time Fourier TransformOverLap-AddSIRSDR
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ning Yang Muhammad Usman Xiangjian He Mian Ahmad Jan Liming Zhang
spellingShingle	Ning Yang Muhammad Usman Xiangjian He Mian Ahmad Jan Liming Zhang Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation IEEE Access Blind Source Separation Short Time Fourier Transform OverLap-Add SIR SDR
author_facet	Ning Yang Muhammad Usman Xiangjian He Mian Ahmad Jan Liming Zhang
author_sort	Ning Yang
title	Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation
title_short	Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation
title_full	Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation
title_fullStr	Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation
title_full_unstemmed	Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation
title_sort	time-frequency filter bank: a simple approach for audio and music separation
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2017-01-01
description	Blind Source Separation techniques are widely used in the field of wireless communication for a very long time to extract signals of interest from a set of multiple signals without training data. In this paper, we investigate the problem of separation of the human voice from a mixture of human voice and sounds from different musical instruments. The human voice may be a singing voice in a song or may be a part of some news, broadcast by a channel with background music. This paper proposes a generalized Short Time Fourier Transform (STFT)-based technique, combined with filter bank to extract vocals from background music. The main purpose is to design a filter bank and to eliminate background aliasing errors with best reconstruction conditions, having approximated scaling factors. Stereo signals in time-frequency domain are used in experiments. The input stereo signals are processed in the form of frames and passed through the proposed STFT-based technique. The output of the STFT-based technique is passed through the filter bank to minimize the background aliasing errors. For reconstruction, first an inverse STFT is applied and then the signals are reconstructed by the OverLap-Add method to get the final output, containing vocals only. The experiments show that the proposed approach performs better than the other state-of-the-art approaches, in terms of Signal-to-Interference Ratio (SIR) and Signal-to-Distortion Ratio (SDR), respectively.
topic	Blind Source Separation Short Time Fourier Transform OverLap-Add SIR SDR
url	https://ieeexplore.ieee.org/document/8063868/
work_keys_str_mv	AT ningyang timefrequencyfilterbankasimpleapproachforaudioandmusicseparation AT muhammadusman timefrequencyfilterbankasimpleapproachforaudioandmusicseparation AT xiangjianhe timefrequencyfilterbankasimpleapproachforaudioandmusicseparation AT mianahmadjan timefrequencyfilterbankasimpleapproachforaudioandmusicseparation AT limingzhang timefrequencyfilterbankasimpleapproachforaudioandmusicseparation
_version_	1724194888494350336

Time-Frequency Filter Bank: A Simple Approach for Audio and Music Separation

Similar Items