Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings

<p/> <p>Recent work in <it>blind source separation</it> applied to anechoic mixtures of speech allows for improved reconstruction of sources that rarely overlap in a time-frequency representation. While the assumption that speech mixtures do not overlap significantly in time-...

Full description

Bibliographic Details
Main Authors:	Pardo Bryan, Woodruff John
Format:	Article
Language:	English
Published:	SpringerOpen 2007-01-01
Series:	EURASIP Journal on Advances in Signal Processing
Online Access:	http://asp.eurasipjournals.com/content/2007/086369

id	doaj-ee24c00eefa6467eb0a0d945c4c1b831
record_format	Article
spelling	doaj-ee24c00eefa6467eb0a0d945c4c1b8312020-11-25T00:09:37ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802007-01-0120071086369Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music RecordingsPardo BryanWoodruff John<p/> <p>Recent work in <it>blind source separation</it> applied to anechoic mixtures of speech allows for improved reconstruction of sources that rarely overlap in a time-frequency representation. While the assumption that speech mixtures do not overlap significantly in time-frequency is reasonable, music mixtures rarely meet this constraint, requiring new approaches. We introduce a method that uses spatial cues from anechoic, stereo music recordings and assumptions regarding the structure of musical source signals to effectively separate mixtures of tonal music. We discuss existing techniques to create partial source signal estimates from regions of the mixture where source signals do not overlap significantly. We use these partial signals within a new demixing framework, in which we estimate <it>harmonic masks</it> for each source, allowing the determination of the number of active sources in important time-frequency frames of the mixture. We then propose a method for distributing energy from time-frequency frames of the mixture to multiple source signals. This allows dealing with mixtures that contain time-frequency frames in which multiple harmonic sources are active without requiring knowledge of source characteristics.</p> http://asp.eurasipjournals.com/content/2007/086369
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Pardo Bryan Woodruff John
spellingShingle	Pardo Bryan Woodruff John Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings EURASIP Journal on Advances in Signal Processing
author_facet	Pardo Bryan Woodruff John
author_sort	Pardo Bryan
title	Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings
title_short	Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings
title_full	Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings
title_fullStr	Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings
title_full_unstemmed	Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings
title_sort	using pitch, amplitude modulation, and spatial cues for separation of harmonic instruments from stereo music recordings
publisher	SpringerOpen
series	EURASIP Journal on Advances in Signal Processing
issn	1687-6172 1687-6180
publishDate	2007-01-01
description	<p/> <p>Recent work in <it>blind source separation</it> applied to anechoic mixtures of speech allows for improved reconstruction of sources that rarely overlap in a time-frequency representation. While the assumption that speech mixtures do not overlap significantly in time-frequency is reasonable, music mixtures rarely meet this constraint, requiring new approaches. We introduce a method that uses spatial cues from anechoic, stereo music recordings and assumptions regarding the structure of musical source signals to effectively separate mixtures of tonal music. We discuss existing techniques to create partial source signal estimates from regions of the mixture where source signals do not overlap significantly. We use these partial signals within a new demixing framework, in which we estimate <it>harmonic masks</it> for each source, allowing the determination of the number of active sources in important time-frequency frames of the mixture. We then propose a method for distributing energy from time-frequency frames of the mixture to multiple source signals. This allows dealing with mixtures that contain time-frequency frames in which multiple harmonic sources are active without requiring knowledge of source characteristics.</p>
url	http://asp.eurasipjournals.com/content/2007/086369
work_keys_str_mv	AT pardobryan usingpitchamplitudemodulationandspatialcuesforseparationofharmonicinstrumentsfromstereomusicrecordings AT woodruffjohn usingpitchamplitudemodulationandspatialcuesforseparationofharmonicinstrumentsfromstereomusicrecordings
_version_	1725410810666156032

Using Pitch, Amplitude Modulation, and Spatial Cues for Separation of Harmonic Instruments from Stereo Music Recordings

Similar Items