Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation

Recently, shift-invariant tensor factorisation algorithms have been proposed for the purposes of sound source separation of pitched musical instruments. However, in practice, existing algorithms require the use of log-frequency spectrograms to allow shift invariance in frequency which causes problem...

Full description

Bibliographic Details
Main Authors: Derry FitzGerald, Matt Cranitch, Eugene Coyle
Format: Article
Language:English
Published: Hindawi Limited 2008-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2008/872425
id doaj-0987db2ec2cd427390fa11eb7349fc68
record_format Article
spelling doaj-0987db2ec2cd427390fa11eb7349fc682020-11-24T21:59:01ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732008-01-01200810.1155/2008/872425872425Extended Nonnegative Tensor Factorisation Models for Musical Sound Source SeparationDerry FitzGerald0Matt Cranitch1Eugene Coyle2Department of Electronic Engineering, Cork Institute of Technology, Cork, IrelandDepartment of Electronic Engineering, Cork Institute of Technology, Cork, IrelandSchool of Electrical Engineering Systems, Dublin Institute of Technology, Kevin Street, Dublin, IrelandRecently, shift-invariant tensor factorisation algorithms have been proposed for the purposes of sound source separation of pitched musical instruments. However, in practice, existing algorithms require the use of log-frequency spectrograms to allow shift invariance in frequency which causes problems when attempting to resynthesise the separated sources. Further, it is difficult to impose harmonicity constraints on the recovered basis functions. This paper proposes a new additive synthesis-based approach which allows the use of linear-frequency spectrograms as well as imposing strict harmonic constraints, resulting in an improved model. Further, these additional constraints allow the addition of a source filter model to the factorisation framework, and an extended model which is capable of separating mixtures of pitched and percussive instruments simultaneously.http://dx.doi.org/10.1155/2008/872425
collection DOAJ
language English
format Article
sources DOAJ
author Derry FitzGerald
Matt Cranitch
Eugene Coyle
spellingShingle Derry FitzGerald
Matt Cranitch
Eugene Coyle
Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
Computational Intelligence and Neuroscience
author_facet Derry FitzGerald
Matt Cranitch
Eugene Coyle
author_sort Derry FitzGerald
title Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
title_short Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
title_full Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
title_fullStr Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
title_full_unstemmed Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation
title_sort extended nonnegative tensor factorisation models for musical sound source separation
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5265
1687-5273
publishDate 2008-01-01
description Recently, shift-invariant tensor factorisation algorithms have been proposed for the purposes of sound source separation of pitched musical instruments. However, in practice, existing algorithms require the use of log-frequency spectrograms to allow shift invariance in frequency which causes problems when attempting to resynthesise the separated sources. Further, it is difficult to impose harmonicity constraints on the recovered basis functions. This paper proposes a new additive synthesis-based approach which allows the use of linear-frequency spectrograms as well as imposing strict harmonic constraints, resulting in an improved model. Further, these additional constraints allow the addition of a source filter model to the factorisation framework, and an extended model which is capable of separating mixtures of pitched and percussive instruments simultaneously.
url http://dx.doi.org/10.1155/2008/872425
work_keys_str_mv AT derryfitzgerald extendednonnegativetensorfactorisationmodelsformusicalsoundsourceseparation
AT mattcranitch extendednonnegativetensorfactorisationmodelsformusicalsoundsourceseparation
AT eugenecoyle extendednonnegativetensorfactorisationmodelsformusicalsoundsourceseparation
_version_ 1725849645530218496