A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration

Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoid...

Bibliographic Details
Main Authors: Jensen Søren Holdt, Heusdens Richard, Jensen Jesper, van de Par Steven, Kohlrausch Armin
Format: Article
Language: English
Published: SpringerOpen 2005-01-01
Series: EURASIP Journal on Advances in Signal Processing
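Subjects: audio coding; psychoacoustical modelling; auditory masking; spectral masking; sinusoidal modelling; psychoacoustical matching pursuit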
Online Access: http://dx.doi.org/10.1155/ASP.2005.1292
id doaj-08047a2794514aaebd557304d8c73346
record_format Article
spelling doaj-08047a2794514aaebd557304d8c73346 | 2020-11-25T00:20:37Z | eng | SpringerOpen | EURASIP Journal on Advances in Signal Processing | 1687-6172 | 1687-6180 | 2005-01-01 | 20059317529
collection DOAJ
language English
format Article
sources DOAJ
author Jensen Søren Holdt
Heusdens Richard
Jensen Jesper
van de Par Steven
Kohlrausch Armin
title A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration
publisher SpringerOpen
series EURASIP Journal on Advances in Signal Processing
issn 1687-6172
1687-6180
publishDate 2005-01-01
description Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level.
topic audio coding
psychoacoustical modelling
auditory masking
spectral masking
sinusoidal modelling
psychoacoustical matching pursuit
url http://dx.doi.org/10.1155/ASP.2005.1292