A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration
Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoid...
Main Authors: | Jensen Søren Holdt, Heusdens Richard, Jensen Jesper, van de Par Steven, Kohlrausch Armin |
---|---|
Format: | Article |
Language: | English |
Published: | SpringerOpen, 2005-01-01 |
Series: | EURASIP Journal on Advances in Signal Processing |
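Subjects: | audio coding, psychoacoustical modelling, auditory masking, spectral masking, sinusoidal modelling, psychoacoustical matching pursuit |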
Online Access: | http://dx.doi.org/10.1155/ASP.2005.1292 |
id |
doaj-08047a2794514aaebd557304d8c73346 |
---|---|
record_format |
Article |
spelling |
doaj-08047a2794514aaebd557304d8c73346 2020-11-25T00:20:37Z eng SpringerOpen EURASIP Journal on Advances in Signal Processing 1687-6172 1687-6180 2005-01-01 20059317529 A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration Jensen Søren Holdt Heusdens Richard Jensen Jesper van de Par Steven Kohlrausch Armin Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level. http://dx.doi.org/10.1155/ASP.2005.1292 audio coding psychoacoustical modelling auditory masking spectral masking sinusoidal modelling psychoacoustical matching pursuit |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jensen Søren Holdt Heusdens Richard Jensen Jesper van de Par Steven Kohlrausch Armin |
spellingShingle |
Jensen Søren Holdt Heusdens Richard Jensen Jesper van de Par Steven Kohlrausch Armin A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration EURASIP Journal on Advances in Signal Processing audio coding psychoacoustical modelling auditory masking spectral masking sinusoidal modelling psychoacoustical matching pursuit |
author_facet |
Jensen Søren Holdt Heusdens Richard Jensen Jesper van de Par Steven Kohlrausch Armin |
author_sort |
Jensen Søren Holdt |
title |
A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration |
title_short |
A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration |
title_full |
A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration |
title_fullStr |
A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration |
title_full_unstemmed |
A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration |
title_sort |
perceptual model for sinusoidal audio coding based on spectral integration |
publisher |
SpringerOpen |
series |
EURASIP Journal on Advances in Signal Processing |
issn |
1687-6172 1687-6180 |
publishDate |
2005-01-01 |
description |
Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level. |
topic |
audio coding psychoacoustical modelling auditory masking spectral masking sinusoidal modelling psychoacoustical matching pursuit |
url |
http://dx.doi.org/10.1155/ASP.2005.1292 |
work_keys_str_mv |
AT jensens248renholdt aperceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT heusdensrichard aperceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT jensenjesper aperceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT vandeparsteven aperceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT kohlrauscharmin aperceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT jensens248renholdt perceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT heusdensrichard perceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT jensenjesper perceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT vandeparsteven perceptualmodelforsinusoidalaudiocodingbasedonspectralintegration AT kohlrauscharmin perceptualmodelforsinusoidalaudiocodingbasedonspectralintegration |
_version_ |
1725366303974227968 |