Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform

Binaural cue coding (BCC) is an efficient technique for spatial audio rendering that uses side information such as interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Of the side information, the ICTD plays an importan...

Bibliographic Details
Main Authors: Qiu Bo, Xu Yong, Lu Yadong, Yang Jun
Format: Article
Language: English
Published: SpringerOpen 2008-01-01
Series: EURASIP Journal on Audio, Speech, and Music Processing
Online Access: http://asmp.eurasipjournals.com/content/2008/618104
id doaj-08f9641b1954408489f130a3fab13591
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Qiu Bo
Xu Yong
Lu Yadong
Yang Jun
title Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2008-01-01
description Binaural cue coding (BCC) is an efficient technique for spatial audio rendering that uses side information such as interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Of the side information, the ICTD plays an important role in the auditory spatial image. However, inaccurate estimation of the ICTD may degrade audio quality. In this paper, we develop a novel ICTD estimation algorithm based on the nonuniform discrete Fourier transform (NDFT) and integrate it with the BCC approach to improve the decoded auditory image. Furthermore, a new subjective assessment method is proposed for the evaluation of auditory image widths of decoded signals. The test results demonstrate that the NDFT-based scheme can achieve a much wider and more externalized auditory image than the existing BCC scheme based on the discrete Fourier transform (DFT). It is found that the present technique, regardless of the image width, does not deteriorate the sound quality at the decoder compared to the traditional scheme without ICTD estimation.
url http://asmp.eurasipjournals.com/content/2008/618104