Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform
Binaural cue coding (BCC) is an efficient technique for spatial audio rendering by using the side information such as interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Of the side information, the ICTD plays an importan...
Main Authors: | Qiu Bo, Xu Yong, Lu Yadong, Yang Jun |
---|---|
Format: | Article |
Language: | English |
Published: | SpringerOpen 2008-01-01 |
Series: | EURASIP Journal on Audio, Speech, and Music Processing |
Online Access: | http://asmp.eurasipjournals.com/content/2008/618104 |
id | doaj-08f9641b1954408489f130a3fab13591 |
---|---|
record_format | Article |
spelling | doaj-08f9641b1954408489f130a3fab135912020-11-25T01:15:21ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222008-01-0120081618104Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier TransformQiu BoXu YongLu YadongYang Jun Binaural cue coding (BCC) is an efficient technique for spatial audio rendering by using the side information such as interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Of the side information, the ICTD plays an important role to the auditory spatial image. However, inaccurate estimation of the ICTD may lead to the audio quality degradation. In this paper, we develop a novel ICTD estimation algorithm based on the nonuniform discrete Fourier transform (NDFT) and integrate it with the BCC approach to improve the decoded auditory image. Furthermore, a new subjective assessment method is proposed for the evaluation of auditory image widths of decoded signals. The test results demonstrate that the NDFT-based scheme can achieve much wider and more externalized auditory image than the existing BCC scheme based on the discrete Fourier transform (DFT). It is found that the present technique, regardless of the image width, does not deteriorate the sound quality at the decoder compared to the traditional scheme without ICTD estimation. http://asmp.eurasipjournals.com/content/2008/618104 |
collection | DOAJ |
language | English |
format | Article |
sources | DOAJ |
author | Qiu Bo Xu Yong Lu Yadong Yang Jun |
spellingShingle | Qiu Bo Xu Yong Lu Yadong Yang Jun Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform EURASIP Journal on Audio, Speech, and Music Processing |
author_facet | Qiu Bo Xu Yong Lu Yadong Yang Jun |
author_sort | Qiu Bo |
title | Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform |
title_short | Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform |
title_full | Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform |
title_fullStr | Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform |
title_full_unstemmed | Estimation of Interchannel Time Difference in Frequency Subbands Based on Nonuniform Discrete Fourier Transform |
title_sort | estimation of interchannel time difference in frequency subbands based on nonuniform discrete fourier transform |
publisher | SpringerOpen |
series | EURASIP Journal on Audio, Speech, and Music Processing |
issn | 1687-4714 1687-4722 |
publishDate | 2008-01-01 |
description | Binaural cue coding (BCC) is an efficient technique for spatial audio rendering by using the side information such as interchannel level difference (ICLD), interchannel time difference (ICTD), and interchannel correlation (ICC). Of the side information, the ICTD plays an important role to the auditory spatial image. However, inaccurate estimation of the ICTD may lead to the audio quality degradation. In this paper, we develop a novel ICTD estimation algorithm based on the nonuniform discrete Fourier transform (NDFT) and integrate it with the BCC approach to improve the decoded auditory image. Furthermore, a new subjective assessment method is proposed for the evaluation of auditory image widths of decoded signals. The test results demonstrate that the NDFT-based scheme can achieve much wider and more externalized auditory image than the existing BCC scheme based on the discrete Fourier transform (DFT). It is found that the present technique, regardless of the image width, does not deteriorate the sound quality at the decoder compared to the traditional scheme without ICTD estimation. |
url | http://asmp.eurasipjournals.com/content/2008/618104 |
work_keys_str_mv | AT qiubo estimationofinterchanneltimedifferenceinfrequencysubbandsbasedonnonuniformdiscretefouriertransform AT xuyong estimationofinterchanneltimedifferenceinfrequencysubbandsbasedonnonuniformdiscretefouriertransform AT luyadong estimationofinterchanneltimedifferenceinfrequencysubbandsbasedonnonuniformdiscretefouriertransform AT yangjun estimationofinterchanneltimedifferenceinfrequencysubbandsbasedonnonuniformdiscretefouriertransform |
_version_ | 1725153714654674944 |