A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking

Beamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison be...

Full description

Bibliographic Details
Main Author: Caleb Rascon
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/15/5005
id doaj-d162895f5fda4ff8b41d816d3188772b
record_format Article
spelling doaj-d162895f5fda4ff8b41d816d3188772b2021-08-06T15:31:10ZengMDPI AGSensors1424-82202021-07-01215005500510.3390/s21155005A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency MaskingCaleb Rascon0Instituto de Investigaciones en Matematicas Aplicadas y en Sistemas, Universidad Nacional Autonoma de Mexico, Circuito Escolar S/N, Mexico City 04510, MexicoBeamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison between different beamforming techniques. In this paper, five popular beamforming techniques are evaluated in two different acoustic environments, while varying the number of microphones, the number of interferences, and the direction-of-arrival error, by using the Acoustic Interactions for Robot Audition (AIRA) corpus and a common software framework. Additionally, a highly efficient phase-based frequency masking beamformer is also evaluated, which is shown to outperform all five techniques. Both the evaluation corpus and the beamforming implementations are freely available and provided for experiment repeatability and transparency. Raw results are also provided as a complement to this work to the reader, to facilitate an informed decision of which technique to use. Finally, the insights and tendencies observed from the evaluation results are presented.https://www.mdpi.com/1424-8220/21/15/5005evaluationbeamformingAIRAfrequency maskingphase
collection DOAJ
language English
format Article
sources DOAJ
author Caleb Rascon
spellingShingle Caleb Rascon
A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
Sensors
evaluation
beamforming
AIRA
frequency masking
phase
author_facet Caleb Rascon
author_sort Caleb Rascon
title A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_short A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_full A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_fullStr A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_full_unstemmed A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_sort corpus-based evaluation of beamforming techniques and phase-based frequency masking
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2021-07-01
description Beamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison between different beamforming techniques. In this paper, five popular beamforming techniques are evaluated in two different acoustic environments, while varying the number of microphones, the number of interferences, and the direction-of-arrival error, by using the Acoustic Interactions for Robot Audition (AIRA) corpus and a common software framework. Additionally, a highly efficient phase-based frequency masking beamformer is also evaluated, which is shown to outperform all five techniques. Both the evaluation corpus and the beamforming implementations are freely available and provided for experiment repeatability and transparency. Raw results are also provided as a complement to this work to the reader, to facilitate an informed decision of which technique to use. Finally, the insights and tendencies observed from the evaluation results are presented.
topic evaluation
beamforming
AIRA
frequency masking
phase
url https://www.mdpi.com/1424-8220/21/15/5005
work_keys_str_mv AT calebrascon acorpusbasedevaluationofbeamformingtechniquesandphasebasedfrequencymasking
AT calebrascon corpusbasedevaluationofbeamformingtechniquesandphasebasedfrequencymasking
_version_ 1721217631984287744