[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

The LOCATA challenge: Acoustic source localization and tracking

C Evers, HW Löllmann, H Mellmann… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
The ability to localize and track acoustic events is a fundamental prerequisite for equip**
machines with the ability to be aware of and engage with humans in their surrounding …

Audio-visual cross-attention network for robotic speaker tracking

X Qian, Z Wang, J Wang, G Guan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Audio-visual signals can be used jointly for robotic perception as they complement each
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …

Multiple source direction of arrival estimations using relative sound pressure based MUSIC

Y Hu, TD Abhayapala… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Subspace approach of MUSIC (multiple signal classication) has become one of the most
popular multi-source direction of arrival (DOA) estimations due to its easy implementation in …

Active sensing for search and tracking: A review

L Varotto, A Cenedese, A Cavallaro - ar**ing spherical antenna array with unknown mutual coupling using relative signal pressure based multiple signal …
OJ Famoriji, T Shongwe - IEEE Access, 2022 - ieeexplore.ieee.org
! Spherical antenna array (SAA) is a configuration that scans almost all the radiation sphere
with constant directivity. It finds applications in spacecraft and satellite communication …

Variational bayesian inference for audio-visual tracking of multiple speakers

Y Ban, X Alameda-Pineda, L Girin… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
In this article, we address the problem of tracking multiple speakers via the fusion of visual
and auditory information. We propose to exploit the complementary nature and roles of …

Direction of arrival estimation for reverberant speech based on enhanced decomposition of the direct sound

L Madmoni, B Rafaely - IEEE Journal of Selected Topics in …, 2018 - ieeexplore.ieee.org
Direction of arrival (DOA) estimation for speech sources is an important task in audio signal
processing. This task becomes a challenge in reverberant environments, which are typical to …

Enhancing direct‐path relative transfer function using deep neural network for robust sound source localization

B Yang, R Ding, Y Ban, X Li… - CAAI Transactions on …, 2022 - Wiley Online Library
This article proposes a deep neural network (DNN)‐based direct‐path relative transfer
function (DP‐RTF) enhancement method for robust direction of arrival (DOA) estimation in …