[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Binaural sound source distance estimation and localization for a moving listener

DA Krause, G García-Barrios, A Politis… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we investigate the tasks of binaural source distance estimation (SDE) and
direction-of-arrival estimation (DOAE) using motion-based cues in a scenario with a walking …

Sound event detection and localization with distance estimation

DA Krause, A Politis, A Mesaros - 2024 32nd European Signal …, 2024 - ieeexplore.ieee.org
Sound Event Detection and Localization (SELD) is a combined task of identifying sound
events and their corresponding direction-of-arrival (DOA). While this task has numerous …

Direction of arrival estimation of sound sources using icosahedral CNNs

D Diaz-Guerra, A Miguel… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
In this paper, we present a new model for Direction of Arrival (DOA) estimation of sound
sources based on an Icosahedral Convolutional Neural Network (CNN) applied over SRP …

Deep learning-based speech specific source localization by using binaural and monaural microphone arrays in hearing aids

P Goli, S van de Par - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
A deep learning-based method is proposed for jointly detecting and localizing speech
sources in a complex acoustic scene by using microphones of a hearing aid. Motivated by …

Multi event localization by audio-visual fusion with omnidirectional camera and microphone array

W Zheng, R Yoshihashi, R Kawakami… - Proceedings of the …, 2023 - openaccess.thecvf.com
Audio-visual fusion is a promising approach for identifying multiple events occurring
simultaneously at different locations in the real world. Previous studies on audio-visual event …

Differentiable tracking-based training of deep learning sound source localizers

S Adavanne, A Politis, T Virtanen - 2021 IEEE workshop on …, 2021 - ieeexplore.ieee.org
Data-based and learning-based sound source localization (SSL) has shown promising
results in challenging conditions, and is commonly set as a classification or a regression …

Dual input neural networks for positional sound source localization

E Grinstein, VW Neo, PA Naylor - … Journal on Audio, Speech, and Music …, 2023 - Springer
In many signal processing applications, metadata may be advantageously used in
conjunction with a high dimensional signal to produce a desired output. In the case of …

IFAN: An icosahedral feature attention network for sound source localization

XC Zhu, H Zhang, HT Feng, DH Zhao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Currently, sound source localization (SSL) techniques based on deep learning mainly rely
on traditional signal processing methods to generate input features. Nevertheless, the …

Binaural source localization using deep learning and head rotation information

G García-Barrios, DA Krause, A Politis… - 2022 30th European …, 2022 - ieeexplore.ieee.org
This work studies learning-based binaural sound source localization, under the influence of
head rotation in rever-berant conditions. Emphasis is on whether knowledge of head …