[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Wireless localization based on deep learning: state of art and challenges

YX Ye, AN Lu, MY You, K Huang… - … Problems in Engineering, 2020 - Wiley Online Library
The problem of position estimation has always been widely discussed in the field of wireless
communication. In recent years, deep learning technology is rapidly develo** and …

Audio-visual cross-attention network for robotic speaker tracking

X Qian, Z Wang, J Wang, G Guan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Audio-visual signals can be used jointly for robotic perception as they complement each
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …

Neural network adaptation and data augmentation for multi-speaker direction-of-arrival estimation

W He, P Motlicek, JM Odobez - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
Deep neural networks have been successfully applied to sound direction-of-arrival
estimation under challenging conditions. However, such a learning-based approach …

Generative adversarial networks with physical sound field priors

X Karakonstantis, E Fernandez-Grande - The Journal of the Acoustical …, 2023 - pubs.aip.org
This paper presents a deep learning-based approach for the spatiotemporal reconstruction
of sound fields using generative adversarial networks. The method utilises a plane wave …

Sound source localization based on GCC-PHAT with diffuseness mask in noisy and reverberant environments

R Lee, MS Kang, BH Kim, KH Park, SQ Lee… - IEEE …, 2020 - ieeexplore.ieee.org
Although sound source localization is a desirable technique in many communication
systems and intelligence applications, the distortion caused by diffuse noise or reverberation …

Listening for sirens: Locating and classifying acoustic alarms in city scenes

L Marchegiani, P Newman - IEEE transactions on intelligent …, 2022 - ieeexplore.ieee.org
This paper is about acoustic event detection and sound source localisation in urban
scenarios. Specifically, we are interested in detecting and localising horns and sirens of …

Deep audio-visual beamforming for speaker localization

X Qian, Q Zhang, G Guan, W Xue - IEEE Signal Processing …, 2022 - ieeexplore.ieee.org
Generalized Cross Correlation (GCC) is the most popular localization technique over the
past decades and can be extended with the beamforming method eg Steered Response …

Direction of Arrival Joint Prediction of Underwater Acoustic Communication Signals Using Faster R-CNN and Frequency–Azimuth Spectrum.

L Cheng, Y Liu, B Zhang, Z Hu, H Zhu… - Remote …, 2024 - search.ebscohost.com
Utilizing hydrophone arrays for detecting underwater acoustic communication (UWAC)
signals leverages spatial information to enhance detection efficiency and expand the …

BeamLearning: An end-to-end deep learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data

H Pujol, E Bavu, A Garcia - The Journal of the Acoustical Society of …, 2021 - pubs.aip.org
BeamLearning: An end-to-end deep learning approach for the angular localization of sound
sources using raw multichannel acoustic pressure dataa) | The Journal of the Acoustical …