[HTML][HTML] A survey of sound source localization and detection methods and their applications

G Jekateryńczuk, Z Piotrowski - Sensors, 2023 - mdpi.com
This study is a survey of sound source localization and detection methods. The study
provides a detailed classification of the methods used in the fields of science mentioned …

[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

A dataset of dynamic reverberant sound scenes with directional interferers for sound event localization and detection

A Politis, S Adavanne, D Krause, A Deleforge… - arxiv preprint arxiv …, 2021 - arxiv.org
This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on
Sound Event Localization and Detection (SELD). The dataset is based on emulation of real …

Far-field automatic speech recognition

R Haeb-Umbach, J Heymann, L Drude… - Proceedings of the …, 2020 - ieeexplore.ieee.org
The machine recognition of speech spoken at a distance from the microphones, known as
far-field automatic speech recognition (ASR), has received a significant increase in attention …

Robots saving lives: A literature review about search and rescue (sar) in harsh environments

K Tong, Y Hu, B Dikic, S Solmaz… - 2024 IEEE Intelligent …, 2024 - ieeexplore.ieee.org
In recent years, the rise in both natural and man-made disasters, along with armed conflicts
and terrorist threats, has elevated the demand for Search and Rescue (SAR) missions …

A four-stage data augmentation approach to resnet-conformer based acoustic modeling for sound event localization and detection

Q Wang, J Du, HX Wu, J Pan, F Ma… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
In this paper, we propose a novel four-stage data augmentation approach to ResNet-
Conformer based acoustic modeling for sound event localization and detection (SELD) …

Audio-visual cross-attention network for robotic speaker tracking

X Qian, Z Wang, J Wang, G Guan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Audio-visual signals can be used jointly for robotic perception as they complement each
other. Such multi-modal sensory fusion has a clear advantage, especially under noisy …

Frequency-sliding generalized cross-correlation: A sub-band time delay estimation approach

M Cobos, F Antonacci, L Comanducci… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
The generalized cross-correlation (GCC) is regarded as the most popular approach for
estimating the time difference of arrival (TDOA) between the signals received at two sensors …

Closed-loop sound source localization in neuromorphic systems

T Schoepe, D Gutierrez-Galan… - Neuromorphic …, 2023 - iopscience.iop.org
Sound source localization (SSL) is used in various applications such as industrial noise-
control, speech detection in mobile phones, speech enhancement in hearing aids and many …

A deep learning framework for robust DOA estimation using spherical harmonic decomposition

V Varanasi, H Gupta, RM Hegde - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org
Spherical harmonic decomposition facilitates decomposing the sound pressure at different
microphones into independent functions of frequency, azimuth and elevation of the source …