[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Symphony: Localizing multiple acoustic sources with a single microphone array

W Wang, J Li, Y He, Y Liu - Proceedings of the 18th Conference on …, 2020 - dl.acm.org
Sound recognition is an important and popular function of smart devices. The location of
sound is basic information associated with the acoustic source. Apart from sound …

Event-independent network for polyphonic sound event localization and detection

Y Cao, T Iqbal, Q Kong, Y Zhong, W Wang… - arxiv preprint arxiv …, 2020 - arxiv.org
Polyphonic sound event localization and detection is not only detecting what sound events
are happening but localizing corresponding sound sources. This series of tasks was first …

Deepear: Sound localization with binaural microphones

Q Yang, Y Zheng - IEEE Transactions on Mobile Computing, 2022 - ieeexplore.ieee.org
The binaural microphone, which refers to a pair of microphones with artificial human-shaped
ears, is widely used in hearing aids and spatial audio recording to improve sound quality. It …

Adaptive direction-of-arrival estimation using deep neural network in marine acoustic environment

W Nie, X Zhang, J Xu, L Guo, Y Yan - IEEE Sensors Journal, 2023 - ieeexplore.ieee.org
Deep learning is widely used for target detection and direction-of-arrival (DOA) estimation
due to its powerful data fitting capability. However, limited by different environments and …

Decoupled multiple speaker direction-of-arrival estimator under reverberant environments

Y Hu, PN Samarasinghe, S Gannot… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
Direction-of-arrival (DOA) estimation for multiple simultaneous speakers in reverberant
environments is still one of the challenging tasks in the audio signal processing field. A …

Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis

V Letzelter, M Fontaine, M Chen… - Advances in neural …, 2023 - proceedings.neurips.cc
Abstract We introduce Resilient Multiple Choice Learning (rMCL), an extension of the MCL
approach for conditional distribution estimation in regression settings where multiple targets …

BeamLearning: An end-to-end deep learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data

H Pujol, E Bavu, A Garcia - The Journal of the Acoustical Society of …, 2021 - pubs.aip.org
BeamLearning: An end-to-end deep learning approach for the angular localization of sound
sources using raw multichannel acoustic pressure dataa) | The Journal of the Acoustical …

SoundSynp: sound source detection from raw waveforms with multi-scale synperiodic filterbanks

Y He, A Markham - International Conference on Artificial …, 2023 - proceedings.mlr.press
We propose synperiodic filter banks, a novel multi-scale learnable filter bank construction
strategy that all filters are synchronized by their rotating periodicity. By synchronizing in a …

Direction of Arrival Joint Prediction of Underwater Acoustic Communication Signals Using Faster R-CNN and Frequency–Azimuth Spectrum.

L Cheng, Y Liu, B Zhang, Z Hu, H Zhu… - Remote …, 2024 - search.ebscohost.com
Utilizing hydrophone arrays for detecting underwater acoustic communication (UWAC)
signals leverages spatial information to enhance detection efficiency and expand the …