[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Explaining and interpreting LSTMs

L Arras, J Arjona-Medina, M Widrich… - … and visualizing deep …, 2019 - Springer
While neural networks have acted as a strong unifying force in the design of modern AI
systems, the neural network architectures themselves remain highly heterogeneous due to …

Towards end-to-end acoustic localization using deep learning: From audio signals to source position coordinates

JM Vera-Diaz, D Pizarro, J Macias-Guarasa - Sensors, 2018 - mdpi.com
This paper presents a novel approach for indoor acoustic source localization using
microphone arrays, based on a Convolutional Neural Network (CNN). In the proposed …

CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings

L Perotin, R Serizel, E Vincent… - IEEE Journal of Selected …, 2019 - ieeexplore.ieee.org
Localizing audio sources is challenging in real reverberant environments, especially when
several sources are active. We propose to use a neural network built from stacked …

Blind reverberation time estimation using a convolutional neural network

H Gamper, IJ Tashev - 2018 16th International Workshop on …, 2018 - ieeexplore.ieee.org
The reverberation time of an acoustic environment is a useful parameter for applications
including source localisation, speech recognition and mixed reality. However, estimating the …

[HTML][HTML] Augmenting perception: How artificial intelligence transforms sensory substitution

L Longin, O Deroy - Consciousness and Cognition, 2022 - Elsevier
What happens when artificial sensors are coupled with the human senses? Using
technology to extend the senses is an old human dream, on which sensory substitution and …

Quaternion convolutional neural networks for detection and localization of 3D sound events

D Comminiello, M Lella, S Scardapane… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
Learning from data in the quaternion domain enables us to exploit internal dependencies of
4D signals and treating them as a single entity. One of the models that perfectly suits with …

Blind room volume estimation from single-channel noisy speech

AF Genovese, H Gamper, V Pulkki… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
Recent work on acoustic parameter estimation indicates that geometric room volume can be
useful for modeling the character of an acoustic environment. However, estimating volume …

Sound source localization for auditory perception of a humanoid robot using deep neural networks

G Boztas - Neural Computing and Applications, 2023 - Springer
This paper presents an estimation of the sound source location using deep neural networks
in order to provide auditory perception of a humanoid robot. Estimation of a moving sound …

[HTML][HTML] Analyzing and visualizing deep neural networks for speech recognition with saliency-adjusted neuron activation profiles

A Krug, M Ebrahimzadeh, J Alemann, J Johannsmeier… - Electronics, 2021 - mdpi.com
Deep Learning-based Automatic Speech Recognition (ASR) models are very successful, but
hard to interpret. To gain a better understanding of how Artificial Neural Networks (ANNs) …