Localization of sound sources in robotics: A review
Sound source localization (SSL) in a robotic platform has been essential in the overall
scheme of robot audition. It allows a robot to locate a sound source by sound alone. It has an …
A survey of sound source localization with deep learning methods
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …
A consolidated perspective on multimicrophone speech enhancement and source separation
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …
EM algorithms for weighted-data clustering with application to audio-visual scene analysis
Data clustering has received a lot of attention and numerous methods, algorithms and
software packages are available. Among these techniques, parametric finite-mixture models …
Audio-visual speaker diarization based on spatiotemporal Bayesian fusion
Speaker diarization consists of assigning speech signals to people engaged in a dialogue.
An audio-visual spatiotemporal diarization model is proposed. The model is well suited for …
Neural network adaptation and data augmentation for multi-speaker direction-of-arrival estimation
Deep neural networks have been successfully applied to sound direction-of-arrival
estimation under challenging conditions. However, such a learning-based approach …
Multi-target DoA estimation with an audio-visual fusion mechanism
Most of the prior studies in the spatial Direction of Arrival (DoA) domain focus on a single
modality. However, humans use auditory and visual senses to detect the presence of sound …
Audio–visual particle flow SMC-PHD filtering for multi-speaker tracking
Sequential Monte Carlo probability hypothesis density (SMC-PHD) filtering is a popular
method used recently for audio-visual (AV) multi-speaker tracking. However, due to the …
Localize to binauralize: Audio spatialization from visual sound source localization
KK Rachavarapu, V Sundaresha… - Proceedings of the …, 2021 - openaccess.thecvf.com
Videos with binaural audios provide an immersive viewing experience by enabling 3D
sound sensation. Recent works attempt to generate binaural audio in a multimodal learning …
Multi-speaker tracking from an audio–visual sensing device
Compact multi-sensor platforms are portable and thus desirable for robotics and personal-
assistance tasks. However, compared to physically distributed sensors, the size of these …