Google Akademik

C Rascon, I Meza - Robotics and Autonomous Systems, 2017 - Elsevier

Sound source localization (SSL) in a robotic platform has been essential in the overall
scheme of robot audition. It allows a robot to locate a sound source by sound alone. It has an …

Kaydet Alıntı yap Alıntılanma sayısı: 336 İlgili makaleler 7 sürümün hepsi

[Free GPT-4]

[HTML] aip.org

[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org

This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Kaydet Alıntı yap Alıntılanma sayısı: 286 İlgili makaleler 13 sürümün hepsi

[Free GPT-4]

[PDF] hal.science

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org

Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Kaydet Alıntı yap Alıntılanma sayısı: 646 İlgili makaleler 12 sürümün hepsi

[Free GPT-4]

[PDF] arxiv.org

EM algorithms for weighted-data clustering with application to audio-visual scene analysis

ID Gebru, X Alameda-Pineda, F Forbes… - IEEE transactions on …, 2016 - ieeexplore.ieee.org

Data clustering has received a lot of attention and numerous methods, algorithms and
software packages are available. Among these techniques, parametric finite-mixture models …

Kaydet Alıntı yap Alıntılanma sayısı: 153 İlgili makaleler 14 sürümün hepsi

[Free GPT-4]

[PDF] arxiv.org

Audio-visual speaker diarization based on spatiotemporal bayesian fusion

ID Gebru, S Ba, X Li, R Horaud - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org

Speaker diarization consists of assigning speech signals to people engaged in a dialogue.
An audio-visual spatiotemporal diarization model is proposed. The model is well suited for …

Kaydet Alıntı yap Alıntılanma sayısı: 129 İlgili makaleler 12 sürümün hepsi

[Free GPT-4]

[PDF] google.com

Neural network adaptation and data augmentation for multi-speaker direction-of-arrival estimation

W He, P Motlicek, JM Odobez - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org

Deep neural networks have been successfully applied to sound direction-of-arrival
estimation under challenging conditions. However, such a learning-based approach …

Kaydet Alıntı yap Alıntılanma sayısı: 52 İlgili makaleler 5 sürümün hepsi

[Free GPT-4]

[PDF] arxiv.org

Multi-target DoA estimation with an audio-visual fusion mechanism

X Qian, M Madhavi, Z Pan, J Wang… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Most of the prior studies in the spatial Direction of Arrival (DoA) domain focus on a single
modality. However, humans use auditory and visual senses to detect the presence of sound …

Kaydet Alıntı yap Alıntılanma sayısı: 46 İlgili makaleler 3 sürümün hepsi

[Free GPT-4]

[PDF] surrey.ac.uk

Audio–visual particle flow smc-phd filtering for multi-speaker tracking

Y Liu, V Kılıç, J Guan, W Wang - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Sequential Monte Carlo probability hypothesis density (SMC-PHD) filtering is a popular
method used recently for audio-visual (AV) multi-speaker tracking. However, due to the …

Kaydet Alıntı yap Alıntılanma sayısı: 68 İlgili makaleler 4 sürümün hepsi

[Free GPT-4]

[PDF] thecvf.com

Localize to binauralize: Audio spatialization from visual sound source localization

KK Rachavarapu, V Sundaresha… - Proceedings of the …, 2021 - openaccess.thecvf.com

Videos with binaural audios provide an immersive viewing experience by enabling 3D
sound sensation. Recent works attempt to generate binaural audio in a multimodal learning …

Kaydet Alıntı yap Alıntılanma sayısı: 27 İlgili makaleler 4 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] qmul.ac.uk

Multi-speaker tracking from an audio–visual sensing device

X Qian, A Brutti, O Lanz, M Omologo… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Compact multi-sensor platforms are portable and thus desirable for robotics and personal-
assistance tasks. However, compared to physically distributed sensors, the size of these …

Kaydet Alıntı yap Alıntılanma sayısı: 64 İlgili makaleler 11 sürümün hepsi

Uyarı oluştur

Alıntı yap

Gelişmiş arama

Kitaplığım'a kaydedildi

Co-localization of audio sources in images using binaural features and locally-linear regression

[HTML][HTML] Localization of sound sources in robotics: A review

[HTML][HTML] A survey of sound source localization with deep learning methods

A consolidated perspective on multimicrophone speech enhancement and source separation

EM algorithms for weighted-data clustering with application to audio-visual scene analysis

Audio-visual speaker diarization based on spatiotemporal bayesian fusion

Neural network adaptation and data augmentation for multi-speaker direction-of-arrival estimation

Multi-target DoA estimation with an audio-visual fusion mechanism

Audio–visual particle flow smc-phd filtering for multi-speaker tracking

Localize to binauralize: Audio spatialization from visual sound source localization

Multi-speaker tracking from an audio–visual sensing device