Vision-guided robot hearing

X Alameda-Pineda, R Horaud - The International Journal of …, 2015 - journals.sagepub.com
Natural human–robot interaction (HRI) in complex and unpredictable environments is
important with many potential applications. While vision-based HRI has been thoroughly …

Audio-visual speaker localization via weighted clustering

ID Gebru, X Alameda-Pineda… - … on Machine Learning …, 2014 - ieeexplore.ieee.org
In this paper we address the problem of detecting and locating speakers using audiovisual
data. We address this problem in the framework of clustering. We propose a novel weighted …

Finding audio-visual events in informal social gatherings

X Alameda-Pineda, V Khalidov, R Horaud… - Proceedings of the 13th …, 2011 - dl.acm.org
In this paper we address the problem of detecting and localizing objects that can be both
seen and heard, eg, people. This may be solved within the framework of data clustering. We …

The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements

E Arnaud, H Christensen, YC Lu, J Barker… - Proceedings of the 10th …, 2008 - dl.acm.org
This paper describes the acquisition and content of a new multi-modal database. Some tools
for making use of the data streams are also presented. The Computational Audio-Visual …

Computer-implemented event detection using sonification

F Pinel, B Gross, CD Wolfson - US Patent 11,343,545, 2022 - Google Patents
Computer-implemented event detection includes obtaining, at one or more processors,
multimedia data including mul tiple frames of video data and corresponding audio data. The …

Alignment of binocular-binaural data using a moving audio-visual target

V Khalidov, F Forbes, R Horaud - 2013 IEEE 15th International …, 2013 - ieeexplore.ieee.org
In this paper we address the problem of aligning visual (V) and auditory (A) data using a
sensor that is composed of a camera-pair and a microphone-pair. The original contribution …

[PDF][PDF] Multimodal probabilistic person tracking and identification in smart spaces

K Bernardin - 2009 - core.ac.uk
Intelligente Räume, die die sich in ihnen aufhaltenden Personen wahrnehmen und
intelligente, Mensch-zentrierte Dienste anbieten, sind ein aktives Forschungsfeld. In diesem …

[PDF][PDF] Audio-visual fusion: New methods and applications

A Llagostera Casanovas - 2011 - infoscience.epfl.ch
The perception that we have about the world is influenced by elements of diverse nature.
Indeed humans tend to integrate information coming from different sensory modalities to …

Conjugate Mixture Models for the Modeling of Visual and Auditory Perception

V Khalidov - 2010 - theses.hal.science
In this thesis, the modelling of audio-visual perception with a head-like device is considered.
The related problems, namely audio-visual calibration, audio-visual object detection …

A joint audio-visual approach to audio localization

JR Jensen, MG Christensen - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Localization of audio sources is an important research problem, eg, to facilitate noise
reduction. In the recent years, the problem has been tackled using distributed microphone …