Localization of sound sources in robotics: A review
Sound source localization (SSL) in a robotic platform has been essential in the overall
scheme of robot audition. It allows a robot to locate a sound source by sound alone. It has an …
A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering
We propose a family of multivariate heavy-tailed distributions that allow variable marginal
amounts of tailweight. The originality comes from introducing multidimensional instead of …
Multi-speaker tracking from an audio–visual sensing device
Compact multi-sensor platforms are portable and thus desirable for robotics and personal-
assistance tasks. However, compared to physically distributed sensors, the size of these …
The vernissage corpus: A conversational human-robot-interaction dataset
We introduce a new conversational Human-Robot-Interaction (HRI) dataset with a real-
behaving robot inducing interactive behavior with and between humans. Our scenario …
RAVEL: An annotated corpus for training robots with audiovisual abilities
We introduce Ravel (Robots with Audiovisual Abilities), a publicly available data set
which covers examples of Human Robot Interaction (HRI) scenarios. These scenarios are …
Audio-visual speaker tracking: Progress, challenges, and future directions
Audio-visual speaker tracking has drawn increasing attention over the past few years due to
its academic values and wide application. Audio and visual modalities can provide …
Vision-guided robot hearing
Natural human–robot interaction (HRI) in complex and unpredictable environments is
important with many potential applications. While vision-based HRI has been thoroughly …
Tragic Talkers: A Shakespearean sound- and light-field dataset for audio-visual machine learning research
3D audio-visual production aims to deliver immersive and interactive experiences to the
consumer. Yet, faithfully reproducing real-world 3D scenes remains a challenging task. This …
Conjugate mixture models for clustering multimodal data
The problem of multimodal clustering arises whenever the data are gathered with several
physically different sensors. Observations from different modalities are not necessarily …
The Sheffield wargames corpus
Recognition of speech in natural environments is a challenging task, even more so if this
involves conversations between several speakers. Work on meeting recognition has …