- Academic Search

Localization of sound sources in robotics: A review

C Rascon, I Meza - Robotics and Autonomous Systems, 2017 - Elsevier

Sound source localization (SSL) in a robotic platform has been essential in the overall
scheme of robot audition. It allows a robot to locate a sound source by sound alone. It has an …

Cited by: 336. All 7 versions.

[PDF] hal.science

A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering

F Forbes, D Wraith - Statistics and computing, 2014 - Springer

We propose a family of multivariate heavy-tailed distributions that allow variable marginal
amounts of tailweight. The originality comes from introducing multidimensional instead of …

Cited by: 123. All 17 versions.

[PDF] qmul.ac.uk

Multi-speaker tracking from an audio–visual sensing device

X Qian, A Brutti, O Lanz, M Omologo… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Compact multi-sensor platforms are portable and thus desirable for robotics and personal-
assistance tasks. However, compared to physically distributed sensors, the size of these …

Cited by: 64. All 11 versions.

[PDF] epfl.ch

The vernissage corpus: A conversational human-robot-interaction dataset

DB Jayagopi, S Sheiki, D Klotz… - 2013 8th ACM/IEEE …, 2013 - ieeexplore.ieee.org

We introduce a new conversational Human-Robot-Interaction (HRI) dataset with a real-
behaving robot inducing interactive behavior with and between humans. Our scenario …

Cited by: 64. All 16 versions.

[PDF] hal.science

RAVEL: An annotated corpus for training robots with audiovisual abilities

X Alameda-Pineda, J Sanchez-Riera, J Wienke… - Journal on Multimodal …, 2013 - Springer

We introduce Ravel (Robots with Audiovisual Abilities), a publicly available data set
which covers examples of Human Robot Interaction (HRI) scenarios. These scenarios are …

Cited by: 46. All 17 versions.

[PDF] arxiv.org

Audio-visual speaker tracking: Progress, challenges, and future directions

J Zhao, Y Xu, X Qian, D Berghi, P Wu, M Cui… - arXiv preprint arXiv …, 2023 - arxiv.org

Audio-visual speaker tracking has drawn increasing attention over the past few years due to
its academic values and wide application. Audio and visual modalities can provide …

Cited by: 6. All 3 versions.

[PDF] arxiv.org

Vision-guided robot hearing

X Alameda-Pineda, R Horaud - The International Journal of …, 2015 - journals.sagepub.com

Natural human–robot interaction (HRI) in complex and unpredictable environments is
important with many potential applications. While vision-based HRI has been thoroughly …

Cited by: 41. All 13 versions.

[PDF] arxiv.org

Tragic Talkers: A Shakespearean sound- and light-field dataset for audio-visual machine learning research

D Berghi, M Volino, PJB Jackson - Proceedings of the 19th ACM …, 2022 - dl.acm.org

3D audio-visual production aims to deliver immersive and interactive experiences to the
consumer. Yet, faithfully reproducing real-world 3D scenes remains a challenging task. This …

Cited by: 5. All 6 versions.

[PDF] arxiv.org

Conjugate mixture models for clustering multimodal data

V Khalidov, F Forbes, R Horaud - Neural Computation, 2011 - ieeexplore.ieee.org

The problem of multimodal clustering arises whenever the data are gathered with several
physically different sensors. Observations from different modalities are not necessarily …

Cited by: 38. All 25 versions.

[PDF] whiterose.ac.uk

[PDF] The Sheffield wargames corpus

CW Fox, Y Liu, E Zwyssig… - … of Interspeech 2013, 2013 - eprints.whiterose.ac.uk

Recognition of speech in natural environments is a challenging task, even more so if this
involves conversations between several speakers. Work on meeting recognition has …

Cited by: 26. All 12 versions.

The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements