Augmented datasheets for speech datasets and ethical decision-making

O Papakyriakopoulos, ASG Choi, W Thong… - Proceedings of the …, 2023 - dl.acm.org
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …

Tracking gaze and visual focus of attention of people involved in social interaction

B Massé, S Ba, R Horaud - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
The visual focus of attention (VFOA) has been recognized as a prominent conversational
cue. We are interested in estimating and tracking the VFOAs associated with multi-party …

A multi-level context-based modeling of engagement in human-robot interaction

H Salam, M Chetouani - … and workshops on automatic face and …, 2015 - ieeexplore.ieee.org
In this paper, we consider engagement in the context of Human-Robot Interaction (HRI).
Previous studies in HRI relate engagement to emotion and attention independently from the …

The vernissage corpus: A conversational human-robot-interaction dataset

DB Jayagopi, S Sheiki, D Klotz… - 2013 8th ACM/IEEE …, 2013 - ieeexplore.ieee.org
We introduce a new conversational Human-Robot-Interaction (HRI) dataset with a real-
behaving robot inducing interactive behavior with and between humans. Our scenario …

Engagement detection based on mutli-party cues for human robot interaction

H Salam, M Chetouani - 2015 international conference on …, 2015 - ieeexplore.ieee.org
In this paper, we address the problematic of automatic detection of engagement in multi-
party Human-Robot Interaction scenarios. The aim is to investigate to what extent are we …

To whom are you talking? a deep learning model to endow social robots with addressee estimation skills

C Mazzola, M Romeo, F Rea, A Sciutti… - arXiv preprint arXiv …, 2023 - arxiv.org
Communicating shapes our social word. For a robot to be considered social and being
consequently integrated in our social environment it is fundamental to understand some of …

Multiple-gaze geometry: Inferring novel 3d locations from gazes observed in monocular video

E Brau, J Guan, T Jeffries… - Proceedings of the …, 2018 - openaccess.thecvf.com
We develop using person gaze direction for scene understanding. In particular, we use
intersecting gazes to learn 3D locations that people tend to look at, which is analogous to …

On inferring intentions in shared tasks for industrial collaborative robots

A Olivares-Alarcos, S Foix, G Alenya - Electronics, 2019 - mdpi.com
Inferring human operators' actions in shared collaborative tasks plays a crucial role in
enhancing the cognitive capabilities of industrial robots. In all these incipient collaborative …

A multi-modal explainability approach for human-aware robots in multi-party conversation

I Bečková, Š Pócoš, G Belgiovine, M Matarese… - Computer Vision and …, 2025 - Elsevier
The addressee estimation (understanding to whom somebody is talking) is a fundamental
task for human activity recognition in multi-party conversation scenarios. Specifically, in the …

Variational inference and learning of piecewise linear dynamical systems

X Alameda-Pineda, V Drouard… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Modeling the temporal behavior of data is of primordial importance in many scientific and
engineering fields. Baseline methods assume that both the dynamic and observation …