Augmented datasheets for speech datasets and ethical decision-making
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …
the lack of diversity of the underlying training data can lead to serious limitations in building …
Tracking gaze and visual focus of attention of people involved in social interaction
The visual focus of attention (VFOA) has been recognized as a prominent conversational
cue. We are interested in estimating and tracking the VFOAs associated with multi-party …
cue. We are interested in estimating and tracking the VFOAs associated with multi-party …
A multi-level context-based modeling of engagement in human-robot interaction
In this paper, we consider engagement in the context of Human-Robot Interaction (HRI).
Previous studies in HRI relate engagement to emotion and attention independently from the …
Previous studies in HRI relate engagement to emotion and attention independently from the …
The vernissage corpus: A conversational human-robot-interaction dataset
We introduce a new conversational Human-Robot-Interaction (HRI) dataset with a real-
behaving robot inducing interactive behavior with and between humans. Our scenario …
behaving robot inducing interactive behavior with and between humans. Our scenario …
Engagement detection based on mutli-party cues for human robot interaction
In this paper, we address the problematic of automatic detection of engagement in multi-
party Human-Robot Interaction scenarios. The aim is to investigate to what extent are we …
party Human-Robot Interaction scenarios. The aim is to investigate to what extent are we …
To whom are you talking? a deep learning model to endow social robots with addressee estimation skills
Communicating shapes our social word. For a robot to be considered social and being
consequently integrated in our social environment it is fundamental to understand some of …
consequently integrated in our social environment it is fundamental to understand some of …
Multiple-gaze geometry: Inferring novel 3d locations from gazes observed in monocular video
We develop using person gaze direction for scene understanding. In particular, we use
intersecting gazes to learn 3D locations that people tend to look at, which is analogous to …
intersecting gazes to learn 3D locations that people tend to look at, which is analogous to …
On inferring intentions in shared tasks for industrial collaborative robots
Inferring human operators' actions in shared collaborative tasks plays a crucial role in
enhancing the cognitive capabilities of industrial robots. In all these incipient collaborative …
enhancing the cognitive capabilities of industrial robots. In all these incipient collaborative …
[HTML][HTML] A multi-modal explainability approach for human-aware robots in multi-party conversation
The addressee estimation (understanding to whom somebody is talking) is a fundamental
task for human activity recognition in multi-party conversation scenarios. Specifically, in the …
task for human activity recognition in multi-party conversation scenarios. Specifically, in the …
Variational inference and learning of piecewise linear dynamical systems
Modeling the temporal behavior of data is of primordial importance in many scientific and
engineering fields. Baseline methods assume that both the dynamic and observation …
engineering fields. Baseline methods assume that both the dynamic and observation …