Challenges and solutions for vision-based hand gesture interpretation: A review
Hand gesture is one of the most efficient and natural interfaces in current human–computer
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …
An outlook into the future of egocentric vision
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …
research in egocentric vision and the ever-anticipated future, where wearable computing …
Bidirectional progressive transformer for interaction intention anticipation
Interaction intention anticipation aims to jointly predict future hand trajectories and
interaction hotspots. Existing research often treated trajectory forecasting and interaction …
interaction hotspots. Existing research often treated trajectory forecasting and interaction …
Prompting Future Driven Diffusion Model for Hand Motion Prediction
Hand motion prediction from both first-and third-person perspectives is vital for enhancing
user experience in AR/VR and ensuring safe remote robotic arm control. Previous works …
user experience in AR/VR and ensuring safe remote robotic arm control. Previous works …
AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation
Abstract Short-Term object-interaction Anticipation (STA) consists of detecting the location of
the next-active objects, the noun and verb categories of the interaction, and the time to …
the next-active objects, the noun and verb categories of the interaction, and the time to …
General flow as foundation affordance for scalable robot learning
We address the challenge of acquiring real-world manipulation skills with a scalable
framework. Inspired by the success of large-scale auto-regressive prediction in Large …
framework. Inspired by the success of large-scale auto-regressive prediction in Large …
Pear: Phrase-based hand-object interaction anticipation
First-person hand-object interaction anticipation aims to predict the interaction process over
a forthcoming period based on current scenes and prompts. This capability is crucial for …
a forthcoming period based on current scenes and prompts. This capability is crucial for …
MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos
Egocentric Hand Object Interaction (HOI) videos provide valuable insights into human
interactions with the physical world, attracting growing interest from the computer vision and …
interactions with the physical world, attracting growing interest from the computer vision and …
Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …
applications in service robot manipulation and extended reality. To achieve this, some …