An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Challenges and solutions for vision-based hand gesture interpretation: A review

K Gao, H Zhang, X Liu, X Wang, L Xie, B Ji… - Computer Vision and …, 2024 - Elsevier
Hand gesture is one of the most efficient and natural interfaces in current human–computer
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …

General flow as foundation affordance for scalable robot learning

C Yuan, C Wen, T Zhang, Y Gao - arXiv preprint arXiv:2401.11439, 2024 - arxiv.org
We address the challenge of acquiring real-world manipulation skills with a scalable
framework. We hold the belief that identifying an appropriate prediction target capable of …

Prompting Future Driven Diffusion Model for Hand Motion Prediction

B Tang, K Zhang, W Luo, W Liu, H Li - European Conference on Computer …, 2024 - Springer
Hand motion prediction from both first- and third-person perspectives is vital for enhancing
user experience in AR/VR and ensuring safe remote robotic arm control. Previous works …

Bidirectional progressive transformer for interaction intention anticipation

Z Zhang, H Luo, W Zhai, Y Cao, Y Kang - European Conference on …, 2024 - Springer
Interaction intention anticipation aims to jointly predict future hand trajectories and
interaction hotspots. Existing research often treated trajectory forecasting and interaction …

AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation

L Mur-Labadia, R Martinez-Cantin, JJ Guerrero… - … on Computer Vision, 2024 - Springer
Short-Term object-interaction Anticipation (STA) consists of detecting the location of
the next-active objects, the noun and verb categories of the interaction, and the time to …

Pear: Phrase-based hand-object interaction anticipation

Z Zhang, H Luo, W Zhai, Y Cao, Y Kang - arXiv preprint arXiv:2407.21510, 2024 - arxiv.org
First-person hand-object interaction anticipation aims to predict the interaction process over
a forthcoming period based on current scenes and prompts. This capability is crucial for …

MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos

J Ma, X Chen, W Bao, J Xu, H Wang - arXiv preprint arXiv:2409.02638, 2024 - arxiv.org
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …

Diff-IP2D: Diffusion-based hand-object interaction prediction on egocentric videos

J Ma, J Xu, X Chen, H Wang - arXiv preprint arXiv:2405.04370, 2024 - arxiv.org
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …

EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction

I Fang, Y Chen, Y Wang, J Zhang… - … on Robotics and …, 2024 - ieeexplore.ieee.org
A robot's ability to anticipate the 3D action target location of a hand's movement from
egocentric videos can greatly improve safety and efficiency in human-robot interaction (HRI) …