Challenges and solutions for vision-based hand gesture interpretation: A review

K Gao, H Zhang, X Liu, X Wang, L **e, B Ji… - Computer Vision and …, 2024 - Elsevier
Hand gesture is one of the most efficient and natural interfaces in current human–computer
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …

An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Bidirectional progressive transformer for interaction intention anticipation

Z Zhang, H Luo, W Zhai, Y Cao, Y Kang - European Conference on …, 2024 - Springer
Interaction intention anticipation aims to jointly predict future hand trajectories and
interaction hotspots. Existing research often treated trajectory forecasting and interaction …

Prompting Future Driven Diffusion Model for Hand Motion Prediction

B Tang, K Zhang, W Luo, W Liu, H Li - European Conference on Computer …, 2024 - Springer
Hand motion prediction from both first-and third-person perspectives is vital for enhancing
user experience in AR/VR and ensuring safe remote robotic arm control. Previous works …

AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation

L Mur-Labadia, R Martinez-Cantin, JJ Guerrero… - … on Computer Vision, 2024 - Springer
Abstract Short-Term object-interaction Anticipation (STA) consists of detecting the location of
the next-active objects, the noun and verb categories of the interaction, and the time to …

General flow as foundation affordance for scalable robot learning

C Yuan, C Wen, T Zhang, Y Gao - arxiv preprint arxiv:2401.11439, 2024 - arxiv.org
We address the challenge of acquiring real-world manipulation skills with a scalable
framework. Inspired by the success of large-scale auto-regressive prediction in Large …

Pear: Phrase-based hand-object interaction anticipation

Z Zhang, H Luo, W Zhai, Y Cao, Y Kang - arxiv preprint arxiv:2407.21510, 2024 - arxiv.org
First-person hand-object interaction anticipation aims to predict the interaction process over
a forthcoming period based on current scenes and prompts. This capability is crucial for …

MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos

J Ma, X Chen, W Bao, J Xu, H Wang - arxiv preprint arxiv:2409.02638, 2024 - arxiv.org
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …

UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos

C Yuan, G Chen, L Yi, Y Gao - arxiv preprint arxiv:2411.09145, 2024 - arxiv.org
Egocentric Hand Object Interaction (HOI) videos provide valuable insights into human
interactions with the physical world, attracting growing interest from the computer vision and …

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

J Ma, J Xu, X Chen, H Wang - arxiv preprint arxiv:2405.04370, 2024 - arxiv.org
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …