Actionvos: Actions as prompts for video object segmentation
Delving into the realm of egocentric vision, the advancement of referring video object
segmentation (RVOS) stands as pivotal in understanding human activities. However …
segmentation (RVOS) stands as pivotal in understanding human activities. However …
MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …
applications in service robot manipulation and extended reality. To achieve this, some …
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
Self-supervised learning has driven significant progress in learning from single-subject,
iconic images. However, there are still unanswered questions about the use of minimally …
iconic images. However, there are still unanswered questions about the use of minimally …