Analysis of the hands in egocentric vision: A survey

A Bandini, J Zariffa - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org
Egocentric vision (aka first-person vision–FPV) applications have thrived over the past few
years, thanks to the availability of affordable wearable cameras and large annotated …

Socratic models: Composing zero-shot multimodal reasoning with language

A Zeng, M Attarian, B Ichter, K Choromanski… - arxiv preprint arxiv …, 2022 - arxiv.org
Large pretrained (eg," foundation") models exhibit distinct capabilities depending on the
domain of data they are trained on. While these domains are generic, they may only barely …

[HTML][HTML] Video activity recognition: State-of-the-art

I Rodríguez-Moreno, JM Martínez-Otzeta, B Sierra… - Sensors, 2019 - mdpi.com
Video activity recognition, although being an emerging task, has been the subject of
important research efforts due to the importance of its everyday applications. Surveillance by …

Epic-fusion: Audio-visual temporal binding for egocentric action recognition

E Kazakos, A Nagrani, A Zisserman… - Proceedings of the …, 2019 - openaccess.thecvf.com
We focus on multi-modal fusion for egocentric action recognition, and propose a novel
architecture for multi-modal temporal-binding, ie the combination of modalities within a …

H2o: Two hands manipulating objects for first person interaction recognition

T Kwon, B Tekin, J Stühmer, F Bogo… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a comprehensive framework for egocentric interaction recognition using
markerless 3D annotations of two hands manipulating objects. To this end, we propose a …

A comprehensive study of deep video action recognition

Y Zhu, X Li, C Liu, M Zolfaghari, Y **ong, C Wu… - arxiv preprint arxiv …, 2020 - arxiv.org
Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …

Human poseitioning system (hps): 3d human pose estimation and self-localization in large scenes from body-mounted sensors

V Guzov, A Mir, T Sattler… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract We introduce (HPS) Human POSEitioning System, a method to recover the full 3D
pose of a human registered with a 3D scan of the surrounding environment using wearable …

First-person hand action benchmark with rgb-d videos and 3d hand pose annotations

G Garcia-Hernando, S Yuan… - Proceedings of the …, 2018 - openaccess.thecvf.com
In this work we study the use of 3D hand poses to recognize first-person dynamic hand
actions interacting with 3D objects. Towards this goal, we collected RGB-D video sequences …

H+ o: Unified egocentric recognition of 3d hand-object poses and interactions

B Tekin, F Bogo, M Pollefeys - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
We present a unified framework for understanding 3D hand and object interactions in raw
image sequences from egocentric RGB cameras. Given a single RGB image, our model …

In the eye of beholder: Joint learning of gaze and actions in first person video

Y Li, M Liu, JM Rehg - Proceedings of the European …, 2018 - openaccess.thecvf.com
We address the task of jointly determining what a person is doing and where they are
looking based on the analysis of video captured by a headworn camera. We propose a …