- Academic Search

A Bandini, J Zariffa - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org

Egocentric vision (aka first-person vision–FPV) applications have thrived over the past few
years, thanks to the availability of affordable wearable cameras and large annotated …

Cited by 107

[PDF] arxiv.org

Socratic models: Composing zero-shot multimodal reasoning with language

A Zeng, M Attarian, B Ichter, K Choromanski… - arXiv preprint arXiv …, 2022 - arxiv.org

Large pretrained (e.g., "foundation") models exhibit distinct capabilities depending on the
domain of data they are trained on. While these domains are generic, they may only barely …

Cited by 523

[HTML] mdpi.com

[HTML] Video activity recognition: State-of-the-art

I Rodríguez-Moreno, JM Martínez-Otzeta, B Sierra… - Sensors, 2019 - mdpi.com

Video activity recognition, although being an emerging task, has been the subject of
important research efforts due to the importance of its everyday applications. Surveillance by …

Cited by 101

[PDF] thecvf.com

EPIC-Fusion: Audio-visual temporal binding for egocentric action recognition

E Kazakos, A Nagrani, A Zisserman… - Proceedings of the …, 2019 - openaccess.thecvf.com

We focus on multi-modal fusion for egocentric action recognition, and propose a novel
architecture for multi-modal temporal-binding, i.e., the combination of modalities within a …

Cited by 428

[PDF] thecvf.com

H2O: Two hands manipulating objects for first person interaction recognition

T Kwon, B Tekin, J Stühmer, F Bogo… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present a comprehensive framework for egocentric interaction recognition using
markerless 3D annotations of two hands manipulating objects. To this end, we propose a …

Cited by 176

[PDF] arxiv.org

A comprehensive study of deep video action recognition

Y Zhu, X Li, C Liu, M Zolfaghari, Y Xiong, C Wu… - arXiv preprint arXiv …, 2020 - arxiv.org

Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …

Cited by 243

[PDF] thecvf.com

Human poseitioning system (hps): 3d human pose estimation and self-localization in large scenes from body-mounted sensors

V Guzov, A Mir, T Sattler… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We introduce (HPS) Human POSEitioning System, a method to recover the full 3D
pose of a human registered with a 3D scan of the surrounding environment using wearable …

Cited by 156

[PDF] thecvf.com

First-person hand action benchmark with rgb-d videos and 3d hand pose annotations

G Garcia-Hernando, S Yuan… - Proceedings of the …, 2018 - openaccess.thecvf.com

In this work we study the use of 3D hand poses to recognize first-person dynamic hand
actions interacting with 3D objects. Towards this goal, we collected RGB-D video sequences …

Cited by 634

[PDF] thecvf.com

H+O: Unified egocentric recognition of 3d hand-object poses and interactions

B Tekin, F Bogo, M Pollefeys - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

We present a unified framework for understanding 3D hand and object interactions in raw
image sequences from egocentric RGB cameras. Given a single RGB image, our model …

Cited by 323

[PDF] thecvf.com

In the eye of beholder: Joint learning of gaze and actions in first person video

Y Li, M Liu, JM Rehg - Proceedings of the European …, 2018 - openaccess.thecvf.com

We address the task of jointly determining what a person is doing and where they are
looking based on the analysis of video captured by a headworn camera. We propose a …

Cited by 390

Going deeper into first-person activity recognition

Analysis of the hands in egocentric vision: A survey
