- Academic Search

HB Zhang, YX Zhang, B Zhong, Q Lei, L Yang, JX Du… - Sensors, 2019 - mdpi.com

Although widely used in many applications, accurate and efficient human action recognition
remains a challenging area of research in the field of computer vision. Most recent surveys …

Enregistrer Citer Cité 584 fois Autres articles Les 9 versions Free GPT-4 En cache

[Free GPT-4]

[PDF] arxiv.org

A comprehensive survey of scene graphs: Generation and application

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

Enregistrer Citer Cité 359 fois Autres articles Les 15 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Clip-event: Connecting text and images with event structures

M Li, R Xu, S Wang, L Zhou, X Lin… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract Vision-language (V+ L) pretraining models have achieved great success in
supporting multimedia applications by understanding the alignments between images and …

Enregistrer Citer Cité 142 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Reconstructing hands in 3d with transformers

G Pavlakos, D Shan, I Radosavovic… - Proceedings of the …, 2024 - openaccess.thecvf.com

We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …

Enregistrer Citer Cité 66 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Learning human-object interactions by graph parsing neural networks

S Qi, W Wang, B Jia, J Shen… - Proceedings of the …, 2018 - openaccess.thecvf.com

This paper addresses the task of detecting and recognizing human-object interactions (HOI)
in images and videos. We introduce the Graph Parsing Neural Network (GPNN), a …

Enregistrer Citer Cité 674 fois Autres articles Les 13 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Drg: Dual relation graph for human-object interaction detection

C Gao, J Xu, Y Zou, JB Huang - … Conference, Glasgow, UK, August 23–28 …, 2020 - Springer

We tackle the challenging problem of human-object interaction (HOI) detection. Existing
methods either recognize the interaction of each human-object pair in isolation or perform …

Enregistrer Citer Cité 251 fois Autres articles Les 5 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Neural motifs: Scene graph parsing with global context

R Zellers, M Yatskar, S Thomson… - Proceedings of the …, 2018 - openaccess.thecvf.com

We investigate the problem of producing structured graph representations of visual scenes.
Our work analyzes the role of motifs: regularly appearing substructures in scene graphs. We …

Enregistrer Citer Cité 1168 fois Autres articles Les 9 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Understanding human hands in contact at internet scale

D Shan, J Geng, M Shu… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Hands are the central means by which humans manipulate their world and being able to
reliably extract hand state information from Internet videos of humans engaged in their …

Enregistrer Citer Cité 347 fois Autres articles Les 12 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Ava: A video dataset of spatio-temporally localized atomic visual actions

C Gu, C Sun, DA Ross, C Vondrick… - Proceedings of the …, 2018 - openaccess.thecvf.com

This paper introduces a video dataset of spatio-temporally localized Atomic Visual Actions
(AVA). The AVA dataset densely annotates 80 atomic visual actions in 437 15-minute video …

Enregistrer Citer Cité 1270 fois Autres articles Les 20 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Scene graph generation by iterative message passing

D Xu, Y Zhu, CB Choy, L Fei-Fei - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Understanding a visual scene goes beyond recognizing individual objects in isolation.
Relationships between objects also constitute rich semantic information about the scene. In …

Enregistrer Citer Cité 1495 fois Autres articles Les 11 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Hico: A benchmark for recognizing human-object interactions in images

A comprehensive survey of vision-based human action recognition methods

A comprehensive survey of scene graphs: Generation and application

Clip-event: Connecting text and images with event structures

Reconstructing hands in 3d with transformers

Learning human-object interactions by graph parsing neural networks

Drg: Dual relation graph for human-object interaction detection

Neural motifs: Scene graph parsing with global context

Understanding human hands in contact at internet scale

Ava: A video dataset of spatio-temporally localized atomic visual actions

Scene graph generation by iterative message passing