A comprehensive survey of scene graphs: Generation and application

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

Scene graph generation: A comprehensive survey

G Zhu, L Zhang, Y Jiang, Y Dang, H Hou… - ar**_Hands_An_Object-Aware_Ego-Centric_Video_Recognition_Model_ICCV_2023_paper.pdf" data-clk="hl=it&sa=T&oi=gga&ct=gga&cd=8&d=6184441514343437799&ei=9v6vZ4q9Ldqy6rQP56ab0AE" data-clk-atid="55WebMqM01UJ" target="_blank">[PDF] thecvf.com

Hel** hands: An object-aware ego-centric video recognition model

C Zhang, A Gupta, A Zisserman - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We introduce an object-aware decoder for improving the performance of spatio-temporal
representations on ego-centric videos. The key idea is to enhance object-awareness during …

Agqa: A benchmark for compositional spatio-temporal reasoning

M Grunde-McLaughlin, R Krishna… - Proceedings of the …, 2021 - openaccess.thecvf.com
Visual events are a composition of temporal actions involving actors spatially interacting with
objects. When develo** computer vision models that can reason about compositional …