A comprehensive survey of scene graphs: Generation and application
A scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …
Helping hands: An object-aware ego-centric video recognition model
We introduce an object-aware decoder for improving the performance of spatio-temporal
representations on ego-centric videos. The key idea is to enhance object-awareness during …
AGQA: A benchmark for compositional spatio-temporal reasoning
Visual events are a composition of temporal actions involving actors spatially interacting with
objects. When developing computer vision models that can reason about compositional …