Contextual translation embedding for visual relationship detection and scene graph generation

ZS Hung, A Mallya, S Lazebnik - IEEE transactions on pattern …, 2020 - ieeexplore.ieee.org
Relations amongst entities play a central role in image understanding. Due to the complexity
of modeling (subject, predicate, object) relation triplets, it is crucial to develop a method that …

[PDF][PDF] Localizing spatially and temporally objects and actions in videos

V Kalogeiton - 2018 - academia.edu
The rise of deep learning has facilitated remarkable progress in video understanding. This
thesis addresses three important tasks of video understanding: video object detection, joint …

Visual relationship understanding

ZS Hung - 2020 - ideals.illinois.edu
This thesis addresses two visual understanding tasks: visual relationship detection (VRD)
and video action recognition. The majority of the thesis is focused on VRD, which is our main …