Contextual translation embedding for visual relationship detection and scene graph generation
Relations amongst entities play a central role in image understanding. Due to the complexity
of modeling (subject, predicate, object) relation triplets, it is crucial to develop a method that …
of modeling (subject, predicate, object) relation triplets, it is crucial to develop a method that …
[PDF][PDF] Localizing spatially and temporally objects and actions in videos
V Kalogeiton - 2018 - academia.edu
The rise of deep learning has facilitated remarkable progress in video understanding. This
thesis addresses three important tasks of video understanding: video object detection, joint …
thesis addresses three important tasks of video understanding: video object detection, joint …
Visual relationship understanding
ZS Hung - 2020 - ideals.illinois.edu
This thesis addresses two visual understanding tasks: visual relationship detection (VRD)
and video action recognition. The majority of the thesis is focused on VRD, which is our main …
and video action recognition. The majority of the thesis is focused on VRD, which is our main …