Scene graph generation: A comprehensive survey

G Zhu, L Zhang, Y Jiang, Y Dang, H Hou… - arxiv preprint arxiv …, 2022 - arxiv.org
Deep learning techniques have led to remarkable breakthroughs in the field of generic
object detection and have spawned a lot of scene-understanding tasks in recent years …

The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale

A Kuznetsova, H Rom, N Alldrin, J Uijlings… - International journal of …, 2020 - Springer
Abstract We present Open Images V4, a dataset of 9.2 M images with unified annotations for
image classification, object detection and visual relationship detection. The images have a …

Stacked hybrid-attention and group collaborative learning for unbiased scene graph generation

X Dong, T Gan, X Song, J Wu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Scene Graph Generation, which generally follows a regular encoder-decoder
pipeline, aims to first encode the visual contents within the given image and then parse them …

[HTML][HTML] Scene graph generation: A comprehensive survey

H Li, G Zhu, L Zhang, Y Jiang, Y Dang, H Hou, P Shen… - Neurocomputing, 2024 - Elsevier
Deep learning techniques have led to remarkable breakthroughs in the field of object
detection and have spawned a lot of scene-understanding tasks in recent years. Scene …

Bridging knowledge graphs to generate scene graphs

A Zareian, S Karaman, SF Chang - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Scene graphs are powerful representations that parse images into their abstract semantic
elements, ie, objects and their interactions, which facilitates visual comprehension and …

Webly supervised concept expansion for general purpose vision models

A Kamath, C Clark, T Gupta, E Kolve, D Hoiem… - … on Computer Vision, 2022 - Springer
Abstract General Purpose Vision (GPV) systems are models that are designed to solve a
wide array of visual tasks without requiring architectural changes. Today, GPVs primarily …

Automatic animation of hair blowing in still portrait photos

W **ao, W Liu, Y Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a novel approach to animate human hair in a still portrait photo. Existing work
has largely studied the animation of fluid elements such as water and fire. However, hair …

Video relationship reasoning using gated spatio-temporal energy graph

YHH Tsai, S Divvala, LP Morency… - Proceedings of the …, 2019 - openaccess.thecvf.com
Visual relationship reasoning is a crucial yet challenging task for understanding rich
interactions across visual concepts. For example, a relationship\man, open, door\involves a …

Hierarchical graph attention network for visual relationship detection

L Mi, Z Chen - Proceedings of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Abstract Visual Relationship Detection (VRD) aims to describe the relationship between two
objects by providing a structural triplet shown as. Existing graph-based methods mainly …

On exploring undetermined relationships for visual relationship detection

Y Zhan, J Yu, T Yu, D Tao - … of the IEEE/CVF Conference on …, 2019 - openaccess.thecvf.com
In visual relationship detection, human-notated relationships can be regarded as
determinate relationships. However, there are still large amount of unlabeled data, such as …