Image generation: A review
The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …
Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation
In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it
possible to generate rich kinds of novel photorealistic images. However, current models still …
possible to generate rich kinds of novel photorealistic images. However, current models still …
Spatial-temporal transformer for dynamic scene graph generation
Dynamic scene graph generation aims at generating a scene graph of the given video.
Compared to the task of scene graph generation from images, it is more challenging …
Compared to the task of scene graph generation from images, it is more challenging …
Frido: Feature pyramid diffusion for complex scene image synthesis
Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …
However, when it comes to producing images with complex scenes, how to properly …
Text to image generation with semantic-spatial aware gan
Text-to-image synthesis (T2I) aims to generate photo-realistic images which are
semantically consistent with the text descriptions. Existing methods are usually built upon …
semantically consistent with the text descriptions. Existing methods are usually built upon …
DrivingDiffusion: Layout-Guided Multi-view Driving Scenarios Video Generation with Latent Diffusion Model
With the surge in autonomous driving technologies, the reliance on comprehensive and high-
definition bird's-eye-view (BEV) representations has become paramount. This burgeoning …
definition bird's-eye-view (BEV) representations has become paramount. This burgeoning …
Scenecomposer: Any-level semantic image synthesis
We propose a new framework for conditional image synthesis from semantic layouts of any
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …
Layoutdiffuse: Adapting foundational diffusion models for layout-to-image generation
Layout-to-image generation refers to the task of synthesizing photo-realistic images based
on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational …
on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational …
Modeling image composition for complex scene generation
We present a method that achieves state-of-the-art results on challenging (few-shot) layout-
to-image generation tasks by accurately modeling textures, structures and relationships …
to-image generation tasks by accurately modeling textures, structures and relationships …
Diffusion-based scene graph to image generation with masked contrastive pre-training
Generating images from graph-structured inputs, such as scene graphs, is uniquely
challenging due to the difficulty of aligning nodes and connections in graphs with objects …
challenging due to the difficulty of aligning nodes and connections in graphs with objects …