Μελετητής Google

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer

The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 106 Σχετικά άρθρα Όλες οι 6 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation

L Qu, S Wu, H Fei, L Nie, TS Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org

In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it
possible to generate rich kinds of novel photorealistic images. However, current models still …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 98 Σχετικά άρθρα Όλες οι 3 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Spatial-temporal transformer for dynamic scene graph generation

Y Cong, W Liao, H Ackermann… - Proceedings of the …, 2021 - openaccess.thecvf.com

Dynamic scene graph generation aims at generating a scene graph of the given video.
Compared to the task of scene graph generation from images, it is more challenging …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 155 Σχετικά άρθρα Όλες οι 12 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Frido: Feature pyramid diffusion for complex scene image synthesis

WC Fan, YC Chen, DD Chen, Y Cheng… - Proceedings of the …, 2023 - ojs.aaai.org

Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 83 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Text to image generation with semantic-spatial aware gan

W Liao, K Hu, MY Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Text-to-image synthesis (T2I) aims to generate photo-realistic images which are
semantically consistent with the text descriptions. Existing methods are usually built upon …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 179 Σχετικά άρθρα Όλες οι 9 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DrivingDiffusion: Layout-Guided Multi-view Driving Scenarios Video Generation with Latent Diffusion Model

X Li, Y Zhang, X Ye - European Conference on Computer Vision, 2024 - Springer

With the surge in autonomous driving technologies, the reliance on comprehensive and high-
definition bird's-eye-view (BEV) representations has become paramount. This burgeoning …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 30 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Scenecomposer: Any-level semantic image synthesis

Y Zeng, Z Lin, J Zhang, Q Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

We propose a new framework for conditional image synthesis from semantic layouts of any
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 44 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Layoutdiffuse: Adapting foundational diffusion models for layout-to-image generation

J Cheng, X Liang, X Shi, T He, T **ao, M Li - arxiv preprint arxiv …, 2023 - arxiv.org

Layout-to-image generation refers to the task of synthesizing photo-realistic images based
on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 62 Σχετικά άρθρα Όλες οι 2 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Modeling image composition for complex scene generation

Z Yang, D Liu, C Wang, J Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We present a method that achieves state-of-the-art results on challenging (few-shot) layout-
to-image generation tasks by accurately modeling textures, structures and relationships …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 55 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Diffusion-based scene graph to image generation with masked contrastive pre-training

L Yang, Z Huang, Y Song, S Hong, G Li… - arxiv preprint arxiv …, 2022 - arxiv.org

Generating images from graph-structured inputs, such as scene graphs, is uniquely
challenging due to the difficulty of aligning nodes and connections in graphs with objects …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 45 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Context-aware layout to image generation with enhanced object appearance

Image generation: A review

Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation

Spatial-temporal transformer for dynamic scene graph generation

Frido: Feature pyramid diffusion for complex scene image synthesis

Text to image generation with semantic-spatial aware gan

DrivingDiffusion: Layout-Guided Multi-view Driving Scenarios Video Generation with Latent Diffusion Model

Scenecomposer: Any-level semantic image synthesis

Layoutdiffuse: Adapting foundational diffusion models for layout-to-image generation

Modeling image composition for complex scene generation

Diffusion-based scene graph to image generation with masked contrastive pre-training