Google 學術搜尋

FA Croitoru, V Hondru, RT Ionescu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Denoising diffusion models represent a recent emerging topic in computer vision,
demonstrating remarkable results in the area of generative modeling. A diffusion model is a …

儲存引用被引用 1326 次相關文章全部共 7 個版本

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Universal guidance for diffusion models

A Bansal, HM Chu, A Schwarzschild… - Proceedings of the …, 2023 - openaccess.thecvf.com

Typical diffusion models are trained to accept a particular form of conditioning, most
commonly text, and cannot be conditioned on other modalities without retraining. In this …

儲存引用被引用 230 次相關文章全部共 7 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Multidiffusion: Fusing diffusion paths for controlled image generation

O Bar-Tal, L Yariv, Y Lipman, T Dekel - 2023 - openreview.net

Recent advances in text-to-image generation with diffusion models present transformative
capabilities in image quality. However, user controllability of the generated image, and fast …

儲存引用被引用 235 次相關文章全部共 7 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Collaborative diffusion for multi-modal face generation and editing

Z Huang, KCK Chan, Y Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Diffusion models arise as a powerful generative tool recently. Despite the great progress,
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …

儲存引用被引用 121 次相關文章全部共 5 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multimodal image synthesis and editing: The generative AI era

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

儲存引用被引用 42 次相關文章全部共 8 個版本

[Free GPT-4]
[DeepSeek]

[PDF] mpg.de

[PDF][PDF] Multimodal image synthesis and editing: A survey

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - arxiv preprint arxiv …, 2022 - pure.mpg.de

As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

儲存引用被引用 259 次相關文章全部共 3 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Freemask: Synthetic images with dense annotations make stronger segmentation models

L Yang, X Xu, B Kang, Y Shi… - Advances in Neural …, 2023 - proceedings.neurips.cc

Semantic segmentation has witnessed tremendous progress due to the proposal of various
advanced network architectures. However, they are extremely hungry for delicate …

儲存引用被引用 42 次相關文章全部共 7 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Freestyle layout-to-image synthesis

H Xue, Z Huang, Q Sun, L Song… - Proceedings of the …, 2023 - openaccess.thecvf.com

Typical layout-to-image synthesis (LIS) models generate images for a closed set of semantic
classes, eg, 182 common objects in COCO-Stuff. In this work, we explore the freestyle …

儲存引用被引用 65 次相關文章全部共 9 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Zero-shot spatial layout conditioning for text-to-image diffusion models

G Couairon, M Careil, M Cord… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large-scale text-to-image diffusion models have significantly improved the state of the art in
generative image modeling and allow for an intuitive and powerful user interface to drive the …

儲存引用被引用 58 次相關文章全部共 6 個版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Generative semantic communication: Diffusion models beyond bit recovery

E Grassucci, S Barbarossa, D Comminiello - arxiv preprint arxiv …, 2023 - arxiv.org

Semantic communication is expected to be one of the cores of next-generation AI-based
communications. One of the possibilities offered by semantic communication is the capability …

儲存引用被引用 58 次相關文章全部共 3 個版本 HTML 版

建立快訊

引用

進階搜尋

已儲存至「我的圖書館」

Semantic image synthesis via diffusion models

Diffusion models in vision: A survey

Universal guidance for diffusion models

Multidiffusion: Fusing diffusion paths for controlled image generation

Collaborative diffusion for multi-modal face generation and editing

Multimodal image synthesis and editing: The generative AI era

[PDF][PDF] Multimodal image synthesis and editing: A survey

Freemask: Synthetic images with dense annotations make stronger segmentation models

Freestyle layout-to-image synthesis

Zero-shot spatial layout conditioning for text-to-image diffusion models

Generative semantic communication: Diffusion models beyond bit recovery