- Academic Search

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

保存引用被引用数: 70 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

LEGO: L earning EGO centric Action Frame Generation via Visual Instruction Tuning

B Lai, X Dai, L Chen, G Pang, JM Rehg… - European Conference on …, 2024 - Springer

Generating instructional images of human daily actions from an egocentric viewpoint serves
as a key step towards efficient skill transfer. In this paper, we introduce a novel problem …

保存引用被引用数: 6 関連記事全 2 バージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models

F Wang, H Yin, Y Dong, H Zhu, C Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

The inversion of diffusion model sampling, which aims to find the corresponding initial noise
of a sample, plays a critical role in various tasks. Recently, several heuristic exact inversion …

保存引用被引用数: 3 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Conditional Image Synthesis with Diffusion Models: A Survey

Z Zhan, D Chen, JP Mei, Z Zhao, J Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

Conditional image synthesis based on user-specified requirements is a key component in
creating complex visual content. In recent years, diffusion-based generative modeling has …

保存引用被引用数: 1 関連記事全 4 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Exploring the latent space of diffusion models directly through singular value decomposition

L Wang, B Gao, Y Li, Z Wang, X Yang… - arxiv preprint arxiv …, 2025 - arxiv.org

Despite the groundbreaking success of diffusion models in generating high-fidelity images,
their latent space remains relatively under-explored, even though it holds significant promise …

保存引用関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Addressing Attribute Leakages in Diffusion-based Image Editing without Training

S Mun, J Nam, S Cho, J Ok - arxiv preprint arxiv:2412.04715, 2024 - arxiv.org

Diffusion models have become a cornerstone in image editing, offering flexibility with
language prompts and source images. However, a key challenge is attribute leakage, where …

保存引用関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

J Wang, C Wang, T Cao, J Huang, L ** - arxiv preprint arxiv:2403.04997, 2024 - arxiv.org

We present DiffChat, a novel method to align Large Language Models (LLMs) to" chat" with
prompt-as-input Text-to-Image Synthesis (TIS) models (eg, Stable Diffusion) for interactive …

保存引用被引用数: 2 関連記事全 2 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques

Diffusion model-based image editing: A survey

LEGO: L earning EGO centric Action Frame Generation via Visual Instruction Tuning

BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models

Conditional Image Synthesis with Diffusion Models: A Survey

Exploring the latent space of diffusion models directly through singular value decomposition

Addressing Attribute Leakages in Diffusion-based Image Editing without Training

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation