Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Diffusion models in low-level vision: A survey

C He, Y Shen, C Fang, F Xiao, L Tang, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Deep generative models have garnered significant attention in low-level vision tasks due to
their generative capabilities. Among them, diffusion model-based solutions, characterized by …

Multi-domain awareness for compressed deepfake videos detection over social networks guided by common mechanisms between artifacts

Y Wang, Q Sun, D Rong, R Geng - Computer Vision and Image …, 2024 - Elsevier
The viral spread of massive deepfake videos over social networks has caused serious
security problems. Despite the remarkable advancements achieved by existing deepfake …

LocInv: localization-aware inversion for text-guided image editing

C Tang, K Wang, F Yang, J van de Weijer - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale Text-to-Image (T2I) diffusion models demonstrate significant generation
capabilities based on textual prompts. Based on the T2I diffusion models, text-guided image …

Image inpainting using diffusion models to restore eaves tile patterns in Chinese heritage buildings

X Zhong, W Chen, Z Guo, J Zhang, H Luo - Automation in Construction, 2025 - Elsevier
Wadangs (a type of eaves tile) are integral components of traditional Chinese buildings and
often suffer damage over time, resulting in the loss of pattern information. Currently, AI …

Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion

M Viola, K Qu, N Metzger, B Ke, A Becker… - arXiv preprint arXiv …, 2024 - arxiv.org
Depth completion upgrades sparse depth measurements into dense depth maps guided by
a conventional image. Existing methods for this highly ill-posed task operate in tightly …

Semantic-driven diffusion for sign language production with gloss-pose latent spaces alignment

S Chen, Q Wang, Q Wang - Computer Vision and Image Understanding, 2024 - Elsevier
Sign Language Production (SLP) aims to translate spoken language into visual
sign language sequences. The most challenging process in SLP is the transformation of a …

Can Stylized Products Generated by AI Better Attract User Attention? Using Eye-Tracking Technology for Research

Y Tang, C Chen - Applied Sciences, 2024 - mdpi.com
The emergence of AIGC has significantly improved design efficiency, enriched creativity,
and promoted innovation in the design industry. However, whether the content generated …

From Noise to Nuance: Advances in Deep Generative Image Models

B Peng, CX Liang, Z Bi, M Liu, Y Zhang, T Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Deep learning-based image generation has undergone a paradigm shift since 2021,
marked by fundamental architectural breakthroughs and computational innovations …

Flame Combustion State Detection Method of Cement Rotary Furnace Based on Improved RE-DDPM and DAF-FasterNet

Y Zhang, Z Gu, H Yu, S Shi - Applied Sciences, 2024 - mdpi.com
It is of great significance to effectively identify the flame-burning state of cement rotary kilns
to optimize the calcination process and ensure the quality of cement. However, high …