Attacks and defenses for generative diffusion models: A comprehensive survey

VT Truong, LB Dang, LB Le - arxiv preprint arxiv:2408.03400, 2024 - arxiv.org
Diffusion models (DMs) have achieved state-of-the-art performance on various generative
tasks such as image synthesis, text-to-image, and text-guided image-to-image generation …

Latent guard: a safety framework for text-to-image generation

R Liu, A Khakzar, J Gu, Q Chen, P Torr… - European Conference on …, 2024 - Springer
With the ability to generate high-quality images, text-to-image (T2I) models can be exploited
for creating inappropriate content. To prevent misuse, existing safety measures are either …

Safety at Scale: A Comprehensive Survey of Large Model Safety

X Ma, Y Gao, Y Wang, R Wang, X Wang, Y Sun… - arxiv preprint arxiv …, 2025 - arxiv.org
The rapid advancement of large models, driven by their exceptional abilities in learning and
generalization through large-scale pre-training, has reshaped the landscape of Artificial …

Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey

Y Zhang, Z Chen, CH Cheng, W Ruan, X Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Text-to-Image (T2I) Diffusion Models (DMs) have garnered widespread attention for their
impressive advancements in image generation. However, their growing popularity has …

Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement

Z Chang, M Li, J Wang, Y Liu, Q Wang, Y Liu - arxiv preprint arxiv …, 2024 - arxiv.org
Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability
to generate high-quality images from textual descriptions. However, these models often …

Prompt suffix-attack against text-to-image diffusion models

S **ong, Y Du, Z Wang, P Sun - Neurocomputing, 2025 - Elsevier
Text-to-image diffusion models (T2I DMs) have achieved excellent performance in image
generation. However, the adversarial robustness of T2I DMs has not been sufficiently …

Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation

T Oorloff, Y Yacoob, A Shrivastava - arxiv preprint arxiv:2502.16872, 2025 - arxiv.org
Diffusion models, while increasingly adept at generating realistic images, are notably
hindered by hallucinations--unrealistic or incorrect features inconsistent with the trained data …

Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?

Y Han, A Han, W Huang, C Lu, D Zou - arxiv preprint arxiv:2502.04725, 2025 - arxiv.org
Despite the remarkable success of diffusion models (DMs) in data generation, they exhibit
specific failure cases with unsatisfactory outputs. We focus on one such limitation: the ability …

Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation

GM Shahariar, J Chen, J Li, Y Dong - arxiv preprint arxiv:2409.15381, 2024 - arxiv.org
Recent studies show that text-to-image (T2I) models are vulnerable to adversarial attacks,
especially with noun perturbations in text prompts. In this study, we investigate the impact of …

From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition

M Qraitem, K Saenko, BA Plummer - European Conference on Computer …, 2024 - Springer
Visual recognition models are prone to learning spurious correlations induced by a biased
training set where certain conditions B (eg, Indoors) are over-represented in certain classes …