Adversarial attacks and defenses on text-to-image diffusion models: A survey

C Zhang, M Hu, W Li, L Wang - Information Fusion, 2024 - Elsevier
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …

[HTML][HTML] Digital Sentinels and Antagonists: The Dual Nature of Chatbots in Cybersecurity

H Szmurlo, Z Akhtar - Information, 2024 - mdpi.com
Advancements in artificial intelligence, machine learning, and natural language processing
have culminated in sophisticated technologies such as transformer models, generative AI …

LLMs for cyber security: New opportunities

DM Divakaran, ST Peddinti - arxiv preprint arxiv:2404.11338, 2024 - arxiv.org
Large language models (LLMs) are a class of powerful and versatile models that are
beneficial to many industries. With the emergence of LLMs, we take a fresh look at cyber …

Perception-guided jailbreak against text-to-image models

Y Huang, L Liang, T Li, X Jia, R Wang, W Miao… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, Text-to-Image (T2I) models have garnered significant attention due to their
remarkable advancements. However, security concerns have emerged due to their potential …

Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey

X Liu, X Cui, P Li, Z Li, H Huang, S **a, M Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid evolution of multimodal foundation models has led to significant advancements in
cross-modal understanding and generation across diverse modalities, including text …

Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies

P Wang, Q Li, L Yu, Z Wang, A Li, H ** - … of the 2024 on ACM SIGSAC …, 2024 - dl.acm.org
We present Moderator, a policy-based model management system that allows
administrators to specify fine-grained content moderation policies and modify the weights of …

ColJailBreak: Collaborative Generation and Editing for Jailbreaking Text-to-Image Deep Generation

Y Ma, S Pang, Q Guo, T Wei… - Advances in Neural …, 2025 - proceedings.neurips.cc
The commercial text-to-image deep generation models (eg DALL· E) can produce high-
quality images based on input language descriptions. These models incorporate a black …

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

L Li, Z Shi, X Hu, B Dong, Y Qin, X Liu, L Sheng… - arxiv preprint arxiv …, 2025 - arxiv.org
Text-to-image (T2I) models have rapidly advanced, enabling the generation of high-quality
images from text prompts across various domains. However, these models present notable …

CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Z **g, Z Ying, L Wang, S Liang, A Liu, X Liu… - arxiv preprint arxiv …, 2025 - arxiv.org
The development of text-to-image (T2I) generative models, that enable the creation of high-
quality synthetic images from textual prompts, has opened new frontiers in creative design …

AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models

Y Wang, J Chen, Q Li, X Yang, S Ji - arxiv preprint arxiv:2412.18123, 2024 - arxiv.org
As text-to-image (T2I) models continue to advance and gain widespread adoption, their
associated safety issues are becoming increasingly prominent. Malicious users often exploit …