A survey of attacks on large vision-language models: Resources, advances, and future trends
With the significant development of large models in recent years, Large Vision-Language
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
Unbridled icarus: A survey of the potential perils of image inputs in multimodal large language model security
Multimodal Large Language Models (MLLMs) demonstrate remarkable capabilities that
increasingly influence various aspects of our daily lives, constantly defining the new …
increasingly influence various aspects of our daily lives, constantly defining the new …
Artificial intelligence for biomedical video generation
As a prominent subfield of Artificial Intelligence Generated Content (AIGC), video generation
has achieved notable advancements in recent years. The introduction of Sora-alike models …
has achieved notable advancements in recent years. The introduction of Sora-alike models …
Few-Shot Adversarial Prompt Learning on Vision-Language Models
The vulnerability of deep neural networks to imperceptible adversarial perturbations has
attracted widespread attention. Inspired by the success of vision-language foundation …
attracted widespread attention. Inspired by the success of vision-language foundation …
Adversarial Attacks of Vision Tasks in the Past 10 Years: A Survey
Adversarial attacks, which manipulate input data to undermine model availability and
integrity, pose significant security threats during machine learning inference. With the advent …
integrity, pose significant security threats during machine learning inference. With the advent …
Navigating the risks: A survey of security, privacy, and ethics threats in llm-based agents
With the continuous development of large language models (LLMs), transformer-based
models have made groundbreaking advances in numerous natural language processing …
models have made groundbreaking advances in numerous natural language processing …
B-AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Black-box Adversarial Visual-Instructions
Large Vision-Language Models (LVLMs) have shown significant progress in responding
well to visual-instructions from users. However, these instructions, encompassing images …
well to visual-instructions from users. However, these instructions, encompassing images …
Adversarial Prompt Distillation for Vision-Language Models
Large pre-trained Vision-Language Models (VLMs) such as Contrastive Language-Image
Pre-Training (CLIP) have been shown to be susceptible to adversarial attacks, raising …
Pre-Training (CLIP) have been shown to be susceptible to adversarial attacks, raising …
Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning
Multimodal contrastive learning models (eg, CLIP) can learn high-quality representations
from large-scale image-text datasets, yet they exhibit significant vulnerabilities to backdoor …
from large-scale image-text datasets, yet they exhibit significant vulnerabilities to backdoor …
[HTML][HTML] MDAPT: Multi-Modal Depth Adversarial Prompt Tuning to Enhance the Adversarial Robustness of Visual Language Models
C Li, Y Liao, C Ding, Z Ye - Sensors, 2025 - mdpi.com
Large visual language models like Contrastive Language-Image Pre-training (CLIP), despite
their excellent performance, are highly vulnerable to the influence of adversarial examples …
their excellent performance, are highly vulnerable to the influence of adversarial examples …