A survey of attacks on large vision-language models: Resources, advances, and future trends

D Liu, M Yang, X Qu, P Zhou, Y Cheng… - arxiv preprint arxiv …, 2024 - arxiv.org
With the significant development of large models in recent years, Large Vision-Language
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …

Enhancing the Transferability of Adversarial Attacks with Stealth Preservation

X Zhang, T Zhang, Y Zhang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Deep neural networks are susceptible to attacks from adversarial examples in recent years.
Especially the black-box attacks cause a more serious threat to practical applications …

Module-wise adaptive adversarial training for end-to-end autonomous driving

T Zhang, L Wang, J Kang, X Zhang, S Liang… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in deep learning have markedly improved autonomous driving (AD)
models, particularly end-to-end systems that integrate perception, prediction, and planning …

APBAM: Adversarial perturbation-driven backdoor attack in multimodal learning

S Zhang, W Chen, X Li, Q Liu, G Wang - Information Sciences, 2025 - Elsevier
Due to the reliance on the cloud for training, multimodal learning models are vulnerable to
multimodal backdoor attacks. However, such attacks often use static trigger patterns, which …

LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling

J Liu, J Li, K Wang, H Guo, J Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently many approaches directly operate on point clouds for different tasks. These
approaches become more computation and storage demanding when point cloud size is …

Patch is enough: naturalistic adversarial patch against vision-language pre-training models

D Kong, S Liang, X Zhu, Y Zhong, W Ren - Visual Intelligence, 2024 - Springer
Visual language pre-training (VLP) models have demonstrated significant success in
various domains, but they remain vulnerable to adversarial attacks. Addressing these …

Security matrix for multimodal agents on mobile devices: A systematic and proof of concept study

Y Yang, X Yang, S Li, C Lin, Z Zhao, C Shen… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid progress in the reasoning capability of the Multi-modal Large Language Models
(MLLMs) has triggered the development of autonomous agent systems on mobile devices …

[HTML][HTML] RobustE2E: Exploring the Robustness of End-to-End Autonomous Driving

W Jiang, L Wang, T Zhang, Y Chen, J Dong, W Bao… - Electronics, 2024 - mdpi.com
Autonomous driving technology has advanced significantly with deep learning, but noise
and attacks threaten its real-world deployment. While research has revealed vulnerabilities …

Energy-latency manipulation of multi-modal large language models via verbose samples

K Gao, J Gu, Y Bai, ST **a, P Torr, W Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite the exceptional performance of multi-modal large language models (MLLMs), their
deployment requires substantial computational resources. Once malicious users induce …

Visual Adversarial Attack on Vision-Language Models for Autonomous Driving

T Zhang, L Wang, X Zhang, Y Zhang, B Jia… - arxiv preprint arxiv …, 2024 - arxiv.org
Vision-language models (VLMs) have significantly advanced autonomous driving (AD) by
enhancing reasoning capabilities. However, these models remain highly vulnerable to …