Watch out for your agents! investigating backdoor threats to llm-based agents

W Yang, X Bi, Y Lin, S Chen… - Advances in Neural …, 2025 - proceedings.neurips.cc
Driven by the rapid development of Large Language Models (LLMs), LLM-based agents
have been developed to handle various real-world applications, including finance …

Cooperative Backdoor Attack in Decentralized Reinforcement Learning with Theoretical Guarantee

M Gao, Y Zou, Z Zhang, X Cheng, D Yu - arxiv preprint arxiv:2405.15245, 2024 - arxiv.org
The safety of decentralized reinforcement learning (RL) is a challenging problem since
malicious agents can share their poisoned policies with benign agents. The paper …

Combinational Backdoor Attack against Customized Text-to-Image Models

W Jiang, J He, H Li, G Xu, R Zhang, H Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Recently, Text-to-Image (T2I) synthesis technology has made tremendous strides.
Numerous representative T2I models have emerged and achieved promising application …

BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems

Y Yu, S Yan, X Yin, J Fang, J Liu - arxiv preprint arxiv:2501.01593, 2025 - arxiv.org
Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-
MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will …

Robustness Evaluation of Offline Reinforcement Learning for Robot Control Against Action Perturbations

S Ayabe, T Otomo, H Kera, K Kawamoto - arxiv preprint arxiv:2412.18781, 2024 - arxiv.org
Offline reinforcement learning, which learns solely from datasets without environmental
interaction, has gained attention. This approach, similar to traditional online deep …

无人系统中离线**化学**的隐蔽数据投毒攻击方法

周雪, 苘大鹏, 许晨, 吕继光, 曾凡一, 高朝阳… - 通信学报, 2024 - infocomm-journal.com
针对现有离线**化学**数据投毒攻击方法有效性及隐蔽性不足的问题, 提出一种关键时间步动态
投毒攻击方法, 通过对重要性较高的样本进行动态扰动, 实现高效隐蔽的攻击效果. 具体来说 …

Towards robust, secure, and privacy-aware large language models of code

Z YANG - 2024 - ink.library.smu.edu.sg
The field of software engineering has witnessed a surge in large language models
specifically tailored to understand and process code, which we call large language models …

Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving

X Chen, S Feng, Z **ong, S An, Y Mao, G Tao, W Guo… - openreview.net
End-to-end autonomous driving (AD) systems integrate complex decision-making
processes. Assessing the safety of these systems against potential security threats, including …