A survey of backdoor attacks and defenses on large language models: Implications for security measures
Large Language Models (LLMs), which bridge the gap between human language
understanding and complex problem-solving, achieve state-of-the-art performance on …
understanding and complex problem-solving, achieve state-of-the-art performance on …
Backdoor attacks and countermeasures in natural language processing models: A comprehensive security review
Applicating third-party data and models has become a new paradigm for language modeling
in NLP, which also introduces some potential security vulnerabilities because attackers can …
in NLP, which also introduces some potential security vulnerabilities because attackers can …
Defending against weight-poisoning backdoor attacks for parameter-efficient fine-tuning
Recently, various parameter-efficient fine-tuning (PEFT) strategies for application to
language models have been proposed and successfully implemented. However, this raises …
language models have been proposed and successfully implemented. However, this raises …
Beear: Embedding-based adversarial removal of safety backdoors in instruction-tuned language models
Safety backdoor attacks in large language models (LLMs) enable the stealthy triggering of
unsafe behaviors while evading detection during normal interactions. The high …
unsafe behaviors while evading detection during normal interactions. The high …
Mitigating backdoor threats to large language models: Advancement and challenges
The advancement of Large Language Models (LLMs) has significantly impacted various
domains, including Web search, healthcare, and software development. However, as these …
domains, including Web search, healthcare, and software development. However, as these …
Enhancing LLM Capabilities Beyond Scaling Up
General-purpose large language models (LLMs) are progressively expanding both in scale
and access to unpublic training data. This has led to notable progress in a variety of AI …
and access to unpublic training data. This has led to notable progress in a variety of AI …
[PDF][PDF] Combating security and privacy issues in the era of large language models
This tutorial seeks to provide a systematic summary of risks and vulnerabilities in security,
privacy and copyright aspects of large language models (LLMs), and most recent solutions …
privacy and copyright aspects of large language models (LLMs), and most recent solutions …
Navigating the risks: A survey of security, privacy, and ethics threats in llm-based agents
With the continuous development of large language models (LLMs), transformer-based
models have made groundbreaking advances in numerous natural language processing …
models have made groundbreaking advances in numerous natural language processing …
Rethinking Backdoor Detection Evaluation for Language Models
Backdoor attacks, in which a model behaves maliciously when given an attacker-specified
trigger, pose a major security risk for practitioners who depend on publicly released …
trigger, pose a major security risk for practitioners who depend on publicly released …
Two heads are better than one: Nested poe for robust defense against multi-backdoors
Data poisoning backdoor attacks can cause undesirable behaviors in large language
models (LLMs), and defending against them is of increasing importance. Existing defense …
models (LLMs), and defending against them is of increasing importance. Existing defense …