Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Better diffusion models further improve adversarial training
It has been recognized that the data generated by the denoising diffusion probabilistic
model (DDPM) improves adversarial training. After two years of rapid development in …
model (DDPM) improves adversarial training. After two years of rapid development in …
Diffusion models for adversarial purification
Adversarial purification refers to a class of defense methods that remove adversarial
perturbations using a generative model. These methods do not make assumptions on the …
perturbations using a generative model. These methods do not make assumptions on the …
Smoothllm: Defending large language models against jailbreaking attacks
Despite efforts to align large language models (LLMs) with human values, widely-used
LLMs such as GPT, Llama, Claude, and PaLM are susceptible to jailbreaking attacks …
LLMs such as GPT, Llama, Claude, and PaLM are susceptible to jailbreaking attacks …
Understanding robust overfitting of adversarial training and beyond
Robust overfitting widely exists in adversarial training of deep networks. The exact
underlying reasons for this are still not completely understood. Here, we explore the causes …
underlying reasons for this are still not completely understood. Here, we explore the causes …
Robust evaluation of diffusion-based adversarial purification
We question the current evaluation practice on diffusion-based purification methods.
Diffusion-based purification methods aim to remove adversarial effects from an input data …
Diffusion-based purification methods aim to remove adversarial effects from an input data …
On the robustness of open-world test-time training: Self-training with dynamic prototype expansion
Generalizing deep learning models to unknown target domain distribution with low latency
has motivated research into test-time training/adaptation (TTT/TTA). Existing approaches …
has motivated research into test-time training/adaptation (TTT/TTA). Existing approaches …
DISCO: Adversarial defense with local implicit functions
The problem of adversarial defenses for image classification, where the goal is to robustify a
classifier against adversarial examples, is considered. Inspired by the hypothesis that these …
classifier against adversarial examples, is considered. Inspired by the hypothesis that these …
SoK: Explainable machine learning in adversarial environments
Modern deep learning methods have long been considered black boxes due to the lack of
insights into their decision-making process. However, recent advances in explainable …
insights into their decision-making process. However, recent advances in explainable …
Visual prompting for adversarial robustness
In this work, we leverage visual prompting (VP) to improve adversarial robustness of a fixed,
pre-trained model at test time. Compared to conventional adversarial defenses, VP allows …
pre-trained model at test time. Compared to conventional adversarial defenses, VP allows …
Threat model-agnostic adversarial defense using diffusion models
Deep Neural Networks (DNNs) are highly sensitive to imperceptible malicious perturbations,
known as adversarial attacks. Following the discovery of this vulnerability in real-world …
known as adversarial attacks. Following the discovery of this vulnerability in real-world …