Robust natural language processing: Recent advances, challenges, and future directions
Recent natural language processing (NLP) techniques have accomplished high
performance on benchmark data sets, primarily due to the significant improvement in the …
performance on benchmark data sets, primarily due to the significant improvement in the …
Adversarial attack and defense technologies in natural language processing: A survey
S Qiu, Q Liu, S Zhou, W Huang - Neurocomputing, 2022 - Elsevier
Recently, the adversarial attack and defense technology has made remarkable
achievements and has been widely applied in the computer vision field, promoting its rapid …
achievements and has been widely applied in the computer vision field, promoting its rapid …
Learning to discriminate perturbations for blocking adversarial attacks in text classification
Adversarial attacks against machine learning models have threatened various real-world
applications such as spam filtering and sentiment analysis. In this paper, we propose a …
applications such as spam filtering and sentiment analysis. In this paper, we propose a …
[PDF][PDF] Defense against synonym substitution-based adversarial attacks via Dirichlet neighborhood ensemble
Although deep neural networks have achieved prominent performance on many NLP tasks,
they are vulnerable to adversarial examples. We propose Dirichlet Neighborhood Ensemble …
they are vulnerable to adversarial examples. We propose Dirichlet Neighborhood Ensemble …
Text adversarial attacks and defenses: Issues, taxonomy, and perspectives
Deep neural networks (DNNs) have been widely used in many fields due to their powerful
representation learning capabilities. However, they are exposed to serious threats caused …
representation learning capabilities. However, they are exposed to serious threats caused …
Certified robustness to text adversarial attacks by randomized [mask]
Very recently, few certified defense methods have been developed to provably guarantee
the robustness of a text classifier to adversarial synonym substitutions. However, all the …
the robustness of a text classifier to adversarial synonym substitutions. However, all the …
Coco: Controllable counterfactuals for evaluating dialogue state trackers
Dialogue state trackers have made significant progress on benchmark datasets, but their
generalization capability to novel and realistic scenarios beyond the held-out conversations …
generalization capability to novel and realistic scenarios beyond the held-out conversations …
Character-level white-box adversarial attacks against transformers via attachable subwords substitution
We propose the first character-level white-box adversarial attack method against transformer
models. The intuition of our method comes from the observation that words are split into …
models. The intuition of our method comes from the observation that words are split into …
Sgd-x: A benchmark for robust generalization in schema-guided dialogue systems
Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue
research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling …
research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling …
Defense against adversarial attacks in nlp via dirichlet neighborhood ensemble
Despite neural networks have achieved prominent performance on many natural language
processing (NLP) tasks, they are vulnerable to adversarial examples. In this paper, we …
processing (NLP) tasks, they are vulnerable to adversarial examples. In this paper, we …