Robust natural language processing: Recent advances, challenges, and future directions

M Omar, S Choi, DH Nyang, D Mohaisen - IEEE Access, 2022 - ieeexplore.ieee.org
Recent natural language processing (NLP) techniques have accomplished high
performance on benchmark data sets, primarily due to the significant improvement in the …

Adversarial attack and defense technologies in natural language processing: A survey

S Qiu, Q Liu, S Zhou, W Huang - Neurocomputing, 2022 - Elsevier
Recently, the adversarial attack and defense technology has made remarkable
achievements and has been widely applied in the computer vision field, promoting its rapid …

Learning to discriminate perturbations for blocking adversarial attacks in text classification

Y Zhou, JY Jiang, KW Chang, W Wang - arxiv preprint arxiv:1909.03084, 2019 - arxiv.org
Adversarial attacks against machine learning models have threatened various real-world
applications such as spam filtering and sentiment analysis. In this paper, we propose a …

[PDF][PDF] Defense against synonym substitution-based adversarial attacks via Dirichlet neighborhood ensemble

Y Zhou, X Zheng, CJ Hsieh, KW Chang… - Association for …, 2021 - par.nsf.gov
Although deep neural networks have achieved prominent performance on many NLP tasks,
they are vulnerable to adversarial examples. We propose Dirichlet Neighborhood Ensemble …

Text adversarial attacks and defenses: Issues, taxonomy, and perspectives

X Han, Y Zhang, W Wang… - Security and …, 2022 - Wiley Online Library
Deep neural networks (DNNs) have been widely used in many fields due to their powerful
representation learning capabilities. However, they are exposed to serious threats caused …

Certified robustness to text adversarial attacks by randomized [mask]

J Zeng, J Xu, X Zheng, X Huang - Computational Linguistics, 2023 - direct.mit.edu
Very recently, few certified defense methods have been developed to provably guarantee
the robustness of a text classifier to adversarial synonym substitutions. However, all the …

Coco: Controllable counterfactuals for evaluating dialogue state trackers

S Li, S Yavuz, K Hashimoto, J Li, T Niu… - arxiv preprint arxiv …, 2020 - arxiv.org
Dialogue state trackers have made significant progress on benchmark datasets, but their
generalization capability to novel and realistic scenarios beyond the held-out conversations …

Character-level white-box adversarial attacks against transformers via attachable subwords substitution

A Liu, H Yu, X Hu, S Li, L Lin, F Ma, Y Yang… - arxiv preprint arxiv …, 2022 - arxiv.org
We propose the first character-level white-box adversarial attack method against transformer
models. The intuition of our method comes from the observation that words are split into …

Sgd-x: A benchmark for robust generalization in schema-guided dialogue systems

H Lee, R Gupta, A Rastogi, Y Cao, B Zhang… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue
research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling …

Defense against adversarial attacks in nlp via dirichlet neighborhood ensemble

Y Zhou, X Zheng, CJ Hsieh, K Chang… - arxiv preprint arxiv …, 2020 - arxiv.org
Despite neural networks have achieved prominent performance on many natural language
processing (NLP) tasks, they are vulnerable to adversarial examples. In this paper, we …