Trustworthy AI: From principles to practices
The rapid development of Artificial Intelligence (AI) technology has enabled the deployment
of various systems based on it. However, many current AI systems are found vulnerable to …
of various systems based on it. However, many current AI systems are found vulnerable to …
Adversarial attacks and defenses in deep learning: From a perspective of cybersecurity
The outstanding performance of deep neural networks has promoted deep learning
applications in a broad set of domains. However, the potential risks caused by adversarial …
applications in a broad set of domains. However, the potential risks caused by adversarial …
Adversarial examples are not bugs, they are features
Adversarial examples have attracted significant attention in machine learning, but the
reasons for their existence and pervasiveness remain unclear. We demonstrate that …
reasons for their existence and pervasiveness remain unclear. We demonstrate that …
Theoretically principled trade-off between robustness and accuracy
We identify a trade-off between robustness and accuracy that serves as a guiding principle
in the design of defenses against adversarial examples. Although this problem has been …
in the design of defenses against adversarial examples. Although this problem has been …
Robustness may be at odds with accuracy
We show that there may exist an inherent tension between the goal of adversarial
robustness and that of standard generalization. Specifically, training robust models may not …
robustness and that of standard generalization. Specifically, training robust models may not …
Adversarial examples: Attacks and defenses for deep learning
With rapid progress and significant successes in a wide spectrum of applications, deep
learning is being applied in many safety-critical environments. However, deep neural …
learning is being applied in many safety-critical environments. However, deep neural …
Simple black-box adversarial attacks
We propose an intriguingly simple method for the construction of adversarial images in the
black-box setting. In constrast to the white-box scenario, constructing black-box adversarial …
black-box setting. In constrast to the white-box scenario, constructing black-box adversarial …
Adversarially robust generalization requires more data
Abstract Machine learning models are often susceptible to adversarial perturbations of their
inputs. Even small perturbations can cause state-of-the-art classifiers with high" standard" …
inputs. Even small perturbations can cause state-of-the-art classifiers with high" standard" …
Adversarial training and robustness for multiple perturbations
Defenses against adversarial examples, such as adversarial training, are typically tailored to
a single perturbation type (eg, small $\ell_\infty $-noise). For other perturbations, these …
a single perturbation type (eg, small $\ell_\infty $-noise). For other perturbations, these …
A closer look at accuracy vs. robustness
Current methods for training robust networks lead to a drop in test accuracy, which has led
prior works to posit that a robustness-accuracy tradeoff may be inevitable in deep learning …
prior works to posit that a robustness-accuracy tradeoff may be inevitable in deep learning …