- Academic Search

A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier

In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

保存引用被引用次数：579 相关文章所有 12 个版本

[Free GPT-4]

[PDF] arxiv.org

Backdoor attacks and countermeasures on deep learning: A comprehensive review

Y Gao, BG Doan, Z Zhang, S Ma, J Zhang, A Fu… - arxiv preprint arxiv …, 2020 - arxiv.org

This work provides the community with a timely comprehensive review of backdoor attacks
and countermeasures on deep learning. According to the attacker's capability and affected …

保存引用被引用次数：256 相关文章所有 3 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Universal and transferable adversarial attacks on aligned language models

A Zou, Z Wang, N Carlini, M Nasr, JZ Kolter… - arxiv preprint arxiv …, 2023 - arxiv.org

Because" out-of-the-box" large language models are capable of generating a great deal of
objectionable content, recent work has focused on aligning these models in an attempt to …

保存引用被引用次数：1059 相关文章所有 8 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Weight poisoning attacks on pre-trained models

K Kurita, P Michel, G Neubig - arxiv preprint arxiv:2004.06660, 2020 - arxiv.org

Recently, NLP has seen a surge in the usage of large pre-trained models. Users download
weights of models pre-trained on large datasets, then fine-tune the weights on a task of their …

保存引用被引用次数：455 相关文章所有 4 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Adversarial deepfakes: Evaluating vulnerability of deepfake detectors to adversarial examples

S Hussain, P Neekhara, M Jere… - Proceedings of the …, 2021 - openaccess.thecvf.com

Recent advances in video manipulation techniques have made the generation of fake
videos more accessible than ever before. Manipulated videos can fuel disinformation and …

保存引用被引用次数：216 相关文章所有 12 个版本 HTML 版

[Free GPT-4]

[PDF] acm.org

Advpulse: Universal, synchronization-free, and targeted audio adversarial attacks via subsecond perturbations

Z Li, Y Wu, J Liu, Y Chen, B Yuan - Proceedings of the 2020 ACM …, 2020 - dl.acm.org

Existing efforts in audio adversarial attacks only focus on the scenarios where an adversary
has prior knowledge of the entire speech input so as to generate an adversarial example by …

保存引用被引用次数：131 相关文章所有 4 个版本

[Free GPT-4]

[PDF] arxiv.org

A survey on universal adversarial attack

C Zhang, P Benz, C Lin, A Karjauv, J Wu… - arxiv preprint arxiv …, 2021 - arxiv.org

The intriguing phenomenon of adversarial examples has attracted significant attention in
machine learning and what might be more surprising to the community is the existence of …

保存引用被引用次数：111 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] academia.edu

A survey on voice assistant security: Attacks and countermeasures

C Yan, X Ji, K Wang, Q Jiang, Z **, W Xu - ACM Computing Surveys, 2022 - dl.acm.org

Voice assistants (VA) have become prevalent on a wide range of personal devices such as
smartphones and smart speakers. As companies build voice assistants with extra …

保存引用被引用次数：64 相关文章所有 2 个版本

[Free GPT-4]

[PDF] thecvf.com

Adversarial threats to deepfake detection: A practical perspective

P Neekhara, B Dolhansky, J Bitton… - Proceedings of the …, 2021 - openaccess.thecvf.com

Facially manipulated images and videos or DeepFakes can be used maliciously to fuel
misinformation or defame individuals. Therefore, detecting DeepFakes is crucial to increase …

保存引用被引用次数：102 相关文章所有 6 个版本 HTML 版

[Free GPT-4]

[PDF] thecvf.com

Data-free universal adversarial perturbation and black-box attack

C Zhang, P Benz, A Karjauv… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Universal adversarial perturbation (UAP), ie a single perturbation to fool the network for most
images, is widely recognized as a more practical attack because the UAP can be generated …

保存引用被引用次数：68 相关文章所有 4 个版本 HTML 版

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Universal adversarial perturbations for speech recognition systems

A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

Backdoor attacks and countermeasures on deep learning: A comprehensive review

Universal and transferable adversarial attacks on aligned language models

Weight poisoning attacks on pre-trained models

Adversarial deepfakes: Evaluating vulnerability of deepfake detectors to adversarial examples

Advpulse: Universal, synchronization-free, and targeted audio adversarial attacks via subsecond perturbations

A survey on universal adversarial attack

A survey on voice assistant security: Attacks and countermeasures

Adversarial threats to deepfake detection: A practical perspective

Data-free universal adversarial perturbation and black-box attack