Membership inference attacks on machine learning: A survey

H Hu, Z Salcic, L Sun, G Dobbie, PS Yu… - ACM Computing Surveys …, 2022 - dl.acm.org
Machine learning (ML) models have been widely applied to various applications, including
image classification, text generation, audio recognition, and graph data analysis. However …

AI alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, RGH Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

A survey on adversarial attacks and defences

A Chakraborty, M Alam, V Dey… - CAAI Transactions …, 2021 - Wiley Online Library
Deep learning has evolved as a strong and efficient framework that can be applied to a
broad spectrum of complex learning problems which were difficult to solve using the …

Secure and robust machine learning for healthcare: A survey

A Qayyum, J Qadir, M Bilal… - IEEE Reviews in …, 2020 - ieeexplore.ieee.org
Recent years have witnessed widespread adoption of machine learning (ML)/deep learning
(DL) techniques due to their superior performance for a variety of healthcare applications …

STRIP: A defence against trojan attacks on deep neural networks

Y Gao, C Xu, D Wang, S Chen… - Proceedings of the 35th …, 2019 - dl.acm.org
A recent trojan attack on deep neural network (DNN) models is one insidious variant of data
poisoning attacks. Trojan attacks exploit an effective backdoor created in a DNN model by …

Detecting backdoor attacks on deep neural networks by activation clustering

B Chen, W Carvalho, N Baracaldo, H Ludwig… - arXiv preprint arXiv …, 2018 - arxiv.org
While machine learning (ML) models are being increasingly trusted to make decisions in
different and varying areas, the safety of systems using such models has become an …

Quantum machine learning for 6G communication networks: State-of-the-art and vision for the future

SJ Nawaz, SK Sharma, S Wyne, MN Patwary… - IEEE …, 2019 - ieeexplore.ieee.org
The upcoming fifth generation (5G) of wireless networks is expected to lay a foundation of
intelligent networks with the provision of some isolated artificial intelligence (AI) operations …

Adversarial examples: Attacks and defenses for deep learning

X Yuan, P He, Q Zhu, X Li - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org
With rapid progress and significant successes in a wide spectrum of applications, deep
learning is being applied in many safety-critical environments. However, deep neural …

Torchattacks: A PyTorch repository for adversarial attacks

H Kim - arXiv preprint arXiv:2010.01950, 2020 - arxiv.org
arXiv:2010.01950v3 [cs.LG], 19 Feb 2021. Torchattacks: A PyTorch Repository for
Adversarial Attacks. Hoki Kim, Seoul …