Backdoor attacks and defenses targeting multi-domain ai models: A comprehensive review

S Zhang, Y Pan, Q Liu, Z Yan, KKR Choo… - ACM Computing …, 2024 - dl.acm.org
Since the emergence of security concerns in artificial intelligence (AI), there has been
significant attention devoted to the examination of backdoor attacks. Attackers can utilize …

The unreasonable effectiveness of few-shot learning for machine translation

X Garcia, Y Bansal, C Cherry, G Foster… - International …, 2023 - proceedings.mlr.press
We demonstrate the potential of few-shot translation systems, trained with unpaired
language data, for both high and low-resource language pairs. We show that with only 5 …

Cater: Intellectual property protection on text generation apis via conditional watermarks

X He, Q Xu, Y Zeng, L Lyu, F Wu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Previous works have validated that text generation APIs can be stolen through imitation
attacks, causing IP violations. In order to protect the IP of text generation APIs, recent work …

Spinning language models: Risks of propaganda-as-a-service and countermeasures

E Bagdasaryan, V Shmatikov - 2022 IEEE Symposium on …, 2022 - ieeexplore.ieee.org
We investigate a new threat to neural sequence-to-sequence (seq2seq) models: training-
time attacks that cause models to “spin” their outputs so as to support an adversary-chosen …

Safeguarding human values: rethinking US law for generative AI's societal impacts

I Cheong, A Caliskan, T Kohno - AI and Ethics, 2024 - Springer
Our interdisciplinary study examines the effectiveness of US law in addressing the complex
challenges posed by generative AI systems to fundamental human values, including …