- Academic Search

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2024 - dl.acm.org

Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Speichern Zitieren Zitiert von: 95 Ähnliche Artikel Alle 3 Versionen

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier

Abstract Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

Speichern Zitieren Zitiert von: 523 Ähnliche Artikel Alle 11 Versionen

[Free GPT-4]

[PDF] neurips.cc

On the exploitability of instruction tuning

M Shu, J Wang, C Zhu, J Gei**… - Advances in Neural …, 2023 - proceedings.neurips.cc

Instruction tuning is an effective technique to align large language models (LLMs) with
human intent. In this work, we investigate how an adversary can exploit instruction tuning by …

Speichern Zitieren Zitiert von: 86 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Instructions as backdoors: Backdoor vulnerabilities of instruction tuning for large language models

J Xu, MD Ma, F Wang, C **ao, M Chen - arxiv preprint arxiv:2305.14710, 2023 - arxiv.org

We investigate security concerns of the emergent instruction tuning paradigm, that models
are trained on crowdsourced datasets with task instructions to achieve superior …

Speichern Zitieren Zitiert von: 100 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Revisiting out-of-distribution robustness in nlp: Benchmarks, analysis, and LLMs evaluations

L Yuan, Y Chen, G Cui, H Gao, F Zou… - Advances in …, 2023 - proceedings.neurips.cc

This paper reexamines the research on out-of-distribution (OOD) robustness in the field of
NLP. We find that the distribution shift settings in previous studies commonly lack adequate …

Speichern Zitieren Zitiert von: 77 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] oup.com Full View

Perils and opportunities in using large language models in psychological research

S Abdurahman, M Atari, F Karimi-Malekabadi… - PNAS …, 2024 - academic.oup.com

The emergence of large language models (LLMs) has sparked considerable interest in their
potential application in psychological research, mainly as a model of the human psyche or …

Speichern Zitieren Zitiert von: 32 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] igi-global.com

Privacy and data protection in ChatGPT and other AI Chatbots: strategies for securing user information

G Sebastian - International Journal of Security and Privacy in …, 2023 - igi-global.com

The evolution of artificial intelligence (AI) and machine learning (ML) has led to the
development of sophisticated large language models (LLMs) that are used extensively in …

Speichern Zitieren Zitiert von: 105 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[PDF] neurips.cc

Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots

RR Tang, J Yuan, Y Li, Z Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc

In the field of natural language processing, the prevalent approach involves fine-tuning
pretrained language models (PLMs) using local samples. Recent research has exposed the …

Speichern Zitieren Zitiert von: 13 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Risk taxonomy, mitigation, and assessment benchmarks of large language model systems

T Cui, Y Wang, C Fu, Y **ao, S Li, X Deng, Y Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs) have strong capabilities in solving diverse natural language
processing tasks. However, the safety and security issues of LLM systems have become the …

Speichern Zitieren Zitiert von: 50 Ähnliche Artikel Alle 2 Versionen HTML-Version

Backdoor Attacks and Defenses Targeting Multi-Domain AI Models: A Comprehensive Review

S Zhang, Y Pan, Q Liu, Z Yan, KKR Choo… - ACM Computing …, 2024 - dl.acm.org

Since the emergence of security concerns in artificial intelligence (AI), there has been
significant attention devoted to the examination of backdoor attacks. Attackers can utilize …

Speichern Zitieren Zitiert von: 3 Ähnliche Artikel

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Chatgpt as an attack tool: Stealthy textual backdoor attack via blackbox generative model trigger

Security and privacy challenges of large language models: A survey

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

On the exploitability of instruction tuning

Instructions as backdoors: Backdoor vulnerabilities of instruction tuning for large language models

Revisiting out-of-distribution robustness in nlp: Benchmarks, analysis, and LLMs evaluations

Perils and opportunities in using large language models in psychological research

Privacy and data protection in ChatGPT and other AI Chatbots: strategies for securing user information

Setting the trap: Capturing and defeating backdoors in pretrained language models through honeypots

Risk taxonomy, mitigation, and assessment benchmarks of large language model systems

Backdoor Attacks and Defenses Targeting Multi-Domain AI Models: A Comprehensive Review