- Academic Search

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - ar**… - Advances in Neural …, 2023 - proceedings.neurips.cc

Instruction tuning is an effective technique to align large language models (LLMs) with
human intent. In this work, we investigate how an adversary can exploit instruction tuning by …

Cited by: 89

Artificial intelligence (AI) cybersecurity dimensions: a comprehensive framework for understanding adversarial and offensive AI

M Malatji, A Tolah - AI and Ethics, 2024 - Springer

As Artificial Intelligence (AI) rapidly advances and integrates into various domains,
cybersecurity emerges as a critical field grappling with both the benefits and pitfalls of AI …

Cited by: 45

LLM self defense: By self examination, LLMs know they are being tricked

M Phute, A Helbling, M Hull, SY Peng, S Szyller… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) are popular for high-quality text generation but can produce
harmful content, even when aligned with human values through reinforcement learning …

Cited by: 70

AI-driven threat detection and response: A paradigm shift in cybersecurity

A Yaseen - International Journal of Information and Cybersecurity, 2023 - researchgate.net

The research paper delves into the transformative role of artificial intelligence (AI) in
revolutionizing cybersecurity. This study examines the historical context and evolution of AI …

Cited by: 52

Who wrote this code? Watermarking for code generation

T Lee, S Hong, J Ahn, I Hong, H Lee, S Yun… - arXiv preprint arXiv …, 2023 - arxiv.org

Since the remarkable generation performance of large language models raised ethical and
legal concerns, approaches to detect machine-generated text by embedding watermarks are …

Cited by: 65

Deepfakes, misinformation, and disinformation in the era of frontier AI, generative AI, and large AI models

MR Shoaib, Z Wang, MT Ahvanooey… - … on Computer and …, 2023 - ieeexplore.ieee.org

With the advent of sophisticated artificial intelligence (AI) technologies, the proliferation of
deepfakes and the spread of m/disinformation have emerged as formidable threats to the …

Cited by: 58

Saved in "My Library"

The threat of offensive AI to organizations

AI alignment: A comprehensive survey

Artificial intelligence (AI) cybersecurity dimensions: a comprehensive framework for understanding adversarial and offensive AI

LLM self defense: By self examination, LLMs know they are being tricked

AI-driven threat detection and response: A paradigm shift in cybersecurity

Who wrote this code? Watermarking for code generation

Deepfakes, misinformation, and disinformation in the era of frontier AI, generative AI, and large AI models