- Academic Search

J Zhang, H Bu, H Wen, Y Liu, H Fei… - …, 2025 - cybersecurity.springeropen.com

The rapid development of large language models (LLMs) has opened new avenues across
various fields, including cybersecurity, which faces an evolving threat landscape and …

Save Cite Cited by 37 Related articles All 2 versions Free GPT-4 Cached

[Free GPT-4]

[PDF] aaai.org

Red-Teaming for generative AI: Silver bullet or security theater?

M Feffer, A Sinha, WH Deng, ZC Lipton… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

In response to rising concerns surrounding the safety, security, and trustworthiness of
Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red …

Save Cite Cited by 41 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Jailbreak attacks and defenses against large language models: A survey

S Yi, Y Liu, Z Sun, T Cong, X He, J Song, K Xu… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) have performed exceptionally in various text-generative
tasks, including question answering, translation, code completion, etc. However, the over …

Save Cite Cited by 35 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Llm defenses are not robust to multi-turn human jailbreaks yet

N Li, Z Han, I Steneker, W Primack, R Goodside… - arxiv preprint arxiv …, 2024 - arxiv.org

Recent large language model (LLM) defenses have greatly improved models' ability to
refuse harmful queries, even when adversarially attacked. However, LLM defenses are …

Save Cite Cited by 24 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Jailbreakzoo: Survey, landscapes, and horizons in jailbreaking large language and vision-language models

H **, L Hu, X Li, P Zhang, C Chen, J Zhuang… - arxiv preprint arxiv …, 2024 - arxiv.org

The rapid evolution of artificial intelligence (AI) through developments in Large Language
Models (LLMs) and Vision-Language Models (VLMs) has brought significant advancements …

Save Cite Cited by 21 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jair.org

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

L Lin, H Mu, Z Zhai, M Wang, Y Wang, R Wang… - Journal of Artificial …, 2025 - jair.org

Generative models are rapidly gaining popularity and being integrated into everyday
applications, raising concerns over their safe use as various vulnerabilities are exposed. In …

Save Cite Cited by 12 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] purdue.edu

On large language models' resilience to coercive interrogation

Z Zhang, G Shen, G Tao, S Cheng… - 2024 IEEE Symposium on …, 2024 - computer.org

Abstract Large Language Models (LLMs) are increasingly employed in numerous
applications. It is hence important to ensure that their ethical standard aligns with humans' …

Save Cite Cited by 20 Related articles

[Free GPT-4]

[PDF] arxiv.org

Jailbreaking and mitigation of vulnerabilities in large language models

B Peng, Z Bi, Q Niu, M Liu, P Feng, T Wang… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) have transformed artificial intelligence by advancing
natural language understanding and generation, enabling applications across fields beyond …

Save Cite Cited by 11 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

K Huang, F Mo, H Li, Y Li, Y Zhang, W Yi, Y Mao… - arxiv preprint arxiv …, 2024 - arxiv.org

The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …

Save Cite Cited by 13 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] researchgate.net

[PDF][PDF] A Survey on Responsible Generative AI: What to Generate and What Not

J Gu - arxiv preprint arxiv:2404.05783, 2024 - researchgate.net

In recent years, generative AI (GenAI), like large language models and text-to-image
models, has received significant attention across various domains. However, ensuring the …

Save Cite Cited by 12 Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language...

[HTML][HTML] When llms meet cybersecurity: A systematic literature review

Red-Teaming for generative AI: Silver bullet or security theater?

Jailbreak attacks and defenses against large language models: A survey

Llm defenses are not robust to multi-turn human jailbreaks yet

Jailbreakzoo: Survey, landscapes, and horizons in jailbreaking large language and vision-language models

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

On large language models' resilience to coercive interrogation

Jailbreaking and mitigation of vulnerabilities in large language models

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

[PDF][PDF] A Survey on Responsible Generative AI: What to Generate and What Not