The security of using large language models: A survey with emphasis on ChatGPT

W Zhou, X Zhu, QL Han, L Li, X Chen… - IEEE/CAA Journal of …, 2024 - ieeexplore.ieee.org
ChatGPT is a powerful artificial intelligence (AI) language model that has demonstrated
significant improvements in various natural language processing (NLP) tasks. However, like …

Multi-Turn Context Jailbreak Attack on Large Language Models From First Principles

X Sun, D Zhang, D Yang, Q Zou, H Li - ar…

… and Analyzing Safety Architectures

I Domkundwar, I Bhola - arXiv preprint arXiv:2409.03793, 2024 - arxiv.org
AI agents, specifically powered by large language models, have demonstrated exceptional
capabilities in various applications where precision and efficacy are necessary. However …