Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - ar**… - Advances in Neural …, 2023 - proceedings.neurips.cc
Instruction tuning is an effective technique to align large language models (LLMs) with
human intent. In this work, we investigate how an adversary can exploit instruction tuning by …