Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Domain specialization as the key to make large language models disruptive: A comprehensive survey

C Ling, X Zhao, J Lu, C Deng, C Zheng, J Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have significantly advanced the field of natural language
processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of …

[PDF][PDF] A prompt pattern catalog to enhance prompt engineering with chatgpt

J White, Q Fu, S Hays, M Sandborn, C Olea… - arxiv preprint arxiv …, 2023 - file.mixpaper.cn
Prompt engineering is an increasingly important skill set needed to converse effectively with
large language models (LLMs), such as ChatGPT. Prompts are instructions given to an LLM …

Distilling step-by-step! outperforming larger language models with less training data and smaller model sizes

CY Hsieh, CL Li, CK Yeh, H Nakhost, Y Fujii… - arxiv preprint arxiv …, 2023 - arxiv.org
Deploying large language models (LLMs) is challenging because they are memory
inefficient and compute-intensive for practical applications. In reaction, researchers train …

Hyena hierarchy: Towards larger convolutional language models

M Poli, S Massaroli, E Nguyen, DY Fu… - International …, 2023 - proceedings.mlr.press
Recent advances in deep learning have relied heavily on the use of large Transformers due
to their ability to learn at scale. However, the core building block of Transformers, the …

Legalbench: A collaboratively built benchmark for measuring legal reasoning in large language models

N Guha, J Nyarko, D Ho, C Ré… - Advances in …, 2023 - proceedings.neurips.cc
The advent of large language models (LLMs) and their adoption by the legal community has
given rise to the question: what types of legal reasoning can LLMs perform? To enable …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Don't listen to me: understanding and exploring jailbreak prompts of large language models

Z Yu, X Liu, S Liang, Z Cameron, C **ao… - 33rd USENIX Security …, 2024 - usenix.org
Recent advancements in generative AI have enabled ubiquitous access to large language
models (LLMs). Empowered by their exceptional capabilities to understand and generate …

In-context impersonation reveals large language models' strengths and biases

L Salewski, S Alaniz, I Rio-Torto… - Advances in neural …, 2023 - proceedings.neurips.cc
In everyday conversations, humans can take on different roles and adapt their vocabulary to
their chosen roles. We explore whether LLMs can take on, that is impersonate, different roles …

Frugalgpt: How to use large language models while reducing cost and improving performance

L Chen, M Zaharia, J Zou - arxiv preprint arxiv:2305.05176, 2023 - arxiv.org
There is a rapidly growing number of large language models (LLMs) that users can query for
a fee. We review the cost associated with querying popular LLM APIs, eg GPT-4, ChatGPT …