A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review

B Chen, Z Zhang, N Langrené, S Zhu - arXiv preprint arXiv:2310.14735, 2023 - arxiv.org
This paper delves into the pivotal role of prompt engineering in unleashing the capabilities
of Large Language Models (LLMs). Prompt engineering is the process of structuring input …

A review on large language models: Architectures, applications, taxonomies, open issues and challenges

MAK Raiaan, MSH Mukta, K Fatema, NM Fahad… - IEEE …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) recently demonstrated extraordinary capability in various
natural language processing (NLP) tasks including language translation, text generation …

Baseline defenses for adversarial attacks against aligned language models

N Jain, A Schwarzschild, Y Wen, G Somepalli… - arXiv preprint arXiv …, 2023 - arxiv.org
As Large Language Models quickly become ubiquitous, their security vulnerabilities are
critical to understand. Recent work shows that text optimizers can produce jailbreaking …

FLASK: Fine-grained language model evaluation based on alignment skill sets

S Ye, D Kim, S Kim, H Hwang, S Kim, Y Jo… - arXiv preprint arXiv …, 2023 - arxiv.org
Evaluation of Large Language Models (LLMs) is challenging because aligning to human
values requires the composition of multiple skills and the required set of skills varies …

Foundation metrics for evaluating effectiveness of healthcare conversations powered by generative AI

M Abbasian, E Khatibi, I Azimi, D Oniani… - NPJ Digital …, 2024 - nature.com
Generative Artificial Intelligence is set to revolutionize healthcare delivery by
transforming traditional patient care into a more personalized, efficient, and proactive …

Data-centric foundation models in computational healthcare: A survey

Y Zhang, J Gao, Z Tan, L Zhou, K Ding, M Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a
wave of opportunities in computational healthcare. The interactive nature of these models …

SoK: Memorization in general-purpose large language models

V Hartmann, A Suri, V Bindschaedler, D Evans… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) are advancing at a remarkable pace, with myriad
applications under development. Unlike most earlier machine learning models, they are no …

Prompting frameworks for large language models: A survey

X Liu, J Wang, J Sun, X Yuan, G Dong, P Di… - arXiv preprint arXiv …, 2023 - arxiv.org
Since the launch of ChatGPT, a powerful AI Chatbot developed by OpenAI, large language
models (LLMs) have made significant advancements in both academia and industry …

Evaluating large language models for material selection

D Grandi, YP Jain, A Groom… - … of Computing and …, 2025 - asmedigitalcollection.asme.org
Material selection is a crucial step in conceptual design due to its significant impact on the
functionality, aesthetics, manufacturability, and sustainability impact of the final product. This …