A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Retrieval-augmented generation for large language models: A survey

Y Gao, Y Xiong, X Gao, K Jia, J Pan, Y Bi, Y Dai… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) demonstrate powerful capabilities, but they still face
challenges in practical applications, such as hallucinations, slow knowledge updates, and …

Survey on factuality in large language models: Knowledge, retrieval and domain-specificity

C Wang, X Liu, Y Yue, X Tang, T Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As
LLMs find applications across diverse domains, the reliability and accuracy of their outputs …

HuatuoGPT-II, one-stage training for medical adaption of LLMs

J Chen, X Wang, K Ji, A Gao, F Jiang, S Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Adapting a language model into a specific domain, aka 'domain adaption', is a common
practice when specialized knowledge, e.g. medicine, is not encapsulated in a general …

LawBench: Benchmarking legal knowledge of large language models

Z Fei, X Shen, D Zhu, F Zhou, Z Han, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated strong capabilities in various aspects.
However, when applying them to the highly specialized, safe-critical legal domain, it is …

DISC-MedLLM: Bridging general large language models and real-world medical consultation

Z Bao, W Chen, S Xiao, K Ren, J Wu, C Zhong… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose DISC-MedLLM, a comprehensive solution that leverages Large Language
Models (LLMs) to provide accurate and truthful medical response in end-to-end …

A survey on knowledge distillation of large language models

X Xu, M Li, C Tao, T Shen, R Cheng, J Li, C Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
This survey presents an in-depth exploration of knowledge distillation (KD) techniques
within the realm of Large Language Models (LLMs), spotlighting the pivotal role of KD in …

A comprehensive survey on evaluating large language model applications in the medical industry

Y Huang, K Tang, M Chen, B Wang - arXiv preprint arXiv:2404.15777, 2024 - arxiv.org
Since the inception of the Transformer architecture in 2017, Large Language Models (LLMs)
such as GPT and BERT have evolved significantly, impacting various industries with their …

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arXiv preprint arXiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

MedBench: A comprehensive, standardized, and reliable benchmarking system for evaluating Chinese medical large language models

M Liu, W Hu, J Ding, J Xu, X Li, L Zhu… - Big Data Mining and …, 2024 - ieeexplore.ieee.org
Ensuring the general efficacy and benefit for human beings from medical Large Language
Models (LLM) before real-world deployment is crucial. However, a widely accepted and …