A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Knowledge conflicts for LLMs: A survey

R Xu, Z Qi, Z Guo, C Wang, H Wang, Y Zhang… - ar…

… in Time: Adding Temporal Context to Sentiment Analysis Models

D Ninalga - arXiv preprint arXiv:2309.13562, 2023 - arxiv.org
This paper presents a state-of-the-art solution to the LongEval CLEF 2023 Lab Task 2:
LongEval-Classification. The goal of this task is to improve and preserve the performance of …