A comprehensive overview of large language models
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …
natural language processing tasks and beyond. This success of LLMs has led to a large …
Datasets for large language models: A comprehensive survey
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
Judging llm-as-a-judge with mt-bench and chatbot arena
Evaluating large language model (LLM) based chat assistants is challenging due to their
broad capabilities and the inadequacy of existing benchmarks in measuring human …
broad capabilities and the inadequacy of existing benchmarks in measuring human …
[PDF][PDF] Retrieval-augmented generation for large language models: A survey
Y Gao, Y **ong, X Gao, K Jia, J Pan, Y Bi… - arxiv preprint arxiv …, 2023 - simg.baai.ac.cn
Large language models (LLMs) demonstrate powerful capabilities, but they still face
challenges in practical applications, such as hallucinations, slow knowledge updates, and …
challenges in practical applications, such as hallucinations, slow knowledge updates, and …
Harnessing the power of llms in practice: A survey on chatgpt and beyond
This article presents a comprehensive and practical guide for practitioners and end-users
working with Large Language Models (LLMs) in their downstream Natural Language …
working with Large Language Models (LLMs) in their downstream Natural Language …
Semantic uncertainty: Linguistic invariances for uncertainty estimation in natural language generation
We introduce a method to measure uncertainty in large language models. For tasks like
question answering, it is essential to know when we can trust the natural language outputs …
question answering, it is essential to know when we can trust the natural language outputs …
Evaluating correctness and faithfulness of instruction-following models for question answering
Instruction-following models are attractive alternatives to fine-tuned approaches for question
answering (QA). By simply prepending relevant documents and an instruction to their input …
answering (QA). By simply prepending relevant documents and an instruction to their input …
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …
capabilities with increasing scale. Despite their potentially transformative impact, these new …
Pandalm: An automatic evaluation benchmark for llm instruction tuning optimization
Instruction tuning large language models (LLMs) remains a challenging task, owing to the
complexity of hyperparameter selection and the difficulty involved in evaluating the tuned …
complexity of hyperparameter selection and the difficulty involved in evaluating the tuned …
Benchmarking foundation models with language-model-as-an-examiner
Numerous benchmarks have been established to assess the performance of foundation
models on open-ended question answering, which serves as a comprehensive test of a …
models on open-ended question answering, which serves as a comprehensive test of a …