Ragas: Automated evaluation of retrieval augmented generation

S Es, J James, L Espinosa-Anke… - arxiv preprint arxiv …, 2023 - arxiv.org
We introduce RAGAs (Retrieval Augmented Generation Assessment), a framework for
reference-free evaluation of Retrieval Augmented Generation (RAG) pipelines. RAG …

Understanding retrieval augmentation for long-form question answering

HT Chen, F Xu, S Arora, E Choi - arxiv preprint arxiv:2310.12150, 2023 - arxiv.org
We present a study of retrieval-augmented language models (LMs) on long-form question
answering. We analyze how retrieval augmentation impacts different LMs, by comparing …

Piperag: Fast retrieval-augmented generation via algorithm-system co-design

W Jiang, S Zhang, B Han, J Wang, B Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Retrieval-augmented generation (RAG) can enhance the generation quality of large
language models (LLMs) by incorporating external token databases. However, retrievals …

More room for language: Investigating the effect of retrieval on language models

D Samuel, LGG Charpentier, S Wold - arxiv preprint arxiv:2404.10939, 2024 - arxiv.org
Retrieval-augmented language models pose a promising alternative to standard language
modeling. During pretraining, these models search in a corpus of documents for contextually …

Great Memory, Shallow Reasoning: Limits of NN-LMs

S Geng, W Zhao, AM Rush - arxiv preprint arxiv:2408.11815, 2024 - arxiv.org
$ K $-nearest neighbor language models ($ k $ NN-LMs), which integrate retrieval with next-
word prediction, have demonstrated strong performance in language modeling as well as …

PersonaLM: Language Model Personalization via Domain-distributed Span Aggregated K-Nearest N-gram Retrieval Augmentation

P Mathur, Z Liu, K Li, Y Ma, G Keren… - Findings of the …, 2023 - aclanthology.org
Abstract We introduce PersonaLM-Domain-distributed Span-Aggregated K-nearest N-gram
retrieval augmentation to improve language modeling for Automatic Speech Recognition …

Morse: Bridging the gap in cybersecurity expertise with retrieval augmented generation

M Simoni, A Saracino, M Conti - arxiv preprint arxiv:2407.15748, 2024 - arxiv.org
In this paper, we introduce MoRSE (Mixture of RAGs Security Experts), the first specialised
AI chatbot for cybersecurity. MoRSE aims to provide comprehensive and complete …

Chunk-Distilled Language Modeling

Y Li, K Livescu, J Zhou - arxiv preprint arxiv:2501.00343, 2024 - arxiv.org
We introduce Chunk-Distilled Language Modeling (CD-LM), an approach to text generation
that addresses two challenges in current large language models (LLMs): the inefficiency of …

Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

M Li, X Chen, A Holtzman, B Chen, J Lin, W Yih… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) often hallucinate and lack the ability to provide attribution for
their generations. Semi-parametric LMs, such as kNN-LM, approach these limitations by …

Towards Robust Long-form Text Generation Systems

K Krishna - 2023 - scholarworks.umass.edu
Text generation is an important emerging AI technology that has seen significant research
advances in recent years. Due to its closeness to how humans communicate, mastering text …