Grounding and evaluation for large language models: Practical challenges and lessons learned (survey)

K Kenthapadi, M Sameki, A Taly - Proceedings of the 30th ACM SIGKDD …, 2024 - dl.acm.org
With the ongoing rapid adoption of Artificial Intelligence (AI)-based systems in high-stakes
domains, ensuring the trustworthiness, safety, and observability of these systems has …

Retrieval-augmented generation for natural language processing: A survey

S Wu, Y **ong, Y Cui, H Wu, C Chen, Y Yuan… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated great success in various fields,
benefiting from their huge amount of parameters that store knowledge. However, LLMs still …

The power of noise: Redefining retrieval for rag systems

F Cuconasu, G Trappolini, F Siciliano, S Filice… - Proceedings of the 47th …, 2024 - dl.acm.org
Retrieval-Augmented Generation (RAG) has recently emerged as a method to extend
beyond the pre-trained knowledge of Large Language Models by augmenting the original …

Evaluating retrieval quality in retrieval-augmented generation

A Salemi, H Zamani - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org
Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for
retrieval models within these systems. Traditional end-to-end evaluation methods are …

Improving medical multi-modal contrastive learning with expert annotations

Y Kumar, P Marttinen - European Conference on Computer Vision, 2024 - Springer
We introduce eCLIP, an enhanced version of the CLIP model that integrates expert
annotations in the form of radiologist eye-gaze heatmaps. It tackles key challenges in …

CoST: Contrastive Quantization based Semantic Tokenization for Generative Recommendation

J Zhu, M **, Q Liu, Z Qiu, Z Dong, X Li - … of the 18th ACM Conference on …, 2024 - dl.acm.org
Embedding-based retrieval serves as a dominant approach to candidate item matching for
industrial recommender systems. With the success of generative AI, generative retrieval has …

Text2sql is not enough: Unifying ai and databases with tag

A Biswal, L Patel, S Jha, A Kamsetty, S Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
AI systems that serve natural language questions over databases promise to unlock
tremendous value. Such systems would allow users to leverage the powerful reasoning and …

Retrieval-Augmented Conversational Recommendation with Prompt-based Semi-Structured Natural Language State Tracking

S Kemper, J Cui, K Dicarlantonio, K Lin… - Proceedings of the 47th …, 2024 - dl.acm.org
Conversational recommendation (ConvRec) systems must understand rich and diverse
natural language (NL) expressions of user preferences and intents, often communicated in …

Magicpig: Lsh sampling for efficient llm generation

Z Chen, R Sadhukhan, Z Ye, Y Zhou, J Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) with long context windows have gained significant attention.
However, the KV cache, stored to avoid re-computation, becomes a bottleneck. Various …

Echosight: Advancing visual-language models with wiki knowledge

Y Yan, W **e - arxiv preprint arxiv:2407.12735, 2024 - arxiv.org
Knowledge-based Visual Question Answering (KVQA) tasks require answering questions
about images using extensive background knowledge. Despite significant advancements …