Large language models for information retrieval: A survey

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arxiv preprint arxiv …, 2023 - arxiv.org
As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

Nv-embed: Improved techniques for training llms as generalist embedding models

C Lee, R Roy, M Xu, J Raiman, M Shoeybi… - arxiv preprint arxiv …, 2024 - arxiv.org
Decoder-only large language model (LLM)-based embedding models are beginning to
outperform BERT or T5-based embedding models in general-purpose text embedding tasks …

A setwise approach for effective and highly efficient zero-shot ranking with large language models

S Zhuang, H Zhuang, B Koopman… - Proceedings of the 47th …, 2024 - dl.acm.org
We propose a novel zero-shot document ranking approach based on Large Language
Models (LLMs): the Setwise prompting approach. Our approach complements existing …

Towards responsible development of generative AI for education: An evaluation-driven approach

I Jurenka, M Kunesch, KR McKee, D Gillick… - arxiv preprint arxiv …, 2024 - arxiv.org
A major challenge facing the world is the provision of equitable and universal access to
quality education. Recent advances in generative AI (gen AI) have created excitement about …

Block transformer: Global-to-local language modeling for fast inference

N Ho, S Bae, T Kim, H Jo, Y Kim… - Advances in …, 2025 - proceedings.neurips.cc
Abstract We introduce the Block Transformer which adopts hierarchical global-to-local
modeling to autoregressive transformers to mitigate the inference bottlenecks associated …

mgte: Generalized long-context text representation and reranking models for multilingual text retrieval

X Zhang, Y Zhang, D Long, W **e, Z Dai, J Tang… - arxiv preprint arxiv …, 2024 - arxiv.org
We present systematic efforts in building long-context multilingual text representation model
(TRM) and reranker from scratch for text retrieval. We first introduce a text encoder (base …

Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark

H Cao - arxiv preprint arxiv:2406.01607, 2024 - arxiv.org
Text embedding methods have become increasingly popular in both industrial and
academic fields due to their critical role in a variety of natural language processing tasks …

Promptreps: Prompting large language models to generate dense and sparse representations for zero-shot document retrieval

S Zhuang, X Ma, B Koopman, J Lin… - arxiv preprint arxiv …, 2024 - arxiv.org
Utilizing large language models (LLMs) for zero-shot document ranking is done in one of
two ways:(1) prompt-based re-ranking methods, which require no further training but are …

Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

J Lee, A Chen, Z Dai, D Dua, DS Sachan… - arxiv preprint arxiv …, 2024 - arxiv.org
Long-context language models (LCLMs) have the potential to revolutionize our approach to
tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging …

Bright: A realistic and challenging benchmark for reasoning-intensive retrieval

H Su, H Yen, M **a, W Shi, N Muennighoff… - arxiv preprint arxiv …, 2024 - arxiv.org
Existing retrieval benchmarks primarily consist of information-seeking queries (eg,
aggregated questions from search engines) where keyword or semantic-based retrieval is …