Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

Beir: A heterogenous benchmark for zero-shot evaluation of information retrieval models

N Thakur, N Reimers, A Rücklé, A Srivastava… - arxiv preprint arxiv …, 2021 - arxiv.org
Existing neural information retrieval (IR) models have often been studied in homogeneous
and narrow settings, which has considerably limited insights into their out-of-distribution …

Promptagator: Few-shot dense retrieval from 8 examples

Z Dai, VY Zhao, J Ma, Y Luan, J Ni, J Lu… - arxiv preprint arxiv …, 2022 - arxiv.org
Much recent research on information retrieval has focused on how to transfer from one task
(typically with abundant supervised data) to various other tasks where supervision is limited …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

[KİTAP][B] Pretrained transformers for text ranking: Bert and beyond

J Lin, R Nogueira, A Yates - 2022 - books.google.com
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …

Improving the domain adaptation of retrieval augmented generation (RAG) models for open domain question answering

S Siriwardhana, R Weerasekera, E Wen… - Transactions of the …, 2023 - direct.mit.edu
Abstract Retrieval Augment Generation (RAG) is a recent advancement in Open-Domain
Question Answering (ODQA). RAG has only been trained and explored with a Wikipedia …

GPL: Generative pseudo labeling for unsupervised domain adaptation of dense retrieval

K Wang, N Thakur, N Reimers, I Gurevych - arxiv preprint arxiv …, 2021 - arxiv.org
Dense retrieval approaches can overcome the lexical gap and lead to significantly improved
search results. However, they require large amounts of training data which is not available …

Inpars: Unsupervised dataset generation for information retrieval

L Bonifacio, H Abonizio, M Fadaee… - Proceedings of the 45th …, 2022 - dl.acm.org
The Information Retrieval (IR) community has recently witnessed a revolution due to large
pretrained transformer models. Another key ingredient for this revolution was the MS …

Pre-training methods in information retrieval

Y Fan, X **e, Y Cai, J Chen, X Ma, X Li… - … and Trends® in …, 2022 - nowpublishers.com
The core of information retrieval (IR) is to identify relevant information from large-scale
resources and return it as a ranked list to respond to user's information need. In recent years …

Inpars: Data augmentation for information retrieval using large language models

L Bonifacio, H Abonizio, M Fadaee… - arxiv preprint arxiv …, 2022 - arxiv.org
The information retrieval community has recently witnessed a revolution due to large
pretrained transformer models. Another key ingredient for this revolution was the MS …