Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

A survey of the usages of deep learning for natural language processing

DW Otter, JR Medina, JK Kalita - IEEE transactions on neural …, 2020 - ieeexplore.ieee.org
Over the last several years, the field of natural language processing has been propelled
forward by an explosion in the use of deep learning models. This article provides a brief …

Large language models for information retrieval: A survey

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arxiv preprint arxiv …, 2023 - arxiv.org
As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

Promptagator: Few-shot dense retrieval from 8 examples

Z Dai, VY Zhao, J Ma, Y Luan, J Ni, J Lu… - arxiv preprint arxiv …, 2022 - arxiv.org
Much recent research on information retrieval has focused on how to transfer from one task
(typically with abundant supervised data) to various other tasks where supervision is limited …

Learning to tokenize for generative retrieval

W Sun, L Yan, Z Chen, S Wang, H Zhu… - Advances in …, 2023 - proceedings.neurips.cc
As a new paradigm in information retrieval, generative retrieval directly generates a ranked
list of document identifiers (docids) for a given query using generative language models …

Long range arena: A benchmark for efficient transformers

Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri… - arxiv preprint arxiv …, 2020 - arxiv.org
Transformers do not scale very well to long sequence lengths largely because of quadratic
self-attention complexity. In the recent months, a wide spectrum of efficient, fast Transformers …

Approximate nearest neighbor negative contrastive learning for dense text retrieval

L **ong, C **ong, Y Li, KF Tang, J Liu… - arxiv preprint arxiv …, 2020 - arxiv.org
Conducting text retrieval in a dense learned representation space has many intriguing
advantages over sparse retrieval. Yet the effectiveness of dense retrieval (DR) often requires …

RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering

Y Qu, Y Ding, J Liu, K Liu, R Ren, WX Zhao… - arxiv preprint arxiv …, 2020 - arxiv.org
In open-domain question answering, dense passage retrieval has become a new paradigm
to retrieve relevant passages for finding answers. Typically, the dual-encoder architecture is …

Colbert: Efficient and effective passage search via contextualized late interaction over bert

O Khattab, M Zaharia - Proceedings of the 43rd International ACM SIGIR …, 2020 - dl.acm.org
Recent progress in Natural Language Understanding (NLU) is driving fast-paced advances
in Information Retrieval (IR), largely owed to fine-tuning deep language models (LMs) for …