Query2doc: Query expansion with large language models

L Wang, N Yang, F Wei - arxiv preprint arxiv:2303.07678, 2023 - arxiv.org
This paper introduces a simple yet effective query expansion approach, denoted as
query2doc, to improve both sparse and dense retrieval systems. The proposed method first …

Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges

J Wang, JX Huang, X Tu, J Wang, AJ Huang… - ACM Computing …, 2024 - dl.acm.org
Recent years have witnessed a substantial increase in the use of deep learning to solve
various natural language processing (NLP) problems. Early deep learning models were …

Generative relevance feedback with large language models

I Mackie, S Chatterjee, J Dalton - … of the 46th International ACM SIGIR …, 2023 - dl.acm.org
Current query expansion models use pseudo-relevance feedback to improve first-pass
retrieval effectiveness; however, this fails when the initial results are not relevant. Instead of …

Expand, rerank, and retrieve: Query reranking for open-domain question answering

YS Chuang, W Fang, SW Li, W Yih, J Glass - arxiv preprint arxiv …, 2023 - arxiv.org
We propose EAR, a query Expansion And Reranking approach for improving passage
retrieval, with the application to open-domain question answering. EAR first applies a query …

Generative and pseudo-relevant feedback for sparse, dense and learned sparse retrieval

I Mackie, S Chatterjee, J Dalton - arxiv preprint arxiv:2305.07477, 2023 - arxiv.org
Pseudo-relevance feedback (PRF) is a classical approach to address lexical mismatch by
enriching the query using first-pass retrieval. Moreover, recent work on generative-relevance …

Re-Invoke: Tool invocation rewriting for zero-shot tool retrieval

Y Chen, J Yoon, DS Sachan, Q Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in large language models (LLMs) have enabled autonomous agents with
complex reasoning and task-fulfillment capabilities using a wide range of tools. However …

Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

F Diaz, A Drozdov, TE Kim, A Salemi… - Proceedings of the 2024 …, 2024 - dl.acm.org
Retrieval-enhanced machine learning (REML) refers to the use of information retrieval
methods to support reasoning and inference in machine learning tasks. Although relatively …

GRM: generative relevance modeling using relevance-aware sample estimation for document retrieval

I Mackie, I Sekulic, S Chatterjee, J Dalton… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent studies show that Generative Relevance Feedback (GRF), using text generated by
Large Language Models (LLMs), can enhance the effectiveness of query expansion …

HyQE: Ranking Contexts with Hypothetical Query Embeddings

W Zhou, J Zhang, H Hasson, A Singh, W Li - arxiv preprint arxiv …, 2024 - arxiv.org
In retrieval-augmented systems, context ranking techniques are commonly employed to
reorder the retrieved contexts based on their relevance to a user query. A standard …

Doc2Token: Bridging Vocabulary Gap by Predicting Missing Tokens for E-commerce Search

K Li, J Lin, T Lee - arxiv preprint arxiv:2406.19647, 2024 - arxiv.org
Addressing the" vocabulary mismatch" issue in information retrieval is a central challenge for
e-commerce search engines, because product pages often miss important keywords that …