Text embeddings by weakly-supervised contrastive pre-training
This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a
wide range of tasks. The model is trained in a contrastive manner with weak supervision …
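The core training signal is an InfoNCE-style contrastive loss over weakly paired (query, passage) text with in-batch negatives. Below is a minimal sketch of that objective; the temperature value and tensor shapes are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(query_emb: torch.Tensor,
                  passage_emb: torch.Tensor,
                  temperature: float = 0.05) -> torch.Tensor:
    """query_emb, passage_emb: (batch, dim); row i of each is a positive pair.
    Every other passage in the batch serves as an in-batch negative."""
    q = F.normalize(query_emb, dim=-1)
    p = F.normalize(passage_emb, dim=-1)
    logits = q @ p.T / temperature                      # (batch, batch) cosine similarities
    targets = torch.arange(q.size(0), device=q.device)  # diagonal entries are the positives
    return F.cross_entropy(logits, targets)
```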
ColBERTv2: Effective and efficient retrieval via lightweight late interaction
Neural information retrieval (IR) has greatly advanced search and other knowledge-
intensive language tasks. While many neural IR methods encode queries and documents …
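The "late interaction" the snippet refers to scores a query-document pair by matching per-token embeddings rather than single vectors: each query token takes its maximum similarity over all document tokens (MaxSim), and the per-token maxima are summed. A minimal sketch of that scoring rule, with tensor shapes assumed for illustration:

```python
import torch
import torch.nn.functional as F

def maxsim_score(query_tokens: torch.Tensor,  # (q_len, dim) per-token query embeddings
                 doc_tokens: torch.Tensor     # (d_len, dim) per-token document embeddings
                 ) -> torch.Tensor:
    q = F.normalize(query_tokens, dim=-1)
    d = F.normalize(doc_tokens, dim=-1)
    sim = q @ d.T                       # (q_len, d_len) token-level similarities
    return sim.max(dim=1).values.sum()  # MaxSim per query token, summed over the query
```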
Unsupervised corpus aware language model pre-training for dense passage retrieval
L. Gao and J. Callan. arXiv preprint arXiv:2108.05540, 2021.
Recent research demonstrates the effectiveness of using fine-tuned language models~(LM)
for dense retrieval. However, dense retrievers are hard to train, typically requiring heavily …
Promptagator: Few-shot dense retrieval from 8 examples
Much recent research on information retrieval has focused on how to transfer from one task
(typically with abundant supervised data) to various other tasks where supervision is limited …
Autoregressive search engines: Generating substrings as document identifiers
Abstract Knowledge-intensive language tasks require NLP systems to both provide the
correct answer and retrieve supporting evidence for it in a given corpus. Autoregressive …
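The retriever here generates substrings that are constrained to occur verbatim in the corpus (the paper uses an FM-index for this) and maps them back to the documents containing them. The toy index below stands in for the FM-index purely to illustrate the decoding constraint; all names are assumptions, not the paper's implementation.

```python
from collections import defaultdict

def build_substring_index(docs, max_len=5):
    """Toy substitute for an FM-index: map every corpus n-gram
    (up to max_len tokens) to the set of documents containing it."""
    index = defaultdict(set)
    for doc_id, doc in enumerate(docs):
        tokens = doc.split()
        for i in range(len(tokens)):
            for j in range(i + 1, min(i + 1 + max_len, len(tokens) + 1)):
                index[tuple(tokens[i:j])].add(doc_id)
    return index

def allowed_next_tokens(index, prefix):
    """Constrained decoding step: only tokens that extend `prefix`
    into another real corpus substring may be generated."""
    return {key[len(prefix)] for key in index
            if len(key) > len(prefix) and key[:len(prefix)] == prefix}
```

At decode time, the generator's vocabulary is masked to `allowed_next_tokens(index, prefix)` at each step, so every finished string identifies at least one real document.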
Dense text retrieval based on pretrained language models: A survey
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to users' queries in natural language. From …
GPL: Generative pseudo labeling for unsupervised domain adaptation of dense retrieval
Dense retrieval approaches can overcome the lexical gap and lead to significantly improved
search results. However, they require large amounts of training data which is not available …
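Concretely, GPL synthesizes queries for unlabeled passages with a query generator, mines negatives with an existing retriever, scores each (query, positive, negative) triple with a cross-encoder, and distills the score gap into the dense retriever. A minimal sketch of that final MarginMSE step, assuming the pseudo labels have already been computed:

```python
import torch
import torch.nn.functional as F

def margin_mse_loss(q_emb: torch.Tensor,            # (batch, dim) student query embeddings
                    pos_emb: torch.Tensor,          # (batch, dim) pseudo-positive passages
                    neg_emb: torch.Tensor,          # (batch, dim) mined negative passages
                    teacher_margin: torch.Tensor    # (batch,) cross-encoder score(pos) - score(neg)
                    ) -> torch.Tensor:
    """Train the retriever so its dot-product margin matches the teacher's."""
    student_margin = (q_emb * pos_emb).sum(-1) - (q_emb * neg_emb).sum(-1)
    return F.mse_loss(student_margin, teacher_margin)
```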
Adversarial retriever-ranker for dense text retrieval
Current dense text retrieval models face two typical challenges. First, they adopt a siamese
dual-encoder architecture to encode queries and documents independently for fast indexing …
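The siamese dual-encoder setup described here encodes queries and documents independently, which is what makes fast indexing possible: document vectors are precomputed once offline, and retrieval reduces to a top-k similarity search. A minimal sketch of that retrieval step (the adversarial retriever-ranker training itself is not shown):

```python
import torch

@torch.no_grad()
def retrieve(query_emb: torch.Tensor,   # (dim,) encoded independently of any document
             doc_index: torch.Tensor,   # (num_docs, dim) precomputed offline
             k: int = 10):
    scores = doc_index @ query_emb      # dot-product relevance for every document
    return torch.topk(scores, k)        # top-k scores and document ids
```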
Dense X retrieval: What retrieval granularity should we use?
Dense retrieval has become a prominent method to obtain relevant context or world
knowledge in open-domain NLP tasks. When we use a learned dense retriever on a …
SimLM: Pre-training with representation bottleneck for dense passage retrieval
In this paper, we propose SimLM (Similarity matching with Language Model pre-training), a
simple yet effective pre-training method for dense passage retrieval. It employs a simple …
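The "representation bottleneck" idea: a deep encoder must compress a passage into a single vector, and a deliberately shallow decoder reconstructs the text conditioned on that vector, forcing the bottleneck embedding to carry most of the passage's information. The sketch below uses illustrative layer counts and a plain token-reconstruction head; these are simplifying assumptions, not SimLM's exact (replaced-token-detection-style) objective.

```python
import torch
import torch.nn as nn

class BottleneckPretrainer(nn.Module):
    def __init__(self, vocab_size: int = 30522, dim: int = 768):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        enc_layer = nn.TransformerEncoderLayer(dim, nhead=12, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=12)  # deep encoder
        self.decoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=12, batch_first=True),
            num_layers=2)                                               # shallow decoder
        self.lm_head = nn.Linear(dim, vocab_size)

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:  # (batch, seq)
        h = self.encoder(self.embed(input_ids))
        cls = h[:, :1]                             # bottleneck: the [CLS] vector only
        dec_in = torch.cat([cls, self.embed(input_ids[:, 1:])], dim=1)
        return self.lm_head(self.decoder(dec_in))  # reconstruct tokens from the bottleneck
```

Keeping the decoder shallow is the design point: a weak decoder cannot recover the passage on its own, so the pre-training loss pushes the information into the [CLS] bottleneck that later serves as the retrieval embedding.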