PARADE: Passage Representation Aggregation for Document Reranking
Pre-trained transformer models, such as BERT and T5, have been shown to be highly effective at
ad hoc passage and document ranking. Due to the inherent sequence length limits of these …
InPars: Data augmentation for information retrieval using large language models
The information retrieval community has recently witnessed a revolution due to large
pretrained transformer models. Another key ingredient for this revolution was the MS …
mMARCO: A multilingual version of the MS MARCO passage ranking dataset
The MS MARCO ranking dataset has been widely used for training deep learning models for
IR tasks, achieving considerable effectiveness on diverse zero-shot scenarios. However, this …
Exploring listwise evidence reasoning with T5 for fact verification
This work explores a framework for fact verification that leverages pretrained sequence-to-
sequence transformer models for sentence selection and label prediction, two key sub-tasks …
Squeezing water from a stone: a bag of tricks for further improving cross-encoder effectiveness for reranking
While much recent work has demonstrated that hard negative mining can be used to train
better bi-encoder models, few have considered it in the context of cross-encoders, which are …
No parameter left behind: How distillation and model size affect zero-shot retrieval
Recent work has shown that small distilled language models are strong competitors to
models that are orders of magnitude larger and slower in a wide range of information …
In defense of cross-encoders for zero-shot retrieval
Bi-encoders and cross-encoders are widely used in many state-of-the-art retrieval pipelines.
In this work we study the generalization ability of these two types of architectures on a wide …
Neural query synthesis and domain-specific ranking templates for multi-stage clinical trial matching
In this work, we propose an effective multi-stage neural ranking system for the clinical trial
matching problem. First, we introduce NQS, a neural query synthesis method that leverages …
Document expansion baselines and learned sparse lexical representations for MS MARCO V1 and V2
With doc2query, we train a neural sequence-to-sequence model that, given an input span of
text, predicts a natural language query that the text might answer. These predictions can be …
Billions of parameters are worth more than in-domain training data: A case study in the legal case entailment task
Recent work has shown that language models scaled to billions of parameters, such as GPT-3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment …