Pyserini: A Python toolkit for reproducible information retrieval research with sparse and dense representations

J Lin, X Ma, SC Lin, JH Yang, R Pradeep… - Proceedings of the 44th …, 2021 - dl.acm.org
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and
dense representations. It aims to provide effective, reproducible, and easy-to-use first-stage …

Conversational information seeking

H Zamani, JR Trippas, J Dalton… - … and Trends® in …, 2023 - nowpublishers.com
Conversational information seeking (CIS) is concerned with a sequence of interactions
between one or more users and an information system. Interactions in CIS are primarily …

How does generative retrieval scale to millions of passages?

R Pradeep, K Hui, J Gupta, AD Lelkes… - arxiv preprint arxiv …, 2023 - arxiv.org
Popularized by the Differentiable Search Index, the emerging paradigm of generative
retrieval re-frames the classic information retrieval problem into a sequence-to-sequence …

RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!

R Pradeep, S Sharifymoghaddam, J Lin - arxiv preprint arxiv:2312.02724, 2023 - arxiv.org
In information retrieval, proprietary large language models (LLMs) such as GPT-4 and open-
source counterparts such as LLaMA and Vicuna have played a vital role in reranking …

Exploring listwise evidence reasoning with t5 for fact verification

K Jiang, R Pradeep, J Lin - … of the 59th Annual Meeting of the …, 2021 - aclanthology.org
This work explores a framework for fact verification that leverages pretrained sequence-to-
sequence transformer models for sentence selection and label prediction, two key sub-tasks …

Squeezing water from a stone: a bag of tricks for further improving cross-encoder effectiveness for reranking

R Pradeep, Y Liu, X Zhang, Y Li, A Yates… - European Conference on …, 2022 - Springer
While much recent work has demonstrated that hard negative mining can be used to train
better bi-encoder models, few have considered it in the context of cross-encoders, which are …

[PDF][PDF] Overview of the TREC 2020 Health Misinformation Track.

CLA Clarke, M Maistro, MD Smucker, G Zuccon - TREC, 2020 - trec.nist.gov
TREC 2021 was the third year for the Health Misinformation track, which was named the
Decision Track in 2019 [1]. In 2021, the track had an ad-hoc retrieval task. In each year, the …

Everything we hear: Towards tackling misinformation in podcasts

S Pathiyan Cherumanal, U Gadiraju… - Proceedings of the 26th …, 2024 - dl.acm.org
Advances in generative AI, the proliferation of large multimodal models (LMMs), and
democratized open access to these technologies have direct implications for the production …

Read it twice: Towards faithfully interpretable fact verification by revisiting evidence

X Hu, Z Hong, Z Guo, L Wen, P Yu - … of the 46th International ACM SIGIR …, 2023 - dl.acm.org
Real-world fact verification task aims to verify the factuality of a claim by retrieving evidence
from the source document. The quality of the retrieved evidence plays an important role in …

Neural query synthesis and domain-specific ranking templates for multi-stage clinical trial matching

R Pradeep, Y Li, Y Wang, J Lin - … of the 45th International ACM SIGIR …, 2022 - dl.acm.org
In this work, we propose an effective multi-stage neural ranking system for the clinical trial
matching problem. First, we introduce NQS, a neural query synthesis method that leverages …