CC-News-En: A large English news corpus

J Mackenzie, R Benham, M Petri, JR Trippas… - Proceedings of the 29th …, 2020 - dl.acm.org
We describe a static, open-access news corpus using data from the Common Crawl
Foundation, who provide free, publicly available web archives, including a continuous crawl …

Evaluating the robustness of retrieval pipelines with query variation generators

G Penha, A Câmara, C Hauff - European conference on information …, 2022 - Springer
Heavily pre-trained transformers for language modeling, such as BERT, have shown to be
remarkably effective for Information Retrieval (IR) tasks, typically applied to re-rank the …

Multi-stage conversational passage retrieval: An approach to fusing term importance estimation and neural query rewriting

SC Lin, JH Yang, R Nogueira, MF Tsai… - ACM Transactions on …, 2021 - dl.acm.org
Conversational search plays a vital role in conversational information seeking. As queries in
information seeking dialogues are ambiguous for traditional ad hoc information retrieval (IR) …

A study of a gain based approach for query aspects in recall oriented tasks

GM Di Nunzio, G Faggioli - Applied Sciences, 2021 - mdpi.com
Evidence-based healthcare integrates the best research evidence with clinical expertise in
order to make decisions based on the best practices available. In this context, the task of …

An enhanced evaluation framework for query performance prediction

G Faggioli, O Zendel, JS Culpepper, N Ferro… - … on Information Retrieval, 2021 - Springer
Query Performance Prediction (QPP) has been studied extensively in the IR
community over the last two decades. A by-product of this research is a methodology to …

Rank-in-rank loss for person re-identification

X Xu, X Yuan, Z Wang, K Zhang, R Hu - ACM Transactions on Multimedia …, 2022 - dl.acm.org
Person re-identification (re-ID) is commonly investigated as a ranking problem. However, the
performance of existing re-ID models drops dramatically, when they encounter extreme …

Where Do Queries Come From?

M Alaofi, L Gallagher, D McKay, LL Saling… - Proceedings of the 45th …, 2022 - dl.acm.org
Where do queries--the words searchers type into a search box--come from? The Information
Retrieval community understands the performance of queries and search engines …

sMARE: a new paradigm to evaluate and understand query performance prediction methods

G Faggioli, O Zendel, JS Culpepper, N Ferro… - Information Retrieval …, 2022 - Springer
Query performance prediction (QPP) has been studied extensively in the IR community over
the last two decades. A by-product of this research is a methodology to evaluate the …

Validating simulations of user query variants

T Breuer, N Fuhr, P Schaer - European Conference on Information …, 2022 - Springer
System-oriented IR evaluations are limited to rather abstract understandings of real
user behavior. As a solution, simulating user interactions provides a cost-efficient way to …

Offline pseudo relevance feedback for efficient and effective single-pass dense retrieval

X Wen, X Chen, X Chen, B He, L Sun - Proceedings of the 46th …, 2023 - dl.acm.org
Dense retrieval has made significant advancements in information retrieval (IR) by achieving
high levels of effectiveness while maintaining online efficiency during a single-pass retrieval …