Simplified data wrangling with ir_datasets

S MacAvaney, A Yates, S Feldman, D Downey… - Proceedings of the 44th …, 2021 - dl.acm.org
Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset
documentation is scattered across the Internet and once one obtains a copy of the data …

Overview of the TREC 2009 Web Track.

CLA Clarke, N Craswell, I Soboroff - Trec, 2009 - apps.dtic.mil
The TREC Web Track explores and evaluates Web retrieval technologies. Currently, the Web
Track conducts experiments using the new billion-page ClueWeb09 collection. The TREC …

A novel TF-IDF weighting scheme for effective ranking

JH Paik - Proceedings of the 36th international ACM SIGIR …, 2013 - dl.acm.org
Term weighting schemes are central to the study of information retrieval systems. This article
proposes a novel TF-IDF term weighting scheme that employs two different within document …

Large-scale validation and analysis of interleaved search evaluation

O Chapelle, T Joachims, F Radlinski… - ACM Transactions on …, 2012 - dl.acm.org
Interleaving is an increasingly popular technique for evaluating information retrieval systems
based on implicit user feedback. While a number of isolated studies have analyzed how this …

Differentiable unbiased online learning to rank

H Oosterhuis, M de Rijke - Proceedings of the 27th ACM international …, 2018 - dl.acm.org
Online Learning to Rank (OLTR) methods optimize rankers based on user interactions. State-
of-the-art OLTR methods are built specifically for linear models. Their approaches do not …

[BOOK] Information retrieval evaluation

D Harman - 2011 - books.google.com
Evaluation has always played a major role in information retrieval, with the early pioneers
such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation …

Learning concept importance using a weighted dependence model

M Bendersky, D Metzler, WB Croft - … conference on Web search and data …, 2010 - dl.acm.org
Modeling query concepts through term dependencies has been shown to have a significant
positive effect on retrieval performance, especially for tasks such as web search, where …

Ambiguous queries: test collections need more sense

M Sanderson - Proceedings of the 31st annual international ACM …, 2008 - dl.acm.org
Although there are many papers examining ambiguity in Information Retrieval, this paper
shows that there is a whole class of ambiguous word that past research has barely explored …

Quality-biased ranking of web documents

M Bendersky, WB Croft, Y Diao - … conference on Web search and data …, 2011 - dl.acm.org
Many existing retrieval approaches do not take into account the content quality of the
retrieved documents, although link-based measures such as PageRank are commonly used …

An in-depth investigation on the behavior of measures to quantify reproducibility

M Maistro, T Breuer, P Schaer, N Ferro - Information Processing & …, 2023 - Elsevier
Science is facing a so-called reproducibility crisis, where researchers struggle to repeat
experiments and to get the same or comparable results. This represents a fundamental …