One-shot labeling for automatic relevance estimation

S MacAvaney, L Soldaini - Proceedings of the 46th International ACM …, 2023 - dl.acm.org
Dealing with unjudged documents (" holes") in relevance assessments is a perennial
problem when evaluating search systems with offline experiments. Holes can reduce the …

Retrieval evaluation with incomplete information

C Buckley, EM Voorhees - Proceedings of the 27th annual international …, 2004 - dl.acm.org
This paper examines whether the Cranfield evaluation methodology is robust to gross
violations of the completeness assumption (ie, the assumption that all relevant documents …

Test collection based evaluation of information retrieval systems

M Sanderson - Foundations and Trends® in Information …, 2010 - nowpublishers.com
Use of test collections and evaluation measures to assess the effectiveness of information
retrieval systems has its origins in work dating back to the early 1950s. Across the nearly 60 …

How does clickthrough data reflect retrieval quality?

F Radlinski, M Kurup, T Joachims - … of the 17th ACM conference on …, 2008 - dl.acm.org
Automatically judging the quality of retrieval functions based on observable user behavior
holds promise for making retrieval evaluation faster, cheaper, and more user centered …

Automatically assessing machine summary content without a gold standard

A Louis, A Nenkova - Computational Linguistics, 2013 - direct.mit.edu
The most widely adopted approaches for evaluation of summary content follow some
protocol for comparing a summary with gold-standard human summaries, which are …

[КНИГА][B] An introduction to search engines and web navigation

M Levene - 2011 - books.google.com
This book is a second edition, updated and expanded to explain the technologies that help
us find information on the web. Search engines and web navigation tools have become …

Increasing cheat robustness of crowdsourcing tasks

C Eickhoff, AP de Vries - Information retrieval, 2013 - Springer
Crowdsourcing successfully strives to become a widely used means of collecting large-scale
scientific corpora. Many research fields, including Information Retrieval, rely on this novel …

A new rank correlation coefficient for information retrieval

E Yilmaz, JA Aslam, S Robertson - … of the 31st annual international ACM …, 2008 - dl.acm.org
In the field of information retrieval, one is often faced with the problem of computing the
correlation between two ranked lists. The most commonly used statistic that quantifies this …

Estimating average precision with incomplete and imperfect judgments

E Yilmaz, JA Aslam - Proceedings of the 15th ACM international …, 2006 - dl.acm.org
We consider the problem of evaluating retrieval systems using incomplete judgment
information. Buckley and Voorhees recently demonstrated that retrieval systems can be …

Query performance prediction using relevance judgments generated by large language models

C Meng, N Arabzadeh, A Askari, M Aliannejadi… - arxiv preprint arxiv …, 2024 - arxiv.org
Query performance prediction (QPP) aims to estimate the retrieval quality of a search system
for a query without human relevance judgments. Previous QPP methods typically return a …