User simulation for evaluating information access systems

K Balog, CX Zhai - Proceedings of the Annual International ACM SIGIR …, 2023 - dl.acm.org
With the emergence of various information access systems exhibiting increasing complexity,
there is a critical need for sound and scalable means of automatic evaluation. To address …

One-shot labeling for automatic relevance estimation

S MacAvaney, L Soldaini - Proceedings of the 46th International ACM …, 2023 - dl.acm.org
Dealing with unjudged documents (" holes") in relevance assessments is a perennial
problem when evaluating search systems with offline experiments. Holes can reduce the …

Research frontiers in information retrieval: Report from the third strategic workshop on information retrieval in lorne (swirl 2018)

JS Culpepper, F Diaz, MD Smucker - ACM SIGIR Forum, 2018 - dl.acm.org
The purpose of the Strategic Workshop in Information Retrieval in Lorne is to explore the
long-range issues of the Information Retrieval field, to recognize challenges that are on-or …

Evolution and impact of bias in human and machine learning algorithm interaction

W Sun, O Nasraoui, P Shafto - Plos one, 2020 - journals.plos.org
Traditionally, machine learning algorithms relied on reliable labels from experts to build
predictions. More recently however, algorithms have been receiving data from the general …

Can generative llms create query variants for test collections? an exploratory study

M Alaofi, L Gallagher, M Sanderson, F Scholer… - Proceedings of the 46th …, 2023 - dl.acm.org
This paper explores the utility of a Large Language Model (LLM) to automatically generate
queries and query variants from a description of an information need. Given a set of …

Offline evaluation options for recommender systems

R Cañamares, P Castells, A Moffat - Information Retrieval Journal, 2020 - Springer
We undertake a detailed examination of the steps that make up offline experiments for
recommender system evaluation, including the manner in which the available ratings are …

Offline recommender system evaluation: Challenges and new directions

P Castells, A Moffat - AI magazine, 2022 - ojs.aaai.org
Offline evaluation is an essential complement to online experiments in the selection,
improvement, tuning, and deployment of recommender systems. Offline methodologies for …

Measuring the utility of search engine result pages: an information foraging based measure

L Azzopardi, P Thomas, N Craswell - The 41st International ACM SIGIR …, 2018 - dl.acm.org
Web Search Engine Result Pages (SERPs) are complex responses to queries, containing
many heterogeneous result elements (web results, advertisements, and specialised" …

Evaluating generative ad hoc information retrieval

L Gienapp, H Scells, N Deckers, J Bevendorff… - Proceedings of the 47th …, 2024 - dl.acm.org
Recent advances in large language models have enabled the development of viable
generative retrieval systems. Instead of a traditional document ranking, generative retrieval …

Streamlining Evaluation with ir-measures

S MacAvaney, C Macdonald, I Ounis - European Conference on …, 2022 - Springer
We present ir-measures, a new tool that makes it convenient to calculate a diverse set of
evaluation measures used in information retrieval. Rather than implementing its own …