- Academic Search

K Balog, CX Zhai - Proceedings of the Annual International ACM SIGIR …, 2023 - dl.acm.org

With the emergence of various information access systems exhibiting increasing complexity,
there is a critical need for sound and scalable means of automatic evaluation. To address …

Uložit Citovat Počet citací tohoto článku: 32 Související články Všechny verze (počet: 6) Hledat knihovnu

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

One-shot labeling for automatic relevance estimation

S MacAvaney, L Soldaini - Proceedings of the 46th International ACM …, 2023 - dl.acm.org

Dealing with unjudged documents (" holes") in relevance assessments is a perennial
problem when evaluating search systems with offline experiments. Holes can reduce the …

Uložit Citovat Počet citací tohoto článku: 45 Související články Všechny verze (počet: 4)

[Free GPT-4]
[DeepSeek]

[PDF] ru.nl

Research frontiers in information retrieval: Report from the third strategic workshop on information retrieval in lorne (swirl 2018)

JS Culpepper, F Diaz, MD Smucker - ACM SIGIR Forum, 2018 - dl.acm.org

The purpose of the Strategic Workshop in Information Retrieval in Lorne is to explore the
long-range issues of the Information Retrieval field, to recognize challenges that are on-or …

Uložit Citovat Počet citací tohoto článku: 216 Související články Všechny verze (počet: 19) Hledat knihovnu

[Free GPT-4]
[DeepSeek]

[PDF] plos.org

Evolution and impact of bias in human and machine learning algorithm interaction

W Sun, O Nasraoui, P Shafto - Plos one, 2020 - journals.plos.org

Traditionally, machine learning algorithms relied on reliable labels from experts to build
predictions. More recently however, algorithms have been receiving data from the general …

Uložit Citovat Počet citací tohoto článku: 126 Související články Všechny verze (počet: 12) Archiv

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Can generative llms create query variants for test collections? an exploratory study

M Alaofi, L Gallagher, M Sanderson, F Scholer… - Proceedings of the 46th …, 2023 - dl.acm.org

This paper explores the utility of a Large Language Model (LLM) to automatically generate
queries and query variants from a description of an information need. Given a set of …

Uložit Citovat Počet citací tohoto článku: 41 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Offline evaluation options for recommender systems

R Cañamares, P Castells, A Moffat - Information Retrieval Journal, 2020 - Springer

We undertake a detailed examination of the steps that make up offline experiments for
recommender system evaluation, including the manner in which the available ratings are …

Uložit Citovat Počet citací tohoto článku: 87 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org Full View

Offline recommender system evaluation: Challenges and new directions

P Castells, A Moffat - AI magazine, 2022 - ojs.aaai.org

Offline evaluation is an essential complement to online experiments in the selection,
improvement, tuning, and deployment of recommender systems. Offline methodologies for …

Uložit Citovat Počet citací tohoto článku: 38 Související články Všechny verze (počet: 9) Full View Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] strath.ac.uk

Measuring the utility of search engine result pages: an information foraging based measure

L Azzopardi, P Thomas, N Craswell - The 41st International ACM SIGIR …, 2018 - dl.acm.org

Web Search Engine Result Pages (SERPs) are complex responses to queries, containing
many heterogeneous result elements (web results, advertisements, and specialised" …

Uložit Citovat Počet citací tohoto článku: 101 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Evaluating generative ad hoc information retrieval

L Gienapp, H Scells, N Deckers, J Bevendorff… - Proceedings of the 47th …, 2024 - dl.acm.org

Recent advances in large language models have enabled the development of viable
generative retrieval systems. Instead of a traditional document ranking, generative retrieval …

Uložit Citovat Počet citací tohoto článku: 14 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Streamlining Evaluation with `ir-measures`

S MacAvaney, C Macdonald, I Ounis - European Conference on …, 2022 - Springer

We present ir-measures, a new tool that makes it convenient to calculate a diverse set of
evaluation measures used in information retrieval. Rather than implementing its own …

Uložit Citovat Počet citací tohoto článku: 41 Související články Všechny verze (počet: 7)

Vytvořit upozornění

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

Incorporating user expectations and behavior into the measurement of search effectiveness

User simulation for evaluating information access systems

One-shot labeling for automatic relevance estimation

Research frontiers in information retrieval: Report from the third strategic workshop on information retrieval in lorne (swirl 2018)

Evolution and impact of bias in human and machine learning algorithm interaction

Can generative llms create query variants for test collections? an exploratory study

Offline evaluation options for recommender systems

Offline recommender system evaluation: Challenges and new directions

Measuring the utility of search engine result pages: an information foraging based measure

Evaluating generative ad hoc information retrieval

Streamlining Evaluation with `ir-measures`