Simplified data wrangling with ir_datasets
Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset
documentation is scattered across the Internet and once one obtains a copy of the data …
documentation is scattered across the Internet and once one obtains a copy of the data …
Overview of the TREC 2009 Web Track.
The TRECWeb Track explores and evaluates Web retrieval technologies. Currently, the Web
Track conducts experiments using the new billion-page ClueWeb09 collection. The TREC …
Track conducts experiments using the new billion-page ClueWeb09 collection. The TREC …
A novel TF-IDF weighting scheme for effective ranking
JH Paik - Proceedings of the 36th international ACM SIGIR …, 2013 - dl.acm.org
Term weighting schemes are central to the study of information retrieval systems. This article
proposes a novel TF-IDF term weighting scheme that employs two different within document …
proposes a novel TF-IDF term weighting scheme that employs two different within document …
Large-scale validation and analysis of interleaved search evaluation
Interleaving is an increasingly popular technique for evaluating information retrieval systems
based on implicit user feedback. While a number of isolated studies have analyzed how this …
based on implicit user feedback. While a number of isolated studies have analyzed how this …
Differentiable unbiased online learning to rank
Online Learning to Rank (OLTR) methods optimize rankers based on user interactions. State-
of-the-art OLTR methods are built specifically for linear models. Their approaches do not …
of-the-art OLTR methods are built specifically for linear models. Their approaches do not …
[BOOK][B] Information retrieval evaluation
D Harman - 2011 - books.google.com
Evaluation has always played a major role in information retrieval, with the early pioneers
such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation …
such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation …
Learning concept importance using a weighted dependence model
Modeling query concepts through term dependencies has been shown to have a significant
positive effect on retrieval performance, especially for tasks such as web search, where …
positive effect on retrieval performance, especially for tasks such as web search, where …
Ambiguous queries: test collections need more sense
M Sanderson - Proceedings of the 31st annual international ACM …, 2008 - dl.acm.org
Although there are many papers examining ambiguity in Information Retrieval, this paper
shows that there is a whole class of ambiguous word that past research has barely explored …
shows that there is a whole class of ambiguous word that past research has barely explored …
Quality-biased ranking of web documents
Many existing retrieval approaches do not take into account the content quality of the
retrieved documents, although link-based measures such as PageRank are commonly used …
retrieved documents, although link-based measures such as PageRank are commonly used …
[HTML][HTML] An in-depth investigation on the behavior of measures to quantify reproducibility
Science is facing a so-called reproducibility crisis, where researchers struggle to repeat
experiments and to get the same or comparable results. This represents a fundamental …
experiments and to get the same or comparable results. This represents a fundamental …