User simulation for evaluating information access systems
With the emergence of various information access systems exhibiting increasing complexity,
there is a critical need for sound and scalable means of automatic evaluation. To address …
there is a critical need for sound and scalable means of automatic evaluation. To address …
Fa* ir: A fair top-k ranking algorithm
In this work, we define and solve the Fair Top-k Ranking problem, in which we want to
determine a subset of k candidates from a large pool of n» k candidates, maximizing utility …
determine a subset of k candidates from a large pool of n» k candidates, maximizing utility …
Evaluating stochastic rankings with expected exposure
We introduce the concept of expected exposure as the average attention ranked items
receive from users over repeated samples of the same query. Furthermore, we advocate for …
receive from users over repeated samples of the same query. Furthermore, we advocate for …
Fair Top-k Ranking with multiple protected groups
Ranking items or people is a fundamental operation at the basis of several processes and
services, not all of them happening online. Ranking is required for different tasks, including …
services, not all of them happening online. Ranking is required for different tasks, including …
When does relevance mean usefulness and user satisfaction in web search?
Relevance is a fundamental concept in information retrieval (IR) studies. It is however often
observed that relevance as annotated by secondary assessors may not necessarily mean …
observed that relevance as annotated by secondary assessors may not necessarily mean …
Result Diversification in Search and Recommendation: A Survey
Diversifying return results is an important research topic in retrieval systems in order to
satisfy both the various interests of customers and the equal market exposure of providers …
satisfy both the various interests of customers and the equal market exposure of providers …
Towards unified metrics for accuracy and diversity for recommender systems
Recommender systems evaluation has evolved rapidly in recent years. However, for offline
evaluation, accuracy is the de facto standard for assessing the superiority of one method …
evaluation, accuracy is the de facto standard for assessing the superiority of one method …
Leveraging passage-level cumulative gain for document ranking
Document ranking is one of the most studied but challenging problems in information
retrieval (IR) research. A number of existing document ranking models capture relevance …
retrieval (IR) research. A number of existing document ranking models capture relevance …
Search result diversification based on hierarchical intents
A large percentage of queries issued to search engines are broad or ambiguous. Search
result diversification aims to solve this problem, by returning diverse results that can fulfill as …
result diversification aims to solve this problem, by returning diverse results that can fulfill as …
Which Diversity Evaluation Measures Are" Good"?
This study evaluates 30 IR evaluation measures or their instances, of which nine are for
adhoc IR and 21 are for diversified IR, primarily from the viewpoint of whether their …
adhoc IR and 21 are for diversified IR, primarily from the viewpoint of whether their …