Semantic models for the first-stage retrieval: A comprehensive review
Multi-stage ranking pipelines have been a practical solution in modern search systems,
where the first-stage retrieval is to return a subset of candidate documents and latter stages …
where the first-stage retrieval is to return a subset of candidate documents and latter stages …
How to approach ambiguous queries in conversational search: A survey of techniques, approaches, tools, and challenges
K Keyvan, JX Huang - ACM Computing Surveys, 2022 - dl.acm.org
The advent of recent Natural Language Processing technology has led human and machine
interactions more toward conversation. In Conversational Search Systems (CSS) like …
interactions more toward conversation. In Conversational Search Systems (CSS) like …
Asking clarifying questions in open-domain information-seeking conversations
Users often fail to formulate their complex information needs in a single query. As a
consequence, they may need to scan multiple result pages or reformulate their queries …
consequence, they may need to scan multiple result pages or reformulate their queries …
A hierarchical recurrent encoder-decoder for generative context-aware query suggestion
Users may strive to formulate an adequate textual query for their information need. Search
engines assist the users by presenting query suggestions. To preserve the original search …
engines assist the users by presenting query suggestions. To preserve the original search …
Accurate and effective latent concept modeling for ad hoc information retrieval
A keyword query is the representation of the information need of a user, and is the result of a
complex cognitive process which often results in under-specification. We propose an …
complex cognitive process which often results in under-specification. We propose an …
Building and evaluating open-domain dialogue corpora with clarifying questions
Enabling open-domain dialogue systems to ask clarifying questions when appropriate is an
important direction for improving the quality of the system response. Namely, for cases when …
important direction for improving the quality of the system response. Namely, for cases when …
Simplified data wrangling with ir_datasets
Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset
documentation is scattered across the Internet and once one obtains a copy of the data …
documentation is scattered across the Internet and once one obtains a copy of the data …
The information retrieval experiment platform
We integrate irdatasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval
Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and …
Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and …
Efficient and effective spam filtering and re-ranking for large web datasets
The TREC 2009 web ad hoc and relevance feedback tasks used a new document collection,
the ClueWeb09 dataset, which was crawled from the general web in early 2009. This …
the ClueWeb09 dataset, which was crawled from the general web in early 2009. This …
Exploiting simulated user feedback for conversational search: Ranking, rewriting, and beyond
This research aims to explore various methods for assessing user feedback in mixed-
initiative conversational search (CS) systems. While CS systems enjoy profuse …
initiative conversational search (CS) systems. While CS systems enjoy profuse …