Statistical language models for information retrieval a critical review

CX Zhai - Foundations and Trends® in Information Retrieval, 2008 - nowpublishers.com
Statistical language models have recently been successfully applied to many information
retrieval problems. A great deal of recent work has shown that statistical language models …

A Systematic Review of Fairness, Accountability, Transparency, and Ethics in Information Retrieval

N Bernard, K Balog - ACM Computing Surveys, 2025 - dl.acm.org
We live in an information society that strongly relies on information retrieval systems, such as
search engines and conversational assistants. Consequently, the trustworthiness of these …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

[BOEK][B] Pretrained transformers for text ranking: Bert and beyond

J Lin, R Nogueira, A Yates - 2022 - books.google.com
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …

SPLADE: Sparse lexical and expansion model for first stage ranking

T Formal, B Piwowarski, S Clinchant - Proceedings of the 44th …, 2021 - dl.acm.org
In neural Information Retrieval, ongoing research is directed towards improving the first
retriever in ranking pipelines. Learning dense embeddings to conduct retrieval using …

A deep relevance matching model for ad-hoc retrieval

J Guo, Y Fan, Q Ai, WB Croft - Proceedings of the 25th ACM international …, 2016 - dl.acm.org
In recent years, deep neural networks have led to exciting breakthroughs in speech
recognition, computer vision, and natural language processing (NLP) tasks. However, there …

A survey of text clustering algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
Clustering is a widely studied data mining problem in the text domains. The problem finds
numerous applications in customer segmentation, classification, collaborative filtering …

Deeprank: A new deep architecture for relevance ranking in information retrieval

L Pang, Y Lan, J Guo, J Xu, J Xu, X Cheng - Proceedings of the 2017 …, 2017 - dl.acm.org
This paper concerns a deep learning approach to relevance ranking in information retrieval
(IR). Existing deep IR models such as DSSM and CDSSM directly apply neural networks to …

[BOEK][B] Text data management and analysis: a practical introduction to information retrieval and text mining

CX Zhai, S Massung - 2016 - dl.acm.org
Recent years have seen a dramatic growth of natural language text data, including web
pages, news articles, scientific literature, emails, enterprise documents, and social media …

Improving bug localization using structured information retrieval

RK Saha, M Lease, S Khurshid… - 2013 28th IEEE/ACM …, 2013 - ieeexplore.ieee.org
Locating bugs is important, difficult, and expensive, particularly for large-scale systems. To
address this, natural language information retrieval techniques are increasingly being used …