Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

[HTML][HTML] Measurement of text similarity: a survey

J Wang, Y Dong - Information, 2020 - mdpi.com
Text similarity measurement is the basis of natural language processing tasks, which play an
important role in information retrieval, automatic question answering, machine translation …

Improving language models by retrieving from trillions of tokens

S Borgeaud, A Mensch, J Hoffmann… - International …, 2022 - proceedings.mlr.press
We enhance auto-regressive language models by conditioning on document chunks
retrieved from a large corpus, based on local similarity with preceding tokens. With a 2 …

[BOG][B] Pretrained transformers for text ranking: Bert and beyond

J Lin, R Nogueira, A Yates - 2022 - books.google.com
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …

[BOG][B] Machine learning for text: An introduction

CC Aggarwal, CC Aggarwal - 2018 - Springer
The extraction of useful insights from text with various types of statistical algorithms is
referred to as text mining, text analytics, or machine learning from text. The choice of …

Semantic models for the first-stage retrieval: A comprehensive review

J Guo, Y Cai, Y Fan, F Sun, R Zhang… - ACM Transactions on …, 2022 - dl.acm.org
Multi-stage ranking pipelines have been a practical solution in modern search systems,
where the first-stage retrieval is to return a subset of candidate documents and latter stages …

[BOG][B] Lifelong machine learning

Z Chen, B Liu - 2018 - books.google.com
Lifelong Machine Learning, Second Edition is an introduction to an advanced machine
learning paradigm that continuously learns by accumulating past knowledge that it then …

A few brief notes on deepimpact, coil, and a conceptual framework for information retrieval techniques

J Lin, X Ma - arxiv preprint arxiv:2106.14807, 2021 - arxiv.org
Recent developments in representational learning for information retrieval can be organized
in a conceptual framework that establishes two pairs of contrasts: sparse vs. dense …

Explicit semantic ranking for academic search via knowledge graph embedding

C **ong, R Power, J Callan - … of the 26th international conference on …, 2017 - dl.acm.org
This paper introduces Explicit Semantic Ranking (ESR), a new ranking technique that
leverages knowledge graph embedding. Analysis of the query log from our academic search …

Learning to match using local and distributed representations of text for web search

B Mitra, F Diaz, N Craswell - … of the 26th international conference on …, 2017 - dl.acm.org
Models such as latent semantic analysis and those based on neural embeddings learn
distributed representations of text, and match the query against the document in the latent …