Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

Multi-view document representation learning for open-domain dense retrieval

S Zhang, Y Liang, M Gong, D Jiang, N Duan - arxiv preprint arxiv …, 2022 - arxiv.org
Dense retrieval has achieved impressive advances in first-stage retrieval from a large-scale
document collection, which is built on bi-encoder architecture to produce single vector …

Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data

X Li, Z Liu, C **ong, S Yu, Y Gu, Z Liu, G Yu - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents Structure Aware Dense Retrieval (SANTA) model, which encodes user
queries and structured data in one universal embedding space for retrieving structured data …

Data augmentation for sample efficient and robust document ranking

A Anand, J Leonhardt, J Singh, K Rudra… - ACM Transactions on …, 2024 - dl.acm.org
Contextual ranking models have delivered impressive performance improvements over
classical models in the document ranking task. However, these highly over-parameterized …

Beyond two-tower: Attribute guided representation learning for candidate retrieval

H Shan, Q Zhang, Z Liu, G Zhang, C Li - Proceedings of the ACM Web …, 2023 - dl.acm.org
Candidate retrieval is a key part of the modern search engines whose goal is to find
candidate items that are semantically related to the query from a large item pool. The core …

[PDF][PDF] Towards Robust Dense Retrieval via Local Ranking Alignment.

X Chen, J Luo, B He, Le Sun 0001, Y Sun - IJCAI, 2022 - ijcai.org
Dense retrieval (DR) has extended the employment of pre-trained language models, like
BERT, for text ranking. However, recent studies have raised the robustness issue of DR …

Simplifying content-based neural news recommendation: On user modeling and training objectives

A Iana, G Glavas, H Paulheim - … of the 46th International ACM SIGIR …, 2023 - dl.acm.org
The advent of personalized news recommendation has given rise to increasingly complex
recommender architectures. Most neural news recommenders rely on user click behavior …

A roadmap for big model

S Yuan, H Zhao, S Zhao, J Leng, Y Liang… - arxiv preprint arxiv …, 2022 - arxiv.org
With the rapid development of deep learning, training Big Models (BMs) for multiple
downstream tasks becomes a popular paradigm. Researchers have achieved various …

Learning Discrete Document Representations in Web Search

R Huang, D Zhang, W Lu, H Li, M Wang, D Shi… - Proceedings of the 29th …, 2023 - dl.acm.org
Product quantization (PQ) has been usually applied to dense retrieval (DR) of documents
thanks to its competitive time, memory efficiency and compatibility with other approximate …