Information retrieval: recent advances and beyond
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …
utilized in the first and second stages of the typical information retrieval processing chain …
Dense text retrieval based on pretrained language models: A survey
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …
required to return relevant information resources to user's queries in natural language. From …
Multi-view document representation learning for open-domain dense retrieval
Dense retrieval has achieved impressive advances in first-stage retrieval from a large-scale
document collection, which is built on bi-encoder architecture to produce single vector …
document collection, which is built on bi-encoder architecture to produce single vector …
Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data
This paper presents Structure Aware Dense Retrieval (SANTA) model, which encodes user
queries and structured data in one universal embedding space for retrieving structured data …
queries and structured data in one universal embedding space for retrieving structured data …
Data augmentation for sample efficient and robust document ranking
Contextual ranking models have delivered impressive performance improvements over
classical models in the document ranking task. However, these highly over-parameterized …
classical models in the document ranking task. However, these highly over-parameterized …
Beyond two-tower: Attribute guided representation learning for candidate retrieval
Candidate retrieval is a key part of the modern search engines whose goal is to find
candidate items that are semantically related to the query from a large item pool. The core …
candidate items that are semantically related to the query from a large item pool. The core …
[PDF][PDF] Towards Robust Dense Retrieval via Local Ranking Alignment.
Dense retrieval (DR) has extended the employment of pre-trained language models, like
BERT, for text ranking. However, recent studies have raised the robustness issue of DR …
BERT, for text ranking. However, recent studies have raised the robustness issue of DR …
Simplifying content-based neural news recommendation: On user modeling and training objectives
The advent of personalized news recommendation has given rise to increasingly complex
recommender architectures. Most neural news recommenders rely on user click behavior …
recommender architectures. Most neural news recommenders rely on user click behavior …
A roadmap for big model
With the rapid development of deep learning, training Big Models (BMs) for multiple
downstream tasks becomes a popular paradigm. Researchers have achieved various …
downstream tasks becomes a popular paradigm. Researchers have achieved various …
Learning Discrete Document Representations in Web Search
R Huang, D Zhang, W Lu, H Li, M Wang, D Shi… - Proceedings of the 29th …, 2023 - dl.acm.org
Product quantization (PQ) has been usually applied to dense retrieval (DR) of documents
thanks to its competitive time, memory efficiency and compatibility with other approximate …
thanks to its competitive time, memory efficiency and compatibility with other approximate …