From matching to generation: A survey on generative information retrieval

X Li, J **, Y Zhou, Y Zhang, P Zhang, Y Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Information Retrieval (IR) systems are crucial tools for users to access information, widely
applied in scenarios like search engines, question answering, and recommendation …

Corpuslm: Towards a unified language model on corpus for knowledge-intensive tasks

X Li, Z Dou, Y Zhou, F Liu - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
Large language models (LLMs) have gained significant attention in various fields but prone
to hallucination, especially in knowledge-intensive (KI) tasks. To address this, retrieval …

Pqcache: Product quantization-based kvcache for long context llm inference

H Zhang, X Ji, Y Chen, F Fu, X Miao, X Nie… - arxiv preprint arxiv …, 2024 - arxiv.org
As the field of Large Language Models (LLMs) continues to evolve, the context length in
inference is steadily growing. Key-Value Cache (KVCache), a crucial component in LLM …

A survey of generative search and recommendation in the era of large language models

Y Li, X Lin, W Wang, F Feng, L Pang, W Li, L Nie… - arxiv preprint arxiv …, 2024 - arxiv.org
With the information explosion on the Web, search and recommendation are foundational
infrastructures to satisfying users' information needs. As the two sides of the same coin, both …

Experimental analysis of large-scale learnable vector storage compression

H Zhang, P Zhao, X Miao, Y Shao, Z Liu… - Proceedings of the …, 2023 - dl.acm.org
Learnable embedding vector is one of the most important applications in machine learning,
and is widely used in various database-related domains. However, the high dimensionality …

Towards a Unified Language Model for Knowledge-Intensive Tasks Utilizing External Corpus

X Li, Z Dou, Y Zhou, F Liu - arxiv preprint arxiv:2402.01176, 2024 - arxiv.org
The advent of large language models (LLMs) has showcased their efficacy across various
domains, yet they often hallucinate, especially in knowledge-intensive tasks that require …

Ace: A generative cross-modal retrieval framework with coarse-to-fine semantic modeling

M Fang, S Ji, J Zuo, H Huang, Y **a, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative retrieval, which has demonstrated effectiveness in text-to-text retrieval, utilizes a
sequence-to-sequence model to directly generate candidate identifiers based on natural …

LiNR: Model Based Neural Retrieval on GPUs at LinkedIn

F Borisyuk, Q Song, M Zhou, G Parameswaran… - Proceedings of the 33rd …, 2024 - dl.acm.org
This paper introduces LiNR, LinkedIn's large-scale, GPU-based retrieval system. LiNR
supports a billion-sized index on GPU models. We discuss our experiences and challenges …

Report on The Search Futures Workshop at ECIR 2024

L Azzopardi, CLA Clarke, P Kantor, B Mitra… - ACM SIGIR Forum, 2024 - dl.acm.org
The First Search Futures Workshop, in conjunction with the Fourty-sixth European
Conference on Information Retrieval (ECIR) 2024, looked into the future of search to ask …

Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?

G Penha, A Vardasbi, E Palumbo, M De Nadai… - Proceedings of the 18th …, 2024 - dl.acm.org
Generative retrieval for search and recommendation is a promising paradigm for retrieving
items, offering an alternative to traditional methods that depend on external indexes and …