From matching to generation: A survey on generative information retrieval

X Li, J **, Y Zhou, Y Zhang, P Zhang, Y Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Information Retrieval (IR) systems are crucial tools for users to access information, widely
applied in scenarios like search engines, question answering, and recommendation …

Retrieval augmented generation (rag) and beyond: A comprehensive survey on how to make your llms use external data more wisely

S Zhao, Y Yang, Z Wang, Z He, LK Qiu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) augmented with external data have demonstrated
remarkable capabilities in completing real-world tasks. Techniques for integrating external …

A survey of generative search and recommendation in the era of large language models

Y Li, X Lin, W Wang, F Feng, L Pang, W Li, L Nie… - arxiv preprint arxiv …, 2024 - arxiv.org
With the information explosion on the Web, search and recommendation are foundational
infrastructures to satisfying users' information needs. As the two sides of the same coin, both …

VISTA: visualized text embedding for universal multi-modal retrieval

J Zhou, Z Liu, S **ao, B Zhao, Y **ong - arxiv preprint arxiv:2406.04292, 2024 - arxiv.org
Multi-modal retrieval becomes increasingly popular in practice. However, the existing
retrievers are mostly text-oriented, which lack the capability to process visual information …

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Z Qin, D Chen, W Zhang, L Yao, Y Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of large language models (LLMs) has been witnessed in recent
years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from …

Ace: A generative cross-modal retrieval framework with coarse-to-fine semantic modeling

M Fang, S Ji, J Zuo, H Huang, Y **a, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative retrieval, which has demonstrated effectiveness in text-to-text retrieval, utilizes a
sequence-to-sequence model to directly generate candidate identifiers based on natural …

Retrieval-augmented generation with graphs (graphrag)

H Han, Y Wang, H Shomer, K Guo, J Ding, Y Lei… - arxiv preprint arxiv …, 2024 - arxiv.org
Retrieval-augmented generation (RAG) is a powerful technique that enhances downstream
task execution by retrieving additional information, such as knowledge, skills, and tools from …

Ml-mamba: Efficient multi-modal large language model utilizing mamba-2

W Huang, J Pan, J Tang, Y Ding, Y **ng… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) have attracted much attention for their
multifunctionality. However, traditional Transformer architectures incur significant overhead …

Trust in internal or external knowledge? generative multi-modal entity linking with knowledge retriever

X Long, J Zeng, F Meng, J Zhou… - Findings of the …, 2024 - aclanthology.org
Multi-modal entity linking (MEL) is a challenging task that requires accurate prediction of
entities within extensive search spaces, utilizing multi-modal contexts. Existing generative …

[HTML][HTML] GDT Framework: Integrating Generative Design and Design Thinking for Sustainable Development in the AI Era

Y Chen, Z Qin, L Sun, J Wu, W Ai, J Chao, H Li, J Li - Sustainability, 2025 - mdpi.com
The ability of AI to process vast datasets can enhance creativity, but its rigid knowledge base
and lack of reflective thinking limit sustainable design. Generative Design Thinking (GDT) …