Lm-infinite: Zero-shot extreme length generalization for large language models

C Han, Q Wang, H Peng, W **ong, Y Chen, H Ji… - arxiv preprint arxiv …, 2023 - arxiv.org
Today's large language models (LLMs) typically train on short text segments (eg,< 4K
tokens) due to the quadratic complexity of their Transformer architectures. As a result, their …

Structrag: Boosting knowledge intensive reasoning of llms via inference-time hybrid information structurization

Z Li, X Chen, H Yu, H Lin, Y Lu, Q Tang… - The Thirteenth …, 2024 - openreview.net
Retrieval-augmented generation (RAG) is a key means to effectively enhance large
language models (LLMs) in many knowledge-based tasks. However, existing RAG methods …

Towards lifespan cognitive systems

Y Wang, C Han, T Wu, X He, W Zhou, N Sadeq… - arxiv preprint arxiv …, 2024 - arxiv.org
Building a human-like system that continuously interacts with complex environments--
whether simulated digital worlds or human society--presents several key challenges. Central …

CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems

Y Feng, S Rahman, A Feng, V Chen… - Proceedings of the …, 2024 - dl.acm.org
Compound AI systems (CASs) that employ LLMs as agents to accomplish knowledge-
intensive tasks via interactions with tools and data retrievers have garnered significant …

UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation

Z Li, J **ong, F Ye, C Zheng, X Wu, J Lu, Z Wan… - arxiv preprint arxiv …, 2024 - arxiv.org
We present UncertaintyRAG, a novel approach for long-context Retrieval-Augmented
Generation (RAG) that utilizes Signal-to-Noise Ratio (SNR)-based span uncertainty to …

Does RAG Really Perform Bad For Long-Context Processing?

K Luo, Z Liu, P Zhang, H Qian, J Zhao, K Liu - arxiv preprint arxiv …, 2025 - arxiv.org
The efficient processing of long context poses a serious challenge for large language
models (LLMs). Recently, retrieval-augmented generation (RAG) has emerged as a …

LongRAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall

Z Qi, R Xu, Z Guo, C Wang, H Zhang, W Xu - arxiv preprint arxiv …, 2024 - arxiv.org
Retrieval-augmented generation (RAG) is a promising approach to address the limitations of
fixed knowledge in large language models (LLMs). However, current benchmarks for …

GeAR: Generation Augmented Retrieval

H Liu, S Huang, J Liu, Y Zhan, H Sun, W Deng… - arxiv preprint arxiv …, 2025 - arxiv.org
Document retrieval techniques form the foundation for the development of large-scale
information systems. The prevailing methodology is to construct a bi-encoder and compute …

Mitigating Privacy Risks in LLM Embeddings from Embedding Inversion

T Liu, H Yao, T Wu, Z Qin, F Lin, K Ren… - arxiv preprint arxiv …, 2024 - arxiv.org
Embeddings have become a cornerstone in the functionality of large language models
(LLMs) due to their ability to transform text data into rich, dense numerical representations …

QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain

Q Wei, M Yang, C Han, J Wei, M Zhang, F Shi… - arxiv preprint arxiv …, 2024 - arxiv.org
Retrieval-Augmented Generation (RAG) mitigates the issue of hallucination in Large
Language Models (LLMs) by integrating information retrieval techniques. However, in the …