Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

O Friha, MA Ferrag, B Kantarci… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …

Advancing transformer architecture in long-context large language models: A comprehensive survey

Y Huang, J Xu, J Lai, Z Jiang, T Chen, Z Li… - arxiv preprint arxiv …, 2023 - arxiv.org
Transformer-based Large Language Models (LLMs) have been applied in diverse areas
such as knowledge bases, human interfaces, and dynamic agents, and marking a stride …

[PDF][PDF] Retrieval-augmented generation for large language models: A survey

Y Gao, Y **ong, X Gao, K Jia, J Pan, Y Bi… - arxiv preprint arxiv …, 2023 - simg.baai.ac.cn
Large language models (LLMs) demonstrate powerful capabilities, but they still face
challenges in practical applications, such as hallucinations, slow knowledge updates, and …

[PDF][PDF] A survey of large language models

WX Zhao, K Zhou, J Li, T Tang… - arxiv preprint arxiv …, 2023 - paper-notes.zhjwpku.com
Ever since the Turing Test was proposed in the 1950s, humans have explored the mastering
of language intelligence by machine. Language is essentially a complex, intricate system of …

Autogen: Enabling next-gen llm applications via multi-agent conversation

Q Wu, G Bansal, J Zhang, Y Wu, B Li, E Zhu… - arxiv preprint arxiv …, 2023 - arxiv.org
AutoGen is an open-source framework that allows developers to build LLM applications via
multiple agents that can converse with each other to accomplish tasks. AutoGen agents are …

[PDF][PDF] Understanding the planning of LLM agents: A survey

X Huang, W Liu, X Chen, X Wang, H Wang… - arxiv preprint arxiv …, 2024 - researchgate.net
Abstract As Large Language Models (LLMs) have shown significant intelligence, the
progress to leverage LLMs as planning modules of autonomous agents has attracted more …

Towards efficient generative large language model serving: A survey from algorithms to systems

X Miao, G Oliaro, Z Zhang, X Cheng, H **… - arxiv preprint arxiv …, 2023 - arxiv.org
In the rapidly evolving landscape of artificial intelligence (AI), generative large language
models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However …

Wise: Rethinking the knowledge memory for lifelong model editing of large language models

P Wang, Z Li, N Zhang, Z Xu, Y Yao… - Advances in …, 2025 - proceedings.neurips.cc
Large language models (LLMs) need knowledge updates to meet the ever-growing world
facts and correct the hallucinated responses, facilitating the methods of lifelong model …

A survey on the memory mechanism of large language model based agents

Z Zhang, X Bo, C Ma, R Li, X Chen, Q Dai, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language model (LLM) based agents have recently attracted much attention from the
research and industry communities. Compared with original LLMs, LLM-based agents are …

Llmlingua-2: Data distillation for efficient and faithful task-agnostic prompt compression

Z Pan, Q Wu, H Jiang, M **a, X Luo, J Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper focuses on task-agnostic prompt compression for better generalizability and
efficiency. Considering the redundancy in natural language, existing approaches compress …