Mobile edge intelligence for large language models: A contemporary survey

G Qu, Q Chen, W Wei, Z Lin, X Chen… - … Surveys & Tutorials, 2025 - ieeexplore.ieee.org
On-device large language models (LLMs), referring to running LLMs on edge devices, have
raised considerable interest since they are more cost-effective, latency-efficient, and privacy …

Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

O Friha, MA Ferrag, B Kantarci… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …

A survey on the memory mechanism of large language model based agents

Z Zhang, X Bo, C Ma, R Li, X Chen, Q Dai, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language model (LLM) based agents have recently attracted much attention from the
research and industry communities. Compared with original LLMs, LLM-based agents are …

Mvgamba: Unify 3d content generation as state space sequence modeling

X Yi, Z Wu, Q Shen, Q Xu, P Zhou… - Advances in …, 2025 - proceedings.neurips.cc
Recent 3D large reconstruction models (LRMs) can generate high-quality 3D content in sub-
seconds by integrating multi-view diffusion models with scalable multi-view reconstructors …

Efficient training and inference: Techniques for large language models using llama

SR Cunningham, D Archambault, A Kung - Authorea Preprints, 2024 - techrxiv.org
To enhance the efficiency of language models, it would involve optimizing their training and
inference processes to reduce computational demands while maintaining high performance …

On-device language models: A comprehensive review

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arxiv preprint arxiv …, 2024 - arxiv.org
The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

Comparative evaluation of commercial large language models on promptbench: An english and chinese perspective

S Wang, Q Ouyang, B Wang - 2024 - researchsquare.com
This study embarks on an exploration of the performance disparities observed between
English and Chinese in large language models (LLMs), motivated by the growing need for …

[PDF][PDF] Efficient model compression and knowledge distillation on llama 2: Achieving high performance with reduced computational cost

Q Huangpu, H Gao - 2024 - files.osf.io
This study investigates the application of model compression and knowledge distillation
techniques to enhance the computational efficiency of LLama 2, a Large Language Model …

Higher performance of mistral large on mmlu benchmark through two-stage knowledge distillation

J Wilkins, M Rodriguez - 2024 - researchsquare.com
Large language models (LLM) have undergone significant transformations through the
application of knowledge distillation techniques aimed at enhancing performance on …