On-device language models: A comprehensive review

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arxiv preprint arxiv …, 2024 - arxiv.org
The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arxiv preprint arxiv …, 2024 - ai.radensa.ru
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

Large language model performance benchmarking on mobile platforms: A thorough evaluation

J **ao, Q Huang, X Chen, C Tian - arxiv preprint arxiv:2410.03613, 2024 - arxiv.org
As large language models (LLMs) increasingly integrate into every aspect of our work and
daily lives, there are growing concerns about user privacy, which push the trend toward local …

A survey on Deep Learning in Edge-Cloud Collaboration: Model partitioning, privacy preservation, and prospects

X Zhang, R Razavi-Far, H Isah, A David… - Knowledge-Based …, 2025 - Elsevier
Recently, the rapid advancements of AI technologies and mobile computing have led to the
growing prevalence of smart devices and rising demands for on-device Deep Learning …

Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method

R Zhang, C Zhao, H Du, D Niyato, J Wang… - arxiv preprint arxiv …, 2025 - arxiv.org
This paper investigates adaptive transmission strategies in embodied AI-enhanced
vehicular networks by integrating large language models (LLMs) for semantic information …

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

TM Pham, PT Nguyen, S Yoon, VD Lai… - arxiv preprint arxiv …, 2024 - arxiv.org
While small language models (SLMs) show promises for mobile deployment, their real-world
performance and applications on smartphones remains underexplored. We present SlimLM …

[PDF][PDF] Enhanced Hybrid Inference Techniques for Scalable On-Device LLM Personalization and Cloud Integration

T Peng, L Liu, M Gupta, K Mehta, A Nair, S Desai… - contexts, 2024 - researchgate.net
Large language models (LLMs) have transformed the landscape of natural language
processing, yet personalization remains a key challenge, particularly in terms of scalability …