On-device language models: A comprehensive review

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arxiv preprint arxiv …, 2024 - arxiv.org
The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

Llamafactory: Unified efficient fine-tuning of 100+ language models

Y Zheng, R Zhang, J Zhang, Y Ye, Z Luo… - arxiv preprint arxiv …, 2024 - arxiv.org
Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …

Rest-mcts*: Llm self-training via process reward guided tree search

D Zhang, S Zhoubian, Z Hu, Y Yue… - Advances in Neural …, 2025 - proceedings.neurips.cc
Recent methodologies in LLM self-training mostly rely on LLM generating responses and
filtering those with correct output answers as training data. This approach often yields a low …

Minference 1.0: Accelerating pre-filling for long-context llms via dynamic sparse attention

H Jiang, Y Li, C Zhang, Q Wu, X Luo… - Advances in …, 2025 - proceedings.neurips.cc
The computational challenges of Large Language Model (LLM) inference remain a
significant barrier to their widespread deployment, especially as prompt lengths continue to …

Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output

P Zhang, X Dong, Y Zang, Y Cao, R Qian… - arxiv preprint arxiv …, 2024 - arxiv.org
We present InternLM-XComposer-2.5 (IXC-2.5), a versatile large-vision language model that
supports long-contextual input and output. IXC-2.5 excels in various text-image …

From crowdsourced data to high-quality benchmarks: Arena-hard and benchbuilder pipeline

T Li, WL Chiang, E Frick, L Dunlap, T Wu, B Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid evolution of Large Language Models (LLMs) has outpaced the development of
model evaluation, highlighting the need for continuous curation of new, challenging …

Kangaroo: A powerful video-language model supporting long-context video input

J Liu, Y Wang, H Ma, X Wu, X Ma, X Wei, J Jiao… - arxiv preprint arxiv …, 2024 - arxiv.org
Rapid advancements have been made in extending Large Language Models (LLMs) to
Large Multi-modal Models (LMMs). However, extending input modality of LLMs to video data …

Simulated misuse of large language models and clinical credit systems

JT Anibal, HB Huth, J Gunkel, SK Gregurick… - NPJ Digital …, 2024 - nature.com
In the future, large language models (LLMs) may enhance the delivery of healthcare, but
there are risks of misuse. These methods may be trained to allocate resources via unjust …

Simulating classroom education with llm-empowered agents

Z Zhang, D Zhang-Li, J Yu, L Gong, J Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have been applied across various intelligent educational
tasks to assist teaching. While preliminary studies have focused on task-specific …

A survey on multilingual large language models: Corpora, alignment, and bias

Y Xu, L Hu, J Zhao, Z Qiu, K XU, Y Ye, H Gu - arxiv preprint arxiv …, 2024 - arxiv.org
Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs)
have been developed to address the challenges faced in multilingual natural language …