Google Наука

G Qu, Q Chen, W Wei, Z Lin, X Chen… - … Surveys & Tutorials, 2025 - ieeexplore.ieee.org

On-device large language models (LLMs), referring to running LLMs on edge devices, have
raised considerable interest since they are more cost-effective, latency-efficient, and privacy …

Запазване Позоваване С позовавания в 31 Сродни статии Всички 5 версии

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

O Friha, MA Ferrag, B Kantarci… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org

The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …

Запазване Позоваване С позовавания в 32 Сродни статии Всички 2 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on the memory mechanism of large language model based agents

Z Zhang, X Bo, C Ma, R Li, X Chen, Q Dai, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language model (LLM) based agents have recently attracted much attention from the
research and industry communities. Compared with original LLMs, LLM-based agents are …

Запазване Позоваване С позовавания в 76 Сродни статии Всички 3 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Mvgamba: Unify 3d content generation as state space sequence modeling

X Yi, Z Wu, Q Shen, Q Xu, P Zhou… - Advances in …, 2025 - proceedings.neurips.cc

Recent 3D large reconstruction models (LRMs) can generate high-quality 3D content in sub-
seconds by integrating multi-view diffusion models with scalable multi-view reconstructors …

Запазване Позоваване С позовавания в 9 Сродни статии Всички 5 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] techrxiv.org

Efficient training and inference: Techniques for large language models using llama

SR Cunningham, D Archambault, A Kung - Authorea Preprints, 2024 - techrxiv.org

To enhance the efficiency of language models, it would involve optimizing their training and
inference processes to reduce computational demands while maintaining high performance …

Запазване Позоваване С позовавания в 64 Сродни статии Всички 2 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On-device language models: A comprehensive review

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arxiv preprint arxiv …, 2024 - arxiv.org

The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

Запазване Позоваване С позовавания в 23 Сродни статии Всички 3 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchsquare.com

Comparative evaluation of commercial large language models on promptbench: An english and chinese perspective

S Wang, Q Ouyang, B Wang - 2024 - researchsquare.com

This study embarks on an exploration of the performance disparities observed between
English and Chinese in large language models (LLMs), motivated by the growing need for …

Запазване Позоваване С позовавания в 53 Сродни статии Всички 4 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[HTML] mdpi.com

[HTML][HTML] Large language models for wearable sensor-based human activity recognition, health monitoring, and behavioral modeling: a survey of early trends, datasets …

E Ferrara - Sensors, 2024 - mdpi.com

The proliferation of wearable technology enables the generation of vast amounts of sensor
data, offering significant opportunities for advancements in health monitoring, activity …

Запазване Позоваване С позовавания в 12 Сродни статии Всички 12 версии Кеширана версия

[Free GPT-4]
[DeepSeek]

[PDF] osf.io

[PDF][PDF] Efficient model compression and knowledge distillation on llama 2: Achieving high performance with reduced computational cost

Q Huangpu, H Gao - 2024 - files.osf.io

This study investigates the application of model compression and knowledge distillation
techniques to enhance the computational efficiency of LLama 2, a Large Language Model …

Запазване Позоваване С позовавания в 45 Сродни статии Всички 4 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchsquare.com

Higher performance of mistral large on mmlu benchmark through two-stage knowledge distillation

J Wilkins, M Rodriguez - 2024 - researchsquare.com

Large language models (LLM) have undergone significant transformations through the
application of knowledge distillation techniques aimed at enhancing performance on …

Запазване Позоваване С позовавания в 46 Сродни статии Всички 3 версии Във вид на HTML

Създаване на сигнал

Позоваване

Разширено търсене

Запазено в „Моята библиотека“

Beyond efficiency: A systematic survey of resource-efficient large language models

Mobile edge intelligence for large language models: A contemporary survey

Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

A survey on the memory mechanism of large language model based agents

Mvgamba: Unify 3d content generation as state space sequence modeling

Efficient training and inference: Techniques for large language models using llama

On-device language models: A comprehensive review

Comparative evaluation of commercial large language models on promptbench: An english and chinese perspective

[HTML][HTML] Large language models for wearable sensor-based human activity recognition, health monitoring, and behavioral modeling: a survey of early trends, datasets …

[PDF][PDF] Efficient model compression and knowledge distillation on llama 2: Achieving high performance with reduced computational cost

Higher performance of mistral large on mmlu benchmark through two-stage knowledge distillation