Google Academic

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arxiv preprint arxiv …, 2024 - arxiv.org

The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

Salvați Citați Citat de 23 ori Articole cu conținut similar Toate cele 3 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Llamafactory: Unified efficient fine-tuning of 100+ language models

Y Zheng, R Zhang, J Zhang, Y Ye, Z Luo… - arxiv preprint arxiv …, 2024 - arxiv.org

Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …

Salvați Citați Citat de 328 ori Articole cu conținut similar Toate cele 4 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Rest-mcts*: Llm self-training via process reward guided tree search

D Zhang, S Zhoubian, Z Hu, Y Yue… - Advances in Neural …, 2025 - proceedings.neurips.cc

Recent methodologies in LLM self-training mostly rely on LLM generating responses and
filtering those with correct output answers as training data. This approach often yields a low …

Salvați Citați Citat de 81 ori Articole cu conținut similar Toate cele 6 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Minference 1.0: Accelerating pre-filling for long-context llms via dynamic sparse attention

H Jiang, Y Li, C Zhang, Q Wu, X Luo… - Advances in …, 2025 - proceedings.neurips.cc

The computational challenges of Large Language Model (LLM) inference remain a
significant barrier to their widespread deployment, especially as prompt lengths continue to …

Salvați Citați Citat de 54 ori Articole cu conținut similar Toate cele 4 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output

P Zhang, X Dong, Y Zang, Y Cao, R Qian… - arxiv preprint arxiv …, 2024 - arxiv.org

We present InternLM-XComposer-2.5 (IXC-2.5), a versatile large-vision language model that
supports long-contextual input and output. IXC-2.5 excels in various text-image …

Salvați Citați Citat de 84 ori Articole cu conținut similar Toate cele 3 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

From crowdsourced data to high-quality benchmarks: Arena-hard and benchbuilder pipeline

T Li, WL Chiang, E Frick, L Dunlap, T Wu, B Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org

The rapid evolution of Large Language Models (LLMs) has outpaced the development of
model evaluation, highlighting the need for continuous curation of new, challenging …

Salvați Citați Citat de 92 ori Articole cu conținut similar Toate cele 5 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Kangaroo: A powerful video-language model supporting long-context video input

J Liu, Y Wang, H Ma, X Wu, X Ma, X Wei, J Jiao… - arxiv preprint arxiv …, 2024 - arxiv.org

Rapid advancements have been made in extending Large Language Models (LLMs) to
Large Multi-modal Models (LMMs). However, extending input modality of LLMs to video data …

Salvați Citați Citat de 34 ori Articole cu conținut similar Toate cele 3 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Simulated misuse of large language models and clinical credit systems

JT Anibal, HB Huth, J Gunkel, SK Gregurick… - NPJ Digital …, 2024 - nature.com

In the future, large language models (LLMs) may enhance the delivery of healthcare, but
there are risks of misuse. These methods may be trained to allocate resources via unjust …

Salvați Citați Citat de 5 ori Articole cu conținut similar Toate cele 11 versiuni

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Simulating classroom education with llm-empowered agents

Z Zhang, D Zhang-Li, J Yu, L Gong, J Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs) have been applied across various intelligent educational
tasks to assist teaching. While preliminary studies have focused on task-specific …

Salvați Citați Citat de 37 ori Articole cu conținut similar Toate cele 3 versiuni Afișare ca HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on multilingual large language models: Corpora, alignment, and bias

Y Xu, L Hu, J Zhao, Z Qiu, K XU, Y Ye, H Gu - arxiv preprint arxiv …, 2024 - arxiv.org

Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs)
have been developed to address the challenges faced in multilingual natural language …

Salvați Citați Citat de 32 ori Articole cu conținut similar Toate cele 2 versiuni Afișare ca HTML

Creează alerta

Citați

Căutare avansată

Salvat în Bibliotecă

Chatglm: A family of large language models from glm-130b to glm-4 all tools

On-device language models: A comprehensive review

Llamafactory: Unified efficient fine-tuning of 100+ language models

Rest-mcts*: Llm self-training via process reward guided tree search

Minference 1.0: Accelerating pre-filling for long-context llms via dynamic sparse attention

Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output

From crowdsourced data to high-quality benchmarks: Arena-hard and benchbuilder pipeline

Kangaroo: A powerful video-language model supporting long-context video input

Simulated misuse of large language models and clinical credit systems

Simulating classroom education with llm-empowered agents

A survey on multilingual large language models: Corpora, alignment, and bias