A survey on LoRA of large language models

Y Mao, Y Ge, Y Fan, W Xu, Y Mi, Z Hu… - Frontiers of Computer …, 2025 - Springer
Abstract: Low-Rank Adaptation (LoRA), which updates the dense neural network layers with
pluggable low-rank matrices, is one of the best-performing parameter-efficient fine-tuning …
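
For context, the mechanism the survey covers is simple: LoRA freezes a pretrained weight matrix W and learns a low-rank correction BA, so the adapted layer computes Wx + (alpha/r)·BAx with B of shape d×r, A of shape r×k, and r much smaller than d or k. A minimal sketch, assuming PyTorch; the class name LoRALinear and the hyperparameter values are illustrative, not taken from the survey:

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        # Frozen pretrained linear layer plus a pluggable low-rank update B @ A.
        def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
            super().__init__()
            self.base = base
            for p in self.base.parameters():
                p.requires_grad = False              # pretrained weights stay frozen
            d_out, d_in = base.out_features, base.in_features
            self.A = nn.Parameter(torch.randn(r, d_in) * 0.01)  # small random init for A
            self.B = nn.Parameter(torch.zeros(d_out, r))        # zero init for B
            self.scaling = alpha / r

        def forward(self, x):
            # y = W x + (alpha / r) * B A x; only A and B receive gradients.
            return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

    layer = LoRALinear(nn.Linear(768, 768), r=8)
    y = layer(torch.randn(4, 768))                   # adapted forward pass

Because only A and B are trained, the adapter adds r·(d_out + d_in) parameters per layer and can be merged back into W after fine-tuning, which is what makes it "pluggable".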

The impact of initialization on LoRA finetuning dynamics

S Hayou, N Ghosh, B Yu - Advances in Neural Information …, 2025 - proceedings.neurips.cc
In this paper, we study the role of initialization in Low-Rank Adaptation (LoRA) as originally
introduced in Hu et al. (2021). Essentially, to start from the pretrained model, one can either …
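
The two standard choices this sentence alludes to both keep the product BA at zero initially by zeroing exactly one factor and drawing the other at random. A minimal sketch of both schemes, assuming PyTorch and the W + BA parameterization sketched above; shapes and scale factors are illustrative:

    import torch

    d_out, d_in, r = 768, 768, 8     # illustrative layer sizes and rank

    # Scheme 1: A random, B zero (the common default). The update B @ A starts at
    # zero, and gradients reach both factors from the first step.
    A = torch.randn(r, d_in) / r ** 0.5
    B = torch.zeros(d_out, r)

    # Scheme 2: B random, A zero. The product is still zero at initialization, but
    # the resulting finetuning dynamics differ, which is what the paper analyzes.
    A_alt = torch.zeros(r, d_in)
    B_alt = torch.randn(d_out, r) / r ** 0.5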

S²FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

X Yang, J Leng, G Guo, J Zhao… - Advances in …, 2025 - proceedings.neurips.cc
Current PEFT methods for LLMs can achieve high quality, efficient training, or scalable
serving, but not all three simultaneously. To address this limitation, we investigate sparse …

Structured Unrestricted-Rank Matrices for Parameter Efficient Finetuning

A Sehanobish, KA Dubey… - Advances in …, 2025 - proceedings.neurips.cc
Recent efforts to scale Transformer models have demonstrated rapid progress across a wide
range of tasks (Wei et al. 2022). However, fine-tuning these models for downstream tasks is …

Prompt compression for large language models: A survey

Z Li, Y Liu, Y Su, N Collier - arXiv preprint arXiv:2410.12388, 2024 - arxiv.org
Leveraging large language models (LLMs) for complex natural language tasks typically
requires long-form prompts to convey detailed requirements and information, which results …

Tensor Product Attention Is All You Need

Y Zhang, Y Liu, H Yuan, Z Qin, Y Yuan, Q Gu… - arXiv preprint arXiv …, 2025 - arxiv.org
Scaling language models to handle longer input sequences typically necessitates large key-
value (KV) caches, resulting in substantial memory overhead during inference. In this paper …

PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning

Q Wang, X Hu, W Xu, W Liu, J Luan, B Wang - arXiv preprint arXiv …, 2024 - arxiv.org
Low-rank adaptation (LoRA) and its variants have recently gained much interest due to their
ability to avoid excessive inference costs. However, LoRA still encounters the following …

SLIM: Let LLM learn more and forget less with soft LoRA and identity mixture

J Han, L Du, H Du, X Zhou, Y Wu, W Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Although many efforts have been made, it is still a challenge to balance the training budget,
downstream performance, and the general capabilities of the LLMs in many applications …

Accurate and efficient fine-tuning of quantized large language models through optimal balance

A Shen, Q Wang, Z Lai, X Li, D Li - arXiv preprint arXiv:2407.17029, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated impressive performance across various
domains. However, the enormous number of model parameters makes fine-tuning …

MedCare: Advancing medical LLMs through decoupling clinical alignment and knowledge aggregation

Y Liao, S Jiang, Z Chen, Y Wang, Y Wang - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have shown substantial progress in natural language
understanding and generation, proving especially valuable in the medical field. Despite …