Parameter-efficient fine-tuning for large models: A comprehensive survey

Z Han, C Gao, J Liu, J Zhang, SQ Zhang - arXiv preprint arXiv:2403.14608, 2024 - arxiv.org
Large models represent a groundbreaking advancement in multiple application fields,
enabling remarkable achievements across various tasks. However, their unprecedented …

Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

LLaMA-Adapter: Efficient fine-tuning of language models with zero-init attention

R Zhang, J Han, C Liu, P Gao, A Zhou, X Hu… - arXiv preprint arXiv …, 2023 - arxiv.org
We present LLaMA-Adapter, a lightweight adaptation method to efficiently fine-tune LLaMA
into an instruction-following model. Using 52K self-instruct demonstrations, LLaMA-Adapter …
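The zero-init attention idea can be sketched as follows: attention over a small set of learnable adaption prompts is scaled by a gate initialized to zero, so at the start of training the model behaves exactly like the frozen pretrained network. The module below is a minimal, hypothetical simplification (the class name, prompt count, and tanh gate placement are my assumptions, not the paper's exact design).

```python
import torch
import torch.nn as nn

class ZeroInitGatedAttention(nn.Module):
    """Illustrative sketch of zero-initialized gated attention:
    attention over learnable adaption prompts is scaled by a gate
    that starts at zero, so training begins from the frozen
    pretrained behavior."""
    def __init__(self, dim: int, n_prompts: int = 10):
        super().__init__()
        self.prompts = nn.Parameter(torch.randn(n_prompts, dim) * 0.02)
        self.gate = nn.Parameter(torch.zeros(1))  # zero-init gate

    def forward(self, x: torch.Tensor, base_out: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, dim) queries; base_out: frozen attention output
        scores = x @ self.prompts.t() / x.shape[-1] ** 0.5  # (b, s, n_prompts)
        prompt_out = scores.softmax(dim=-1) @ self.prompts  # (b, s, dim)
        # gate == 0 at init, so the output equals the frozen model's output
        return base_out + torch.tanh(self.gate) * prompt_out
```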

A survey on model compression for large language models

X Zhu, J Li, Y Liu, C Ma, W Wang - Transactions of the Association for …, 2024 - direct.mit.edu
Large Language Models (LLMs) have transformed natural language processing
tasks successfully. Yet, their large size and high computational needs pose challenges for …

Aligning large language models with human: A survey

Y Wang, W Zhong, L Li, F Mi, X Zeng, W Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) trained on extensive textual corpora have emerged as
leading solutions for a broad array of Natural Language Processing (NLP) tasks. Despite …

Efficient large language models: A survey

Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …

VeRA: Vector-based random matrix adaptation

DJ Kopiczko, T Blankevoort, YM Asano - arXiv preprint arXiv:2310.11454, 2023 - arxiv.org
Low-rank adaptation (LoRA) is a popular method that reduces the number of trainable
parameters when finetuning large language models, but still faces acute storage challenges …
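A minimal sketch of the VeRA parameterization, assuming the paper's formulation ΔW = Λ_b B Λ_d A: the random matrices A and B are frozen and can be shared across all adapted layers, and only the per-layer scaling vectors d and b are trained. The class name and initialization details below are illustrative assumptions.

```python
import torch
import torch.nn as nn

class VeRALinear(nn.Module):
    """Sketch of a VeRA-style adapted linear layer: frozen shared
    random projections A and B, with trainable scaling vectors
    d (rank dimension) and b (output dimension)."""
    def __init__(self, base: nn.Linear, A: torch.Tensor, B: torch.Tensor):
        super().__init__()
        self.base = base
        for p in self.base.parameters():      # freeze pretrained weights
            p.requires_grad_(False)
        self.register_buffer("A", A)          # (r, in_features), frozen
        self.register_buffer("B", B)          # (out_features, r), frozen
        self.d = nn.Parameter(torch.ones(A.shape[0]))   # trainable
        self.b = nn.Parameter(torch.zeros(B.shape[0]))  # trainable, zero-init

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        delta = (x @ self.A.t()) * self.d     # scale the rank-r projection
        delta = (delta @ self.B.t()) * self.b # scale the output dimensions
        return self.base(x) + delta
```

Because A and B are frozen (and reproducible from a random seed), a fine-tuned checkpoint only needs to store d and b per layer, which is the intuition behind the storage savings over LoRA.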

Chronos: Learning the language of time series

AF Ansari, L Stella, C Turkmen, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Chronos, a simple yet effective framework for pretrained probabilistic time
series models. Chronos tokenizes time series values using scaling and quantization into a …
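The scaling-and-quantization tokenization can be illustrated with a short sketch: values are normalized by their mean absolute value and then mapped to a fixed vocabulary of uniform bins. The bin count, value range, and function names below are assumptions for illustration, not the paper's exact configuration.

```python
import numpy as np

def tokenize_series(values, n_bins=4094, low=-15.0, high=15.0):
    """Illustrative scaling + quantization tokenization in the spirit
    of Chronos: mean-absolute scaling, then uniform binning."""
    scale = float(np.abs(values).mean()) or 1.0   # avoid division by zero
    scaled = values / scale
    edges = np.linspace(low, high, n_bins - 1)    # uniform bin edges
    tokens = np.digitize(scaled, edges)           # map each value to a bin id
    return tokens, scale

def detokenize(tokens, scale, n_bins=4094, low=-15.0, high=15.0):
    """Approximate inverse: bin centers rescaled by the stored scale."""
    centers = np.linspace(low, high, n_bins)
    return centers[tokens] * scale
```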

LoftQ: LoRA-fine-tuning-aware quantization for large language models

Y Li, Y Yu, C Liang, P He, N Karampatziakis… - arXiv preprint arXiv …, 2023 - arxiv.org
Quantization is an indispensable technique for serving Large Language Models (LLMs) and
has recently found its way into LoRA fine-tuning. In this work we focus on the scenario where …
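The core idea can be sketched as an alternating loop: quantize the current residual weight, then fit a rank-r correction to the quantization error via SVD, so that the quantized backbone plus the LoRA initialization jointly approximate the full-precision weights. The uniform quantizer and parameter names below are illustrative stand-ins (LoftQ itself targets NF4-style quantizers).

```python
import torch

def loftq_init(W: torch.Tensor, rank: int = 16, n_iters: int = 5, n_levels: int = 16):
    """Hedged sketch of alternating quantization / low-rank fitting
    in the spirit of LoftQ. Returns the quantized weight Q and the
    LoRA factors A, B such that Q + A @ B approximates W."""
    def quantize(M):
        # crude uniform quantizer, an illustrative assumption
        scale = M.abs().max().clamp(min=1e-8) / (n_levels // 2)
        return (M / scale).round().clamp(-(n_levels // 2), n_levels // 2 - 1) * scale

    A = torch.zeros(W.shape[0], rank)
    B = torch.zeros(rank, W.shape[1])
    for _ in range(n_iters):
        Q = quantize(W - A @ B)               # quantize the current residual
        U, S, Vh = torch.linalg.svd(W - Q, full_matrices=False)
        A = U[:, :rank] * S[:rank]            # rank-r fit of the quantization error
        B = Vh[:rank, :]
    return Q, A, B
```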