Split computing and early exiting for deep learning applications: Survey and research challenges

Y Matsubara, M Levorato, F Restuccia - ACM Computing Surveys, 2022 - dl.acm.org
Mobile devices such as smartphones and autonomous vehicles increasingly rely on deep
neural networks (DNNs) to execute complex inference tasks such as image classification …

Fine-tuning language models with just forward passes

S Malladi, T Gao, E Nichani… - Advances in Neural Information Processing Systems, 2023 - proceedings.neurips.cc
Fine-tuning language models (LMs) has yielded success on diverse downstream tasks, but
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …

Language models are Super Mario: Absorbing abilities from homologous models as a free lunch

L Yu, B Yu, H Yu, F Huang, Y Li - Forty-first International Conference on Machine Learning, 2024 - openreview.net
In this paper, we unveil that Language Models (LMs) can acquire new capabilities by
assimilating parameters from homologous models without retraining or GPUs. We first …

Finetuned language models are zero-shot learners

J Wei, M Bosma, VY Zhao, K Guu, AW Yu… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper explores a simple method for improving the zero-shot learning abilities of
language models. We show that instruction tuning--finetuning language models on a …

True few-shot learning with language models

E Perez, D Kiela, K Cho - Advances in Neural Information Processing Systems, 2021 - proceedings.neurips.cc
Pretrained language models (LMs) perform well on many tasks even when learning from a
few examples, but prior work uses many held-out examples to tune various aspects of …

Documenting large webtext corpora: A case study on the colossal clean crawled corpus

J Dodge, M Sap, A Marasović, W Agnew… - arXiv preprint arXiv …, 2021 - arxiv.org
Large language models have led to remarkable progress on many NLP tasks, and
researchers are turning to ever-larger text corpora to train them. Some of the largest corpora …

Time travel in LLMs: Tracing data contamination in large language models

S Golchin, M Surdeanu - arXiv preprint arXiv:2308.08493, 2023 - arxiv.org
Data contamination, i.e., the presence of test data from downstream tasks in the training data
of large language models (LLMs), is a potentially major issue in measuring LLMs' real …