A comprehensive overview of large language models
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …
Semantic structure in deep learning
E Pavlick - Annual Review of Linguistics, 2022 - annualreviews.org
Deep learning has recently come to dominate computational linguistics, leading to claims of
human-level performance in a range of language processing tasks. Like much previous …
Language is not all you need: Aligning perception with language models
A big convergence of language, multimodal perception, action, and world modeling is a key
step toward artificial general intelligence. In this work, we introduce KOSMOS-1, a …
Is ChatGPT a general-purpose natural language processing task solver?
Spurred by advancements in scale, large language models (LLMs) have demonstrated the
ability to perform a variety of natural language processing (NLP) tasks zero-shot, i.e., without …
Symbolic discovery of optimization algorithms
We present a method to formulate algorithm discovery as program search, and apply it to
discover optimization algorithms for deep neural network training. We leverage efficient …
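The snippet breaks off before the discovered algorithm itself. The optimizer most commonly associated with this program-search result is a sign-based momentum update (reported under the name Lion); the following sketch assumes that reading, and the toy quadratic objective, learning rate, and beta values are illustrative choices only, not the paper's setup.

```python
import numpy as np

# Minimal sketch of a sign-based momentum update of the kind reported as the
# outcome of this program search (commonly called Lion). Hyperparameters and
# the toy objective below are invented for illustration.

def lion_step(theta, grad, m, lr=0.01, beta1=0.9, beta2=0.99, wd=0.0):
    update = np.sign(beta1 * m + (1 - beta1) * grad)  # interpolated momentum, sign only
    theta = theta - lr * (update + wd * theta)        # decoupled weight decay
    m = beta2 * m + (1 - beta2) * grad                # momentum tracks the raw gradient
    return theta, m

theta, m = np.full(4, 3.0), np.zeros(4)
for _ in range(300):
    grad = 2 * (theta - 1.0)                          # gradient of a toy quadratic
    theta, m = lion_step(theta, grad, m)
print(theta)                                          # oscillates near the minimizer at 1.0
```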
Scaling data-constrained language models
The current trend of scaling language models involves increasing both parameter count and
training dataset size. Extrapolating this trend suggests that training dataset size may soon be …
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …
Fine-tuning language models with just forward passes
Fine-tuning language models (LMs) has yielded success on diverse downstream tasks, but
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …
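The snippet stops before the method, but the forward-pass-only idea it points to is typically a zeroth-order gradient estimate built from perturbed forward evaluations rather than backpropagation. Below is a minimal SPSA-style sketch under that assumption; the toy objective, step size, and perturbation scale are invented for illustration.

```python
import numpy as np

# Illustrative two-point zeroth-order (SPSA-style) gradient estimate: a
# forward-pass-only stand-in for backpropagation. Not the paper's actual
# model, data, or hyperparameters.

def spsa_gradient(loss_fn, theta, rng, eps=1e-3):
    z = rng.standard_normal(theta.shape)                  # random perturbation direction
    delta = loss_fn(theta + eps * z) - loss_fn(theta - eps * z)
    return (delta / (2 * eps)) * z                        # unbiased estimate of the gradient

rng = np.random.default_rng(0)
loss_fn = lambda w: np.sum((w - 1.0) ** 2)                # toy objective with minimum at 1.0
theta = np.zeros(4)
for _ in range(500):
    theta -= 0.05 * spsa_gradient(loss_fn, theta, rng)    # two forward passes per step, no backprop
print(theta)                                              # approaches the minimizer at 1.0
```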
Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning
Few-shot in-context learning (ICL) enables pre-trained language models to perform a
previously-unseen task without any gradient-based training by feeding a small number of …
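Since the snippet describes in-context learning as feeding a small number of demonstrations to a frozen model, a minimal sketch of how such a prompt is assembled may help; the formatting, the example pairs, and reading the next generated token as the label are illustrative assumptions, not the paper's exact protocol.

```python
# Hypothetical sketch of a few-shot in-context learning prompt: the pretrained
# model is not updated; demonstration input-label pairs are concatenated ahead
# of the query, and the model's continuation is read off as the prediction.

def build_icl_prompt(demonstrations, query):
    """Format demonstration pairs and the unseen query into one prompt string."""
    lines = [f"Input: {x}\nLabel: {y}" for x, y in demonstrations]
    lines.append(f"Input: {query}\nLabel:")
    return "\n\n".join(lines)

demos = [("the movie was wonderful", "positive"),
         ("a dull, lifeless plot", "negative")]
prompt = build_icl_prompt(demos, "surprisingly sharp writing")
print(prompt)   # sent to a frozen LM; its next token serves as the predicted label
```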
Why can GPT learn in-context? Language models implicitly perform gradient descent as meta-optimizers
Large pretrained language models have shown surprising in-context learning (ICL) ability.
With a few demonstration input-label pairs, they can predict the label for an unseen input …
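This paper's dual-form argument is often illustrated with linear attention: attending to demonstration key-value pairs acts like an accumulated, gradient-descent-style rank-1 update to the weights applied to the query. The toy check below assumes that simplified linear-attention reading; all shapes and values are made up.

```python
import numpy as np

# Toy check of the "ICL as implicit gradient descent" view under a linear
# attention simplification: attention over demonstration key-value pairs adds
# sum_i v_i k_i^T to the weights applied to the query, matching an
# outer-product (gradient-style) update of a linear layer.

rng = np.random.default_rng(0)
d = 3
W0 = rng.standard_normal((d, d))      # "zero-shot" weights applied to the query
K = rng.standard_normal((5, d))       # demonstration keys
V = rng.standard_normal((5, d))       # demonstration values
q = rng.standard_normal(d)            # query vector

attn_out = W0 @ q + V.T @ (K @ q)     # linear attention over the demonstrations
W_updated = W0 + V.T @ K              # equivalent accumulated rank-1 weight update
assert np.allclose(attn_out, W_updated @ q)
print(attn_out)
```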