Meta-learned models of cognition

M Binz, I Dasgupta, AK Jagadish… - Behavioral and Brain …, 2024 - cambridge.org
Psychologists and neuroscientists extensively rely on computational models for studying
and analyzing the human mind. Traditionally, such computational models have been hand …

Collective intelligence for deep learning: A survey of recent developments

D Ha, Y Tang - Collective Intelligence, 2022 - journals.sagepub.com
In the past decade, we have witnessed the rise of deep learning to dominate the field of
artificial intelligence. Advances in artificial neural networks alongside corresponding …

Transformers learn in-context by gradient descent

J Von Oswald, E Niklasson… - International …, 2023 - proceedings.mlr.press
At present, the mechanisms of in-context learning in Transformers are not well understood
and remain mostly an intuition. In this paper, we suggest that training Transformers on auto …
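A minimal sketch of the mechanism the snippet alludes to (my own illustration, not the paper's code): one gradient-descent step on an in-context linear-regression task, the kind of update the authors argue a trained linear self-attention layer can emulate. All variable names and the learning rate are assumptions.

```python
# Hypothetical sketch: one gradient-descent step on an in-context
# linear-regression task (not the paper's implementation).
import numpy as np

rng = np.random.default_rng(0)
d, n = 4, 16                      # input dimension, number of in-context examples
w_true = rng.normal(size=d)       # task vector the prompt implicitly defines
X = rng.normal(size=(n, d))       # in-context inputs
y = X @ w_true                    # in-context targets

w = np.zeros(d)                   # "model" implied by the forward pass
lr = 0.1
grad = -X.T @ (y - X @ w) / n     # gradient of 0.5 * mean squared error
w = w - lr * grad                 # one in-context "learning" step

x_query = rng.normal(size=d)
print("prediction after one step:", w @ x_query)
print("target:", w_true @ x_query)
```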

Transformers as statisticians: Provable in-context learning with in-context algorithm selection

Y Bai, F Chen, H Wang, C Xiong… - Advances in neural …, 2024 - proceedings.neurips.cc
Neural sequence models based on the transformer architecture have demonstrated
remarkable in-context learning (ICL) abilities, where they can perform new tasks …

What can transformers learn in-context? A case study of simple function classes

S Garg, D Tsipras, PS Liang… - Advances in Neural …, 2022 - proceedings.neurips.cc
In-context learning is the ability of a model to condition on a prompt sequence consisting of
in-context examples (input-output pairs corresponding to some task) along with a new query …
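A hypothetical sketch of the evaluation setup this snippet describes: a prompt of in-context (input, output) pairs drawn from a simple function class (here, linear functions) followed by a query input, plus the least-squares oracle such a learner is typically compared against. The tokenisation and padding scheme below are assumptions; the paper's exact formatting may differ.

```python
# Hypothetical sketch of an in-context learning prompt for a simple
# function class (linear functions); not the paper's code.
import numpy as np

rng = np.random.default_rng(1)
d, n_examples = 8, 20
w = rng.normal(size=d)                        # one task = one linear function
xs = rng.normal(size=(n_examples, d))
ys = xs @ w

# Interleave x_1, f(x_1), ..., x_n, f(x_n), x_query as one prompt sequence
# (scalar outputs padded to the input width so every token has the same shape).
x_query = rng.normal(size=d)
tokens = []
for x, y in zip(xs, ys):
    tokens.append(x)
    tokens.append(np.concatenate([[y], np.zeros(d - 1)]))
tokens.append(x_query)
prompt = np.stack(tokens)                     # shape: (2 * n_examples + 1, d)

# Least-squares oracle baseline for comparison.
w_hat, *_ = np.linalg.lstsq(xs, ys, rcond=None)
print(prompt.shape, "oracle prediction:", x_query @ w_hat)
```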

Why can GPT learn in-context? Language models implicitly perform gradient descent as meta-optimizers

D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui… - arXiv preprint arXiv …, 2022 - arxiv.org
Large pretrained language models have shown surprising in-context learning (ICL) ability.
With a few demonstration input-label pairs, they can predict the label for an unseen input …

Transformers as algorithms: Generalization and stability in in-context learning

Y Li, ME Ildiz, D Papailiopoulos… - … on Machine Learning, 2023 - proceedings.mlr.press
In-context learning (ICL) is a type of prompting where a transformer model operates on a
sequence of (input, output) examples and performs inference on-the-fly. In this work, we …

Learning to (learn at test time): RNNs with expressive hidden states

Y Sun, X Li, K Dalal, J Xu, A Vikram, G Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Self-attention performs well in long context but has quadratic complexity. Existing RNN
layers have linear complexity, but their performance in long context is limited by the …
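A hypothetical sketch of the general idea the title points at (my illustration, not the paper's architecture): the recurrent layer's hidden state is itself a small linear model W, updated by one gradient step per token on a self-supervised reconstruction loss, so the cost stays linear in sequence length. The reconstruction objective, learning rate, and shapes below are assumptions.

```python
# Hypothetical sketch: an "expressive hidden state" that is a weight matrix
# updated at test time by a per-token gradient step (not the paper's code).
import numpy as np

rng = np.random.default_rng(2)
d, T = 16, 128
seq = rng.normal(size=(T, d))

W = np.zeros((d, d))            # hidden state: a whole weight matrix
lr = 0.05
outputs = []
for x in seq:
    loss_grad = (W @ x - x)[:, None] @ x[None, :]   # grad of 0.5 * ||W x - x||^2
    W = W - lr * loss_grad                           # test-time "learning" update
    outputs.append(W @ x)                            # layer output for this token

print(np.stack(outputs).shape)   # (128, 16): one output per input token
```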

Linear transformers are secretly fast weight programmers

I Schlag, K Irie, J Schmidhuber - International Conference on …, 2021 - proceedings.mlr.press
We show the formal equivalence of linearised self-attention mechanisms and fast weight
controllers from the early '90s, where a slow neural net learns by gradient descent to …
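A small numerical check of the equivalence the snippet describes (a sketch under my own assumptions, with the feature maps of the paper omitted): unnormalised linear attention over a prefix equals reading out a fast weight matrix built from outer products of values and keys.

```python
# Hypothetical check: fast-weight readout vs. unnormalised linear attention
# (feature maps and normalisation omitted; not the paper's code).
import numpy as np

rng = np.random.default_rng(3)
d, T = 8, 32
K = rng.normal(size=(T, d))
V = rng.normal(size=(T, d))
Q = rng.normal(size=(T, d))

# Fast-weight view: W_t = W_{t-1} + v_t k_t^T,  y_t = W_t q_t
W = np.zeros((d, d))
fast_out = []
for k, v, q in zip(K, V, Q):
    W = W + np.outer(v, k)
    fast_out.append(W @ q)
fast_out = np.stack(fast_out)

# Attention view: y_t = sum over s <= t of v_s * (k_s . q_t)
attn_out = np.stack([(V[: t + 1].T * (K[: t + 1] @ Q[t])).sum(axis=1)
                     for t in range(T)])

print(np.allclose(fast_out, attn_out))   # True: the two views coincide
```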

Deep language models for interpretative and predictive materials science

Y Hu, MJ Buehler - APL Machine Learning, 2023 - pubs.aip.org
Machine learning (ML) has emerged as an indispensable methodology to describe,
discover, and predict complex physical phenomena that efficiently help us learn underlying …