Transformers as algorithms: Generalization and stability in in-context learning
In-context learning (ICL) is a type of prompting where a transformer model operates on a
sequence of (input, output) examples and performs inference on-the-fly. In this work, we …
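A minimal sketch of the in-context learning setup this abstract describes: the prompt is a serialized sequence of (input, output) demonstration pairs followed by a query input, and the model is asked to produce the query's output on-the-fly, with no weight updates. The serialization format and the toy demonstrations below are illustrative assumptions, not the paper's protocol.

# Build an ICL prompt from (input, output) demonstrations plus a query
# (illustrative serialization; the paper's exact format may differ).
demonstrations = [("3 + 4", "7"), ("10 + 2", "12"), ("6 + 9", "15")]
query = "8 + 5"

prompt = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in demonstrations)
prompt += f"\nInput: {query}\nOutput:"

print(prompt)
# A pretrained transformer would be conditioned on `prompt` and decode the answer;
# no parameters are updated, so all "learning" happens in the forward pass.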
FedAvg with fine tuning: Local updates lead to representation learning
The Federated Averaging (FedAvg) algorithm, which consists of alternating
between a few local stochastic gradient updates at client nodes, followed by a model …
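A minimal NumPy sketch of the FedAvg loop the abstract refers to, under simplifying assumptions (least-squares clients, full-batch local gradients, equal client weighting): each client runs a few local gradient steps from the current global model, and the server averages the resulting local models.

import numpy as np

def fedavg_round(w_global, client_data, local_steps=5, lr=0.1):
    """One FedAvg round on least-squares clients (simplified sketch)."""
    local_models = []
    for X, y in client_data:
        w = w_global.copy()
        for _ in range(local_steps):
            grad = X.T @ (X @ w - y) / len(y)  # full-batch least-squares gradient
            w -= lr * grad
        local_models.append(w)
    return np.mean(local_models, axis=0)  # server-side model averaging

# Toy example: 3 clients holding linear-regression data from the same ground truth.
rng = np.random.default_rng(0)
true_w = np.array([1.0, -2.0])
clients = []
for _ in range(3):
    X = rng.normal(size=(50, 2))
    y = X @ true_w + 0.1 * rng.normal(size=50)
    clients.append((X, y))

w = np.zeros(2)
for _ in range(20):
    w = fedavg_round(w, clients)
print("estimated weights:", w)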
Architecture, dataset and model-scale agnostic data-free meta-learning
The goal of data-free meta-learning is to learn useful prior knowledge from a collection of
pre-trained models without accessing their training data. However, existing works only solve …
Meta-learning without data via Wasserstein distributionally-robust model fusion
Existing meta-learning works assume that each task has available training and testing data.
However, in practice many pre-trained models are available without access to their training data …
The neural process family: Survey, applications and perspectives
The standard approaches to neural network implementation yield powerful function
approximation capabilities but are limited in their abilities to learn meta representations and …
Provable multi-task representation learning by two-layer ReLU neural networks
An increasingly popular machine learning paradigm is to pretrain a neural network (NN) on
many tasks offline, then adapt it to downstream tasks, often by re-training only the last linear …
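A minimal NumPy sketch of the adaptation pattern this abstract mentions (pretrain a representation on many tasks, then adapt downstream by re-training only the last linear layer). The frozen ReLU feature map and the least-squares head below are illustrative assumptions, not the paper's construction.

import numpy as np

rng = np.random.default_rng(0)

# Pretend these first-layer weights were learned during multi-task pretraining
# and are now frozen (illustrative assumption, not the paper's construction).
d, k = 10, 32
W_pretrained = rng.normal(size=(k, d))

def features(X):
    """Frozen two-layer-style representation: ReLU of the pretrained first layer."""
    return np.maximum(X @ W_pretrained.T, 0.0)

# Downstream task: adapt by re-training only the last linear layer (a linear probe),
# here fit by least squares on the frozen features.
X_down = rng.normal(size=(200, d))
y_down = np.sin(X_down[:, 0]) + 0.1 * rng.normal(size=200)

Phi = features(X_down)
head, *_ = np.linalg.lstsq(Phi, y_down, rcond=None)

preds = Phi @ head
print("downstream train MSE:", np.mean((preds - y_down) ** 2))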
Understanding benign overfitting in gradient-based meta learning
Meta learning has demonstrated tremendous success in few-shot learning with limited
supervised data. In those settings, the meta model is usually overparameterized. While the …
Offline multi-task transfer RL with representational penalization
We study the problem of representation transfer in offline Reinforcement Learning (RL),
where a learner has access to episodic data from a number of source tasks collected a …
Understanding inverse scaling and emergence in multitask representation learning
Large language models exhibit strong multitasking capabilities, however, their learning
dynamics as a function of task characteristics, sample size, and model complexity remain …
Provable pathways: Learning multiple tasks over multiple paths
Constructing useful representations across a large number of tasks is a key requirement for
sample-efficient intelligent systems. A traditional idea in multitask learning (MTL) is building …