Embracing change: Continual learning in deep neural networks

R Hadsell, D Rao, AA Rusu, R Pascanu - Trends in Cognitive Sciences, 2020 - cell.com
Artificial intelligence research has seen enormous progress over the past few decades, but it
predominantly relies on fixed datasets and stationary environments. Continual learning is an …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Efficient diffusion training via Min-SNR weighting strategy

T Hang, S Gu, C Li, J Bao, D Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Denoising diffusion models have been a mainstream approach for image generation,
however, training these models often suffers from slow convergence. In this paper, we …
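
The snippet breaks off before describing the weighting itself. As a rough sketch of the clamped-SNR idea the title refers to (assuming an epsilon-prediction objective; the function name, the gamma default, and the precomputed alphas_cumprod schedule are illustrative assumptions, not the authors' code):

import torch

def min_snr_weights(alphas_cumprod, t, gamma=5.0):
    # Signal-to-noise ratio of the sampled timesteps: SNR(t) = alpha_bar_t / (1 - alpha_bar_t).
    snr = alphas_cumprod[t] / (1.0 - alphas_cumprod[t])
    # Clamp the SNR at gamma so easy, high-SNR timesteps stop dominating the objective.
    return torch.clamp(snr, max=gamma) / snr

# Usage inside a training step (per-sample MSE on the predicted noise, then a weighted average):
# per_sample_mse = (eps_pred - eps).pow(2).flatten(1).mean(dim=1)
# loss = (min_snr_weights(alphas_cumprod, t) * per_sample_mse).mean()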

Prompt-aligned gradient for prompt tuning

B Zhu, Y Niu, Y Han, Y Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Thanks to the large pre-trained vision-language models (VLMs) like CLIP, we can craft a
zero-shot classifier by discrete prompt design, e.g., the confidence score of an image …
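
The snippet cuts off before the gradient-alignment step the title names. A minimal sketch of one such projection, in which a prompt-tuning gradient is stripped of any component that conflicts with a "general knowledge" gradient (for example, the gradient of a KL term toward the zero-shot CLIP predictions); the function and variable names here are assumptions, not the paper's implementation:

import torch

def aligned_gradient(g_task: torch.Tensor, g_general: torch.Tensor) -> torch.Tensor:
    # Flattened parameter gradients; a negative inner product signals a conflict.
    dot = torch.dot(g_task, g_general)
    if dot < 0:
        # Project out the conflicting component so the update never opposes g_general.
        g_task = g_task - (dot / g_general.dot(g_general)) * g_general
    return g_task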

An empirical study of training end-to-end vision-and-language transformers

ZY Dou, Y Xu, Z Gan, J Wang, S Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Vision-and-language (VL) pre-training has proven to be highly effective on various
VL downstream tasks. While recent work has shown that fully transformer-based VL models …

Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning

H He, C Bai, K Xu, Z Yang, W Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Diffusion models have demonstrated highly expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …

ImageNet-21K pretraining for the masses

T Ridnik, E Ben-Baruch, A Noy… - arXiv preprint arXiv …, 2021 - arxiv.org
ImageNet-1K serves as the primary dataset for pretraining deep learning models for
computer vision tasks. The ImageNet-21K dataset, which is bigger and more diverse, is used …

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

A Rame, G Couairon, C Dancette… - Advances in …, 2024 - proceedings.neurips.cc
Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …
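
The snippet stops before the interpolation itself. A minimal sketch of the weight-averaging idea in the title, assuming several copies of one model fine-tuned on different rewards and uniform coefficients by default (the names and defaults are illustrative, not the released code):

import torch

def rewarded_soup(state_dicts, coeffs=None):
    # Convex combination of the parameters of N reward-specific fine-tunes.
    if coeffs is None:
        coeffs = [1.0 / len(state_dicts)] * len(state_dicts)
    assert abs(sum(coeffs) - 1.0) < 1e-6, "interpolation coefficients must sum to 1"
    return {name: sum(c * sd[name] for c, sd in zip(coeffs, state_dicts))
            for name in state_dicts[0]}

# Sweeping the coefficients traces out a family of models trading off the rewards, e.g.:
# model.load_state_dict(rewarded_soup([sd_reward_a, sd_reward_b], [0.7, 0.3]))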

RoboAgent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking

H Bharadhwaj, J Vakil, M Sharma… - … on Robotics and …, 2024 - ieeexplore.ieee.org
The grand aim of having a single robot that can manipulate arbitrary objects in diverse
settings is at odds with the paucity of robotics datasets. Acquiring and growing such datasets …

Conflict-averse gradient descent for multi-task learning

B Liu, X Liu, X Jin, P Stone… - Advances in Neural …, 2021 - proceedings.neurips.cc
The goal of multi-task learning is to enable more efficient learning than single task learning
by sharing model structures for a diverse set of tasks. A standard multi-task learning …
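
The snippet ends before the method. As a rough sketch of a conflict-averse update consistent with the title, solving a dual problem over simplex weights with SciPy and then correcting the average gradient toward the worst-affected tasks (the variable names, the default c, and the omission of any final rescaling are assumptions, not the authors' released code):

import numpy as np
import torch
from scipy.optimize import minimize

def conflict_averse_update(grads: torch.Tensor, c: float = 0.5) -> torch.Tensor:
    # grads: [num_params, num_tasks] matrix whose columns are per-task gradients.
    K = grads.shape[1]
    GG = (grads.t() @ grads).cpu().numpy()        # pairwise gradient inner products
    g0_norm = float(np.sqrt(GG.mean() + 1e-8))    # norm of the average gradient
    radius = c * g0_norm                          # allowed deviation from the average
    b = np.ones(K) / K                            # simplex weights of the average gradient

    def dual(w):
        # Minimize  g_w . g_0 + radius * ||g_w||  over the probability simplex.
        return float(w @ GG @ b + radius * np.sqrt(w @ GG @ w + 1e-8))

    res = minimize(dual, b, bounds=[(0.0, 1.0)] * K,
                   constraints={"type": "eq", "fun": lambda w: w.sum() - 1.0})
    w = torch.as_tensor(res.x, dtype=grads.dtype, device=grads.device)

    gw = grads @ w                                # gradient of the worst-affected task mixture
    g0 = grads.mean(dim=1)                        # plain average-loss gradient
    return g0 + (radius / (gw.norm() + 1e-8)) * gw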