- Academic Search

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier

This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Salva Cita Citato da 372 Articoli correlati Tutte e 5 le versioni

[Free GPT-4]

[PDF] jair.org

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Salva Cita Citato da 338 Articoli correlati Tutte e 9 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Salva Cita Citato da 4687 Articoli correlati Tutte e 2 le versioni Versione HTML

[Free GPT-4]

[PDF] nowpublishers.com

[CITAZIONE][C] An introduction to variational autoencoders

DP Kingma, M Welling - Foundations and Trends® in …, 2019 - nowpublishers.com

An Introduction to Variational Autoencoders Page 1 An Introduction to Variational Autoencoders
Page 2 Other titles in Foundations and Trends R in Machine Learning Computational Optimal …

Salva Cita Citato da 3306 Articoli correlati Tutte e 11 le versioni Ricerca biblioteche Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Emergent tool use from multi-agent autocurricula

B Baker, I Kanitscheider, T Markov, Y Wu… - arxiv preprint arxiv …, 2019 - arxiv.org

Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …

Salva Cita Citato da 912 Articoli correlati Tutte e 3 le versioni Versione HTML

[Free GPT-4]

[PDF] nowpublishers.com

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

Salva Cita Citato da 1956 Articoli correlati Tutte e 16 le versioni Ricerca biblioteche Versione HTML

[Free GPT-4]

[PDF] mlr.press

Planning to explore via self-supervised world models

R Sekar, O Rybkin, K Daniilidis… - International …, 2020 - proceedings.mlr.press

Reinforcement learning allows solving complex tasks, however, the learning tends to be task-
specific and the sample efficiency remains a challenge. We present Plan2Explore, a self …

Salva Cita Citato da 458 Articoli correlati Tutte e 8 le versioni Versione HTML

Reinforcement learning for intelligent healthcare applications: A survey

A Coronato, M Naeem, G De Pietro… - Artificial intelligence in …, 2020 - Elsevier

Discovering new treatments and personalizing existing ones is one of the major goals of
modern clinical research. In the last decade, Artificial Intelligence (AI) has enabled the …

Salva Cita Citato da 320 Articoli correlati Tutte e 5 le versioni

[Free GPT-4]

[PDF] neurips.cc

Behavior from the void: Unsupervised active pre-training

H Liu, P Abbeel - Advances in Neural Information …, 2021 - proceedings.neurips.cc

We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …

Salva Cita Citato da 219 Articoli correlati Tutte e 7 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Large-scale study of curiosity-driven learning

Y Burda, H Edwards, D Pathak, A Storkey… - arxiv preprint arxiv …, 2018 - arxiv.org

Reinforcement learning algorithms rely on carefully engineering environment rewards that
are extrinsic to the agent. However, annotating each environment with hand-designed …

Salva Cita Citato da 918 Articoli correlati Tutte e 9 le versioni Versione HTML

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Variational information maximisation for intrinsically motivated reinforcement learning

Exploration in deep reinforcement learning: A survey

Towards continual reinforcement learning: A review and perspectives

On the opportunities and risks of foundation models

[CITAZIONE][C] An introduction to variational autoencoders

Emergent tool use from multi-agent autocurricula

An introduction to deep reinforcement learning

Planning to explore via self-supervised world models

Reinforcement learning for intelligent healthcare applications: A survey

Behavior from the void: Unsupervised active pre-training

Large-scale study of curiosity-driven learning