Exploration in deep reinforcement learning: A survey

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Reinforcement learning from passive data via latent intentions

D Ghosh, CA Bhateja, S Levine - … Conference on Machine …, 2023 - proceedings.mlr.press
Passive observational data, such as human videos, is abundant and rich in information, yet
remains largely untapped by current RL methods. Perhaps surprisingly, we show that …

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Understanding and preventing capacity loss in reinforcement learning

C Lyle, M Rowland, W Dabney - arxiv preprint arxiv:2204.09560, 2022 - arxiv.org
The reinforcement learning (RL) problem is rife with sources of non-stationarity, making it a
notoriously difficult problem domain for the application of neural networks. We identify a …

Count-based exploration with the successor representation

MC Machado, MG Bellemare, M Bowling - Proceedings of the AAAI …, 2020 - ojs.aaai.org
In this paper we introduce a simple approach for exploration in reinforcement learning (RL)
that allows us to develop theoretically justified algorithms in the tabular case but that is also …

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press
We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

Learning structures: predictive representations, replay, and generalization

I Momennejad - Current Opinion in Behavioral Sciences, 2020 - Elsevier
Memory and planning rely on learning the structure of relationships among experiences.
Compact representations of these structures guide flexible behavior in humans and animals …

SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning

D Lyu, F Yang, B Liu, S Gustafson - … of the AAAI Conference on Artificial …, 2019 - ojs.aaai.org
Deep reinforcement learning (DRL) has gained great success by learning directly from high-
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …

Option discovery using deep skill chaining

A Bagaria, G Konidaris - International Conference on Learning …, 2019 - openreview.net
Autonomously discovering temporally extended actions, or skills, is a longstanding goal of
hierarchical reinforcement learning. We propose a new algorithm that combines skill …