- Academic Search

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier

This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Speichern Zitieren Zitiert von: 373 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] jair.org

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Speichern Zitieren Zitiert von: 341 Ähnliche Artikel Alle 9 Versionen HTML-Version

[Free GPT-4]

[PDF] mlr.press

Reinforcement learning from passive data via latent intentions

D Ghosh, CA Bhateja, S Levine - … Conference on Machine …, 2023 - proceedings.mlr.press

Passive observational data, such as human videos, is abundant and rich in information, yet
remains largely untapped by current RL methods. Perhaps surprisingly, we show that …

Speichern Zitieren Zitiert von: 44 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] mdpi.com

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com

Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Speichern Zitieren Zitiert von: 112 Ähnliche Artikel Alle 8 Versionen Im Cache

[Free GPT-4]

[PDF] arxiv.org

Understanding and preventing capacity loss in reinforcement learning

C Lyle, M Rowland, W Dabney - arxiv preprint arxiv:2204.09560, 2022 - arxiv.org

The reinforcement learning (RL) problem is rife with sources of non-stationarity, making it a
notoriously difficult problem domain for the application of neural networks. We identify a …

Speichern Zitieren Zitiert von: 116 Ähnliche Artikel Alle 4 Versionen HTML-Version

[Free GPT-4]

[PDF] aaai.org

Count-based exploration with the successor representation

MC Machado, MG Bellemare, M Bowling - Proceedings of the AAAI …, 2020 - ojs.aaai.org

In this paper we introduce a simple approach for exploration in reinforcement learning (RL)
that allows us to develop theoretically justified algorithms in the tabular case but that is also …

Speichern Zitieren Zitiert von: 235 Ähnliche Artikel Alle 10 Versionen HTML-Version

[Free GPT-4]

[PDF] mlr.press

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press

We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

Speichern Zitieren Zitiert von: 34 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[HTML] nih.gov

Learning structures: predictive representations, replay, and generalization

I Momennejad - Current Opinion in Behavioral Sciences, 2020 - Elsevier

Memory and planning rely on learning the structure of relationships among experiences.
Compact representations of these structures guide flexible behavior in humans and animals …

Speichern Zitieren Zitiert von: 159 Ähnliche Artikel Alle 11 Versionen

[Free GPT-4]

[PDF] aaai.org

SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning

D Lyu, F Yang, B Liu, S Gustafson - … of the AAAI Conference on Artificial …, 2019 - ojs.aaai.org

Deep reinforcement learning (DRL) has gained great success by learning directly from high-
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …

Speichern Zitieren Zitiert von: 195 Ähnliche Artikel Alle 13 Versionen HTML-Version

[Free GPT-4]

[PDF] openreview.net

Option discovery using deep skill chaining

A Bagaria, G Konidaris - International Conference on Learning …, 2019 - openreview.net

Autonomously discovering temporally extended actions, or skills, is a longstanding goal of
hierarchical reinforcement learning. We propose a new algorithm that combines skill …

Speichern Zitieren Zitiert von: 138 Ähnliche Artikel Alle 10 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Eigenoption discovery through the deep successor representation

Exploration in deep reinforcement learning: A survey

Towards continual reinforcement learning: A review and perspectives

Reinforcement learning from passive data via latent intentions

Hierarchical reinforcement learning: A survey and open research challenges

Understanding and preventing capacity loss in reinforcement learning

Count-based exploration with the successor representation

Understanding self-predictive learning for reinforcement learning

Learning structures: predictive representations, replay, and generalization

SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning

Option discovery using deep skill chaining