- Academic Search

A Aubret, L Matignon, S Hassas - Entropy, 2023 - mdpi.com

The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …

保存引用被引用次数：47 相关文章所有 10 个版本网页快照

[Free GPT-4]

[PDF] neurips.cc

Byol-explore: Exploration by bootstrapped prediction

Z Guo, S Thakoor, M Pîslar… - Advances in neural …, 2022 - proceedings.neurips.cc

We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven
exploration in visually complex environments. BYOL-Explore learns the world …

保存引用被引用次数：72 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] jmlr.org

Convex reinforcement learning in finite trials

M Mutti, R De Santi, P De Bartolomeis… - Journal of Machine …, 2023 - jmlr.org

Convex Reinforcement Learning (RL) is a recently introduced framework that generalizes
the standard RL objective to any convex (or concave) function of the state distribution …

保存引用被引用次数：14 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] neurips.cc

Maximum state entropy exploration using predecessor and successor representations

AK Jain, L Lehnert, I Rish… - Advances in Neural …, 2024 - proceedings.neurips.cc

Animals have a developed ability to explore that aids them in important tasks such as
locating food, exploring for shelter, and finding misplaced items. These exploration skills …

保存引用被引用次数：9 相关文章所有 7 个版本 HTML 版

[Free GPT-4]

[PDF] neurips.cc

Accelerating reinforcement learning with value-conditional state entropy exploration

D Kim, J Shin, P Abbeel, Y Seo - Advances in Neural …, 2024 - proceedings.neurips.cc

A promising technique for exploration is to maximize the entropy of visited state distribution,
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …

保存引用被引用次数：17 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arxiv preprint arxiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

保存引用被引用次数：17 相关文章所有 2 个版本 HTML 版

[Free GPT-4]

[PDF] aaai.org

Cem: Constrained entropy maximization for task-agnostic safe exploration

Q Yang, MTJ Spaan - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

In the absence of assigned tasks, a learning agent typically seeks to explore its environment
efficiently. However, the pursuit of exploration will bring more safety risks. An under-explored …

保存引用被引用次数：17 相关文章所有 7 个版本 HTML 版

[Free GPT-4]

[PDF] mlr.press

The importance of non-markovianity in maximum state entropy exploration

M Mutti, R De Santi, M Restelli - International Conference on …, 2022 - proceedings.mlr.press

In the maximum state entropy exploration framework, an agent interacts with a reward-free
environment to learn a policy that maximizes the entropy of the expected state visitations it is …

保存引用被引用次数：30 相关文章所有 7 个版本 HTML 版

[Free GPT-4]

[PDF] neurips.cc

Challenging common assumptions in convex reinforcement learning

M Mutti, R De Santi… - Advances in Neural …, 2022 - proceedings.neurips.cc

Abstract The classic Reinforcement Learning (RL) formulation concerns the maximization of
a scalar reward function. More recently, convex RL has been introduced to extend the RL …

保存引用被引用次数：22 相关文章所有 9 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Pretraining in deep reinforcement learning: A survey

Z **e, Z Lin, J Li, S Li, D Ye - arxiv preprint arxiv:2211.03959, 2022 - arxiv.org

The past few years have seen rapid progress in combining reinforcement learning (RL) with
deep learning. Various breakthroughs ranging from games to robotics have spurred the …

保存引用被引用次数：21 相关文章所有 2 个版本 HTML 版

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Geometric entropic exploration

An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey

Byol-explore: Exploration by bootstrapped prediction

Convex reinforcement learning in finite trials

Maximum state entropy exploration using predecessor and successor representations

Accelerating reinforcement learning with value-conditional state entropy exploration

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

Cem: Constrained entropy maximization for task-agnostic safe exploration

The importance of non-markovianity in maximum state entropy exploration

Challenging common assumptions in convex reinforcement learning

Pretraining in deep reinforcement learning: A survey