A study of global and episodic bonuses for exploration in contextual mdps

M Henaff, M Jiang, R Raileanu - International Conference on …, 2023 - proceedings.mlr.press
Exploration in environments which differ across episodes has received increasing attention
in recent years. Current methods use some combination of global novelty bonuses …

Automatic intrinsic reward sha** for exploration in deep reinforcement learning

M Yuan, B Li, X **, W Zeng - International Conference on …, 2023 - proceedings.mlr.press
Abstract We present AIRS: Automatic Intrinsic Reward Sha** that intelligently and
adaptively provides high-quality intrinsic rewards to enhance exploration in reinforcement …

Effective Exploration Based on the Structural Information Principles

X Zeng, H Peng, A Li - arxiv preprint arxiv:2410.06621, 2024 - arxiv.org
Traditional information theory provides a valuable foundation for Reinforcement Learning,
particularly through representation learning and entropy maximization for agent exploration …

Goal-conditioned reinforcement learning with disentanglement-based reachability planning

Z Qian, M You, H Zhou, X Xu… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Goal-Conditioned Reinforcement Learning (GCRL) can enable agents to spontaneously set
diverse goals to learn a set of skills. Despite the excellent works proposed in various fields …

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning

M Yuan, RC Castanyer, B Li, X **, G Berseth… - arxiv preprint arxiv …, 2024 - arxiv.org
Extrinsic rewards can effectively guide reinforcement learning (RL) agents in specific tasks.
However, extrinsic rewards frequently fall short in complex environments due to the …

Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-directed Molecular Generation

J Park, J Ahn, J Choi, J Kim - arxiv preprint arxiv:2403.20109, 2024 - arxiv.org
Optimizing techniques for discovering molecular structures with desired properties is crucial
in artificial intelligence (AI)-based drug discovery. Combining deep generative models with …

A Novel State Space Exploration Method for the Sparse-Reward Reinforcement Learning Environment

X Liu, L Ma, Z Chen, C Zheng, R Chen, Y Liao… - … and Applications of …, 2023 - Springer
Sparse-reward reinforcement learning environments pose a particular challenge because
the agent receives infrequent rewards, making it difficult to learn an optimal policy. In this …

A Collaborative Perspective on Exploration in Reinforcement Learning

Y Fu, H Zhang, D Wu, W Xu, B Boulet - openreview.net
Exploration is one of the central topic in reinforcement learning (RL). Many existing
approaches take a single agent perspective when tackling this problem. In this work, we …