Offline Reinforcement Learning With Behavior Value Regularization
Offline reinforcement learning (offline RL) aims to find task-solving policies from prerecorded
datasets without online environment interaction. It is unfortunate that extrapolation errors can …
datasets without online environment interaction. It is unfortunate that extrapolation errors can …
Efficient multi-goal reinforcement learning via value consistency prioritization
Goal-conditioned reinforcement learning (RL) with sparse rewards remains a challenging
problem in deep RL. Hindsight Experience Replay (HER) has been demonstrated to be an …
problem in deep RL. Hindsight Experience Replay (HER) has been demonstrated to be an …
Offline reinforcement learning with imbalanced datasets
The prevalent use of benchmarks in current offline reinforcement learning (RL) research has
led to a neglect of the imbalance of real-world dataset distributions in the development of …
led to a neglect of the imbalance of real-world dataset distributions in the development of …
Are Expressive Models Truly Necessary for Offline RL?
Among various branches of offline reinforcement learning (RL) methods, goal-conditioned
supervised learning (GCSL) has gained increasing popularity as it formulates the offline RL …
supervised learning (GCSL) has gained increasing popularity as it formulates the offline RL …
Reinforcing Competitive Multi-Agents for Playing So Long Sucker
M Sharan, C Adak - arxiv preprint arxiv:2411.11057, 2024 - arxiv.org
This paper examines the use of classical deep reinforcement learning (DRL) algorithms,
DQN, DDQN, and Dueling DQN, in the strategy game So Long Sucker (SLS), a diplomacy …
DQN, DDQN, and Dueling DQN, in the strategy game So Long Sucker (SLS), a diplomacy …