Offline Reinforcement Learning With Behavior Value Regularization

L Huang, B Dong, W **e… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Offline reinforcement learning (offline RL) aims to find task-solving policies from prerecorded
datasets without online environment interaction. It is unfortunate that extrapolation errors can …

Efficient multi-goal reinforcement learning via value consistency prioritization

J Xu, S Li, R Yang, C Yuan, L Han - Journal of Artificial Intelligence …, 2023 - jair.org
Goal-conditioned reinforcement learning (RL) with sparse rewards remains a challenging
problem in deep RL. Hindsight Experience Replay (HER) has been demonstrated to be an …

Offline reinforcement learning with imbalanced datasets

L Jiang, S Cheng, J Qiu, H Xu, WK Chan… - arxiv preprint arxiv …, 2023 - arxiv.org
The prevalent use of benchmarks in current offline reinforcement learning (RL) research has
led to a neglect of the imbalance of real-world dataset distributions in the development of …

Are Expressive Models Truly Necessary for Offline RL?

G Wang, H Niu, J Li, L Jiang, J Hu, X Zhan - arxiv preprint arxiv …, 2024 - arxiv.org
Among various branches of offline reinforcement learning (RL) methods, goal-conditioned
supervised learning (GCSL) has gained increasing popularity as it formulates the offline RL …

Reinforcing Competitive Multi-Agents for Playing So Long Sucker

M Sharan, C Adak - arxiv preprint arxiv:2411.11057, 2024 - arxiv.org
This paper examines the use of classical deep reinforcement learning (DRL) algorithms,
DQN, DDQN, and Dueling DQN, in the strategy game So Long Sucker (SLS), a diplomacy …