- Academic Search

L Huang, B Dong, W **e… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Offline reinforcement learning (offline RL) aims to find task-solving policies from prerecorded
datasets without online environment interaction. It is unfortunate that extrapolation errors can …

Save Cite Cited by 6 Related articles All 3 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] jair.org Full View

Efficient multi-goal reinforcement learning via value consistency prioritization

J Xu, S Li, R Yang, C Yuan, L Han - Journal of Artificial Intelligence …, 2023 - jair.org

Goal-conditioned reinforcement learning (RL) with sparse rewards remains a challenging
problem in deep RL. Hindsight Experience Replay (HER) has been demonstrated to be an …

Save Cite Cited by 4 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Offline reinforcement learning with imbalanced datasets

L Jiang, S Cheng, J Qiu, H Xu, WK Chan… - arxiv preprint arxiv …, 2023 - arxiv.org

The prevalent use of benchmarks in current offline reinforcement learning (RL) research has
led to a neglect of the imbalance of real-world dataset distributions in the development of …

Save Cite Cited by 4 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Are Expressive Models Truly Necessary for Offline RL?

G Wang, H Niu, J Li, L Jiang, J Hu, X Zhan - arxiv preprint arxiv …, 2024 - arxiv.org

Among various branches of offline reinforcement learning (RL) methods, goal-conditioned
supervised learning (GCSL) has gained increasing popularity as it formulates the offline RL …

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcing Competitive Multi-Agents for Playing So Long Sucker

M Sharan, C Adak - arxiv preprint arxiv:2411.11057, 2024 - arxiv.org

This paper examines the use of classical deep reinforcement learning (DRL) algorithms,
DQN, DDQN, and Dueling DQN, in the strategy game So Long Sucker (SLS), a diplomacy …

Create alert

Cite

Advanced search

Saved to My library

Curriculum goal-conditioned imitation for offline reinforcement learning

Offline Reinforcement Learning With Behavior Value Regularization

Efficient multi-goal reinforcement learning via value consistency prioritization

Offline reinforcement learning with imbalanced datasets

Are Expressive Models Truly Necessary for Offline RL?

Reinforcing Competitive Multi-Agents for Playing So Long Sucker