Google Acadèmic

Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcem...

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Latent Landmark Graph for Efficient Exploration-exploitation Balance in Hierarchical Reinforcement Learning

Q Zhang, H Zhang, D **ng, B Xu - Machine Intelligence Research, 2025 - Springer

Goal-conditioned hierarchical reinforcement learning (GCHRL) decomposes the desired
goal into subgoals and conducts exploration and exploitation in the subgoal space. Its …

Desa Cita Articles relacionats Totes les 3 versions Free GPT-4 DeepSeek

Decoding BatchNorm statistics via anchors pool for data-free models based on continual learning

X Li, W Wang, G Xu - Neural Computing and Applications, 2024 - Springer

Generating high-quality samples reversely from existing models is a significant technique in
continual learning and knowledge distillation. Existing approaches either fail to generate …

Desa Cita Articles relacionats

Trajectory Progress-Based Prioritizing and Intrinsic Reward Mechanism for Robust Training of Robotic Manipulations

W Liang, Y Liu, J Wang, ZX Yang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Training robots by model-free deep reinforcement learning (DRL) to carry out robotic
manipulation tasks without sufficient successful experiences is challenging. Hindsight …

Desa Cita Articles relacionats

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Natural Mitigation of Catastrophic Interference: Continual Learning in Power-Law Learning Environments

A Gandhi, RS Shah, V Marupudi… - arxiv preprint arxiv …, 2024 - ebooks.iospress.nl

Neural networks often suffer from catastrophic interference (CI): performance on previously
learned tasks drops off significantly when learning a new task. This contrasts strongly with …

Desa Cita Articles relacionats Totes les 2 versions Free GPT-4 DeepSeek

Crea una alerta

Cita

Cerca avançada

S'ha desat a La meva biblioteca

Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcem...

Latent Landmark Graph for Efficient Exploration-exploitation Balance in Hierarchical Reinforcement Learning

Decoding BatchNorm statistics via anchors pool for data-free models based on continual learning

Trajectory Progress-Based Prioritizing and Intrinsic Reward Mechanism for Robust Training of Robotic Manipulations

Natural Mitigation of Catastrophic Interference: Continual Learning in Power-Law Learning Environments