Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Task phasing: Automated curriculum learning from demonstrations

V Bajaj, G Sharon, P Stone - Proceedings of the International …, 2023 - ojs.aaai.org
Applying reinforcement learning (RL) to sparse reward domains is notoriously challenging
due to insufficient guiding signals. Common RL techniques for addressing such domains …

Entropy regularization methods for parameter space exploration

S Han, W Zhou, S Lü, S Zhu, X Gong - Information Sciences, 2023 - Elsevier
Entropy regularization is an important approach to improve exploration and enhance policy
stability for reinforcement learning. However, in previous study, entropy regularization is …

Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security

PG Ye, J Zheng, X Ren, J Huang, Z Zhang, Y Pang… - Information …, 2024 - Elsevier
The deployment of unmanned aerial vehicles (UAVs) in ultra-dense networks (UNDs) has
significantly advanced network capabilities in 5G/6G environments, addressing coverage …

Mnemonic Dictionary Learning for Intrinsic Motivation in Reinforcement Learning

R Yan, Z Wu, Y Zhan, P Tao, Z Wang… - … Joint Conference on …, 2023 - ieeexplore.ieee.org
Reinforcement learning for hard-exploration tasks remains challenging due to the long-term
dependence and sparse-and-delay rewards in complex environments. In these challenging …

Autoencoder Reconstruction Model for Long-Horizon Exploration

R Yan, Y Wu, Y Gan, Y Yang, Z Yu, Z Liu… - … Joint Conference on …, 2024 - ieeexplore.ieee.org
Conventional reinforcement learning (RL) algorithms often necessitate millions of
environment interactions to ascertain an efficacious policy. In stark contrast, humans …

[HTML][HTML] Generative subgoal oriented multi-agent reinforcement learning through potential field

S Li, H Jiang, Y Liu, J Zhang, X Xu, D Liu - Neural Networks, 2024 - Elsevier
Multi-agent reinforcement learning (MARL) effectively improves the learning speed of agents
in sparse reward tasks with the guide of subgoals. However, existing works sever the …