A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arxiv preprint arxiv:2211.06665, 2022‏ - arxiv.org
Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

Attention-guided contrastive role representations for multi-agent reinforcement learning

Z Hu, Z Zhang, H Li, C Chen, H Ding… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Real-world multi-agent tasks usually involve dynamic team composition with the emergence
of roles, which should also be a key to efficient cooperation in multi-agent reinforcement …

Long-short-view aware multi-agent reinforcement learning for signal snippet distillation in delirium movement detection

Q Pan, H Wang, J Lou, Y Zhang, B Ji, S Li - Information Sciences, 2024‏ - Elsevier
Automatic movement analysis utilizing surveillance video is believed to be an important and
convenient way for timely delirium detection in an Intensive Care Unit (ICU). However, video …

Interaction pattern disentangling for multi-agent reinforcement learning

S Liu, J Song, Y Zhou, N Yu, K Chen… - … on Pattern Analysis …, 2024‏ - ieeexplore.ieee.org
Deep cooperative multi-agent reinforcement learning has demonstrated its remarkable
success over a wide spectrum of complex control tasks. However, recent advances in multi …

Egocentric 3D Skeleton Learning in a Deep Neural Network Encodes Obese-like Motion Representations

J Kwon, M Sa, H Kim, Y Seong… - Experimental …, 2024‏ - pmc.ncbi.nlm.nih.gov
Obesity is a growing health concern, mainly caused by poor dietary habits. Yet, accurately
tracking the diet and food intake of individuals with obesity is challenging. Although 3D …

ISFORS-MIX: Multi-agent reinforcement learning with Importance-Sampling-Free Off-policy learning and Regularized-Softmax Mixing network

J Rao, C Wang, M Liu, J Lei, W Giernacki - Knowledge-Based Systems, 2025‏ - Elsevier
In multi-agent reinforcement learning (MARL), the low quality of value function and the
estimation bias and variance in value function decomposition (VFD) are critical challenges …

Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks

F Xu, S Liu, Y Qing, Y Zhou, Y Wang… - Proceedings of the 30th …, 2024‏ - dl.acm.org
Active Voltage Control (AVC) on the Power Distribution Networks (PDNs) aims to stabilize
the voltage levels to ensure efficient and reliable operation of power systems. With the …

CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

Z Zhang, W Meng, H Sun, G Pan - arxiv preprint arxiv:2502.00983, 2025‏ - arxiv.org
Context-based offline meta-reinforcement learning (OMRL) methods have achieved
appealing success by leveraging pre-collected offline datasets to develop task …

The Composite Task Challenge for Cooperative Multi-Agent Reinforcement Learning

Y Li, Y Chen, L Zhang, S Li, G Pan - arxiv preprint arxiv:2502.00345, 2025‏ - arxiv.org
The significant role of division of labor (DOL) in promoting cooperation is widely recognized
in real-world applications. Many cooperative multi-agent reinforcement learning (MARL) …

Heterogeneous Value Decomposition Policy Fusion for Multi-Agent Cooperation

S Wang, Y Zhou, Z Zhao, R Zhang, J Shao… - arxiv preprint arxiv …, 2025‏ - arxiv.org
Value decomposition (VD) has become one of the most prominent solutions in cooperative
multi-agent reinforcement learning. Most existing methods generally explore how to factorize …