A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

RiskQ: risk-sensitive multi-agent reinforcement learning value factorization

S Shen, C Ma, C Li, W Liu, Y Fu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Multi-agent systems are characterized by environmental uncertainty, varying policies of
agents, and partial observability, which result in significant risks. In the context of Multi-Agent …

Pitfall of optimism: Distributional reinforcement learning by randomizing risk criterion

T Cho, S Han, H Lee, K Lee… - Advances in Neural …, 2023 - proceedings.neurips.cc
Distributional reinforcement learning algorithms have attempted to utilize estimated
uncertainty for exploration, such as optimism in the face of uncertainty. However, using the …

Improving Risk-averse Distributional Reinforcement Learning with Adaptive Risk-seeking Exploration

Z Wang, C Hu, X Zhang - IFAC-PapersOnLine, 2024 - Elsevier
In autonomous driving, reinforcement learning's interactive learning capabilities have
garnered interest. Among these, risk-averse distributional reinforcement learning has …

A Robust Quantile Huber Loss with Interpretable Parameter Adjustment in Distributional Reinforcement Learning

P Malekzadeh, KN Plataniotis… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Distributional Reinforcement Learning (RL) estimates return distribution mainly by learning
quantile values via minimizing the quantile Huber loss function, entailing a threshold …

Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics

S Hazra, P Dasgupta, S Dey - arxiv preprint arxiv:2501.12061, 2025 - arxiv.org
Multi-Agent Reinforcement Learning (MARL) has gained significant traction for solving
complex real-world tasks, but the inherent stochasticity and uncertainty in these …

Distributional Reinforcement Learning with Online Risk-awareness Adaption

Y Wu, W Huang - arxiv preprint arxiv:2310.05179, 2023 - arxiv.org
The use of reinforcement learning (RL) in practical applications requires considering sub-
optimal outcomes, which depend on the agent's familiarity with the uncertain environment …

Intrinsic exploration-motivation in cultural knowledge evolution

A Siebers - 2023 - inria.hal.science
In cultural knowledge evolution simulated by multi-agent simulations, agents can improve
the accuracy of their knowledge by interacting with other agents and adapting their …

Advancing Efficiency and Safety in Autonomous Sequential Decision Making

P Malekzadeh - 2024 - search.proquest.com
The advent of reinforcement learning (RL) has significantly transformed decision making in
autonomous systems. However, its practical deployment faces substantial obstacles, chiefly …

[PDF][PDF] DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption

Y Wu, W Huang, CP Ho - researchgate.net
One of the main challenges in reinforcement learning (RL) is that the agent has to make
decisions that would influence the future performance without having complete knowledge …