RMIX: Learning risk-sensitive policies for cooperative reinforcement learning agents

W Qiu, X Wang, R Yu, R Wang, X He… - Advances in …, 2021 - proceedings.neurips.cc
Current value-based multi-agent reinforcement learning methods optimize individual Q
values to guide individuals' behaviours via centralized training with decentralized execution …

Risk-adaptive approaches to stochastic optimization: A survey

JO Royset - SIAM Review, 2025 - SIAM
Uncertainty is prevalent in engineering design and data-driven problems and, more broadly,
in decision making. Due to inherent risk-averseness and ambiguity about assumptions, it is …

Improving robustness via risk averse distributional reinforcement learning

R Singh, Q Zhang, Y Chen - Learning for Dynamics and …, 2020 - proceedings.mlr.press
One major obstacle that precludes the success of reinforcement learning in real-world
applications is the lack of robustness, either to model uncertainties or external disturbances …

[PDF][PDF] Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions

LA Prashanth, K Jagannathan… - Proceedings of the 37th …, 2020 - proceedings.mlr.press
Abstract Conditional Value-at-Risk (CVaR) is a widely used risk metric in applications such
as finance. We derive concentration bounds for CVaR estimates, considering separately the …

Concentration of risk measures: A Wasserstein distance approach

SP Bhat, P LA - Advances in neural information processing …, 2019 - proceedings.neurips.cc
Known finite-sample concentration bounds for the Wasserstein distance between the
empirical and true distribution of a random variable are used to derive a two-sided …

[PDF][PDF] Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards.

A Kagrecha, J Nair, KP Jagannathan - NeurIPS, 2019 - proceedings.neurips.cc
Classical multi-armed bandit problems use the expected value of an arm as a metric to
evaluate its goodness. However, the expected value is a risk-neutral metric. In many …

Sentinel: taming uncertainty with ensemble based distributional reinforcement learning

H Eriksson, D Basu, M Alibeigi… - Uncertainty in …, 2022 - proceedings.mlr.press
In this paper, we consider risk-sensitive sequential decision-making in Reinforcement
Learning (RL). Our contributions are two-fold. First, we introduce a novel and coherent …

Guarantees on robot system performance using stochastic simulation rollouts

JA Vincent, AO Feldman… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this article, we provide finite-sample performance guarantees for control policies executed
on stochastic robotic systems. Given an open-or closed-loop policy and a finite set of …

Pac-bayesian bound for the conditional value at risk

Z Mhammedi, B Guedj… - Advances in Neural …, 2020 - proceedings.neurips.cc
Abstract Conditional Value at Risk ($\textsc {CVaR} $) is a``coherent risk measure''which
generalizes expectation (reduced to a boundary parameter setting). Widely used in …

Exploring the risk dynamics of US green energy stocks: A green time-varying beta approach

C Sen, G Chakrabarti - Energy Economics, 2024 - Elsevier
Green investments play a crucial role in fighting climate change and facilitating the shift
towards a low-carbon economy in line with goals of the Paris Agreement. This paper focuses …