RMIX: Learning risk-sensitive policies for cooperative reinforcement learning agents
Current value-based multi-agent reinforcement learning methods optimize individual Q
values to guide individuals' behaviours via centralized training with decentralized execution …
values to guide individuals' behaviours via centralized training with decentralized execution …
Risk-adaptive approaches to stochastic optimization: A survey
JO Royset - SIAM Review, 2025 - SIAM
Uncertainty is prevalent in engineering design and data-driven problems and, more broadly,
in decision making. Due to inherent risk-averseness and ambiguity about assumptions, it is …
in decision making. Due to inherent risk-averseness and ambiguity about assumptions, it is …
Improving robustness via risk averse distributional reinforcement learning
One major obstacle that precludes the success of reinforcement learning in real-world
applications is the lack of robustness, either to model uncertainties or external disturbances …
applications is the lack of robustness, either to model uncertainties or external disturbances …
[PDF][PDF] Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions
Abstract Conditional Value-at-Risk (CVaR) is a widely used risk metric in applications such
as finance. We derive concentration bounds for CVaR estimates, considering separately the …
as finance. We derive concentration bounds for CVaR estimates, considering separately the …
Concentration of risk measures: A Wasserstein distance approach
Known finite-sample concentration bounds for the Wasserstein distance between the
empirical and true distribution of a random variable are used to derive a two-sided …
empirical and true distribution of a random variable are used to derive a two-sided …
[PDF][PDF] Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards.
Classical multi-armed bandit problems use the expected value of an arm as a metric to
evaluate its goodness. However, the expected value is a risk-neutral metric. In many …
evaluate its goodness. However, the expected value is a risk-neutral metric. In many …
Sentinel: taming uncertainty with ensemble based distributional reinforcement learning
In this paper, we consider risk-sensitive sequential decision-making in Reinforcement
Learning (RL). Our contributions are two-fold. First, we introduce a novel and coherent …
Learning (RL). Our contributions are two-fold. First, we introduce a novel and coherent …
Guarantees on robot system performance using stochastic simulation rollouts
JA Vincent, AO Feldman… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this article, we provide finite-sample performance guarantees for control policies executed
on stochastic robotic systems. Given an open-or closed-loop policy and a finite set of …
on stochastic robotic systems. Given an open-or closed-loop policy and a finite set of …
Pac-bayesian bound for the conditional value at risk
Abstract Conditional Value at Risk ($\textsc {CVaR} $) is a``coherent risk measure''which
generalizes expectation (reduced to a boundary parameter setting). Widely used in …
generalizes expectation (reduced to a boundary parameter setting). Widely used in …
Exploring the risk dynamics of US green energy stocks: A green time-varying beta approach
Green investments play a crucial role in fighting climate change and facilitating the shift
towards a low-carbon economy in line with goals of the Paris Agreement. This paper focuses …
towards a low-carbon economy in line with goals of the Paris Agreement. This paper focuses …