[PDF][PDF] Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions

LA Prashanth, K Jagannathan… - Proceedings of the 37th …, 2020 - proceedings.mlr.press
Abstract Conditional Value-at-Risk (CVaR) is a widely used risk metric in applications such
as finance. We derive concentration bounds for CVaR estimates, considering separately the …

Near-minimax-optimal distributional reinforcement learning with a generative model

M Rowland, K Li, R Munos, C Lyle… - Advances in Neural …, 2025 - proceedings.neurips.cc
We propose a new algorithm for model-based distributional reinforcement learning (RL),
and prove that it is minimax-optimal for approximating return distributions in the generative …

Stochastic optimization for spectral risk measures

R Mehta, V Roulet, K Pillutla, L Liu… - International …, 2023 - proceedings.mlr.press
Spectral risk objectives–also called L-risks–allow for learning systems to interpolate
between optimizing average-case performance (as in empirical risk minimization) and worst …

Risk-sensitive reinforcement learning via policy gradient search

LA Prashanth, MC Fu - Foundations and Trends® in Machine …, 2022 - nowpublishers.com
The objective in a traditional reinforcement learning (RL) problem is to find a policy that
optimizes the expected value of a performance metric such as the infinite-horizon cumulative …

Learning by minimizing the sum of ranked range

S Hu, Y Ying, S Lyu - Advances in Neural Information …, 2020 - proceedings.neurips.cc
In forming learning objectives, one oftentimes needs to aggregate a set of individual values
to a single output. Such cases occur in the aggregate loss, which combines individual losses …

Sum of ranked range loss for supervised learning

S Hu, Y Ying, X Wang, S Lyu - Journal of Machine Learning Research, 2022 - jmlr.org
In forming learning objectives, one oftentimes needs to aggregate a set of individual values
to a single output. Such cases occur in the aggregate loss, which combines individual losses …

Supervised learning with general risk functionals

L Leqi, A Huang, Z Lipton… - … on Machine Learning, 2022 - proceedings.mlr.press
Standard uniform convergence results bound the generalization gap of the expected loss
over a hypothesis class. The emergence of risk-sensitive learning requires generalization …

STL robustness risk over discrete-time stochastic processes

L Lindemann, N Matni… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org
We present a framework to interpret signal temporal logic (STL) formulas over discrete-time
stochastic processes in terms of the induced risk. Each realization of a stochastic process …

Pac-bayesian bound for the conditional value at risk

Z Mhammedi, B Guedj… - Advances in Neural …, 2020 - proceedings.neurips.cc
Abstract Conditional Value at Risk ($\textsc {CVaR} $) is a``coherent risk measure''which
generalizes expectation (reduced to a boundary parameter setting). Widely used in …

Risk of stochastic systems for temporal logic specifications

L Lindemann, L Jiang, N Matni, GJ Pappas - ACM Transactions on …, 2023 - dl.acm.org
The wide availability of data coupled with the computational advances in artificial
intelligence and machine learning promise to enable many future technologies such as …