Μελετητής Google

[PDF][PDF] Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions

LA Prashanth, K Jagannathan… - Proceedings of the 37th …, 2020 - proceedings.mlr.press

Abstract Conditional Value-at-Risk (CVaR) is a widely used risk metric in applications such
as finance. We derive concentration bounds for CVaR estimates, considering separately the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 67 Σχετικά άρθρα Όλες οι 8 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Near-minimax-optimal distributional reinforcement learning with a generative model

M Rowland, K Li, R Munos, C Lyle… - Advances in Neural …, 2025 - proceedings.neurips.cc

We propose a new algorithm for model-based distributional reinforcement learning (RL),
and prove that it is minimax-optimal for approximating return distributions in the generative …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 3 Σχετικά άρθρα Όλες οι 5 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Stochastic optimization for spectral risk measures

R Mehta, V Roulet, K Pillutla, L Liu… - International …, 2023 - proceedings.mlr.press

Spectral risk objectives–also called L-risks–allow for learning systems to interpolate
between optimizing average-case performance (as in empirical risk minimization) and worst …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 12 Σχετικά άρθρα Όλες οι 7 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Risk-sensitive reinforcement learning via policy gradient search

LA Prashanth, MC Fu - Foundations and Trends® in Machine …, 2022 - nowpublishers.com

The objective in a traditional reinforcement learning (RL) problem is to find a policy that
optimizes the expected value of a performance metric such as the infinite-horizon cumulative …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 44 Σχετικά άρθρα Όλες οι 8 εκδοχές Αναζήτηση βιβλιοθήκης Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning by minimizing the sum of ranked range

S Hu, Y Ying, S Lyu - Advances in Neural Information …, 2020 - proceedings.neurips.cc

In forming learning objectives, one oftentimes needs to aggregate a set of individual values
to a single output. Such cases occur in the aggregate loss, which combines individual losses …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 31 Σχετικά άρθρα Όλες οι 10 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

Sum of ranked range loss for supervised learning

S Hu, Y Ying, X Wang, S Lyu - Journal of Machine Learning Research, 2022 - jmlr.org

In forming learning objectives, one oftentimes needs to aggregate a set of individual values
to a single output. Such cases occur in the aggregate loss, which combines individual losses …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 24 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Supervised learning with general risk functionals

L Leqi, A Huang, Z Lipton… - … on Machine Learning, 2022 - proceedings.mlr.press

Standard uniform convergence results bound the generalization gap of the expected loss
over a hypothesis class. The emergence of risk-sensitive learning requires generalization …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 11 Σχετικά άρθρα Όλες οι 5 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

STL robustness risk over discrete-time stochastic processes

L Lindemann, N Matni… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org

We present a framework to interpret signal temporal logic (STL) formulas over discrete-time
stochastic processes in terms of the induced risk. Each realization of a stochastic process …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 22 Σχετικά άρθρα Όλες οι 7 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Pac-bayesian bound for the conditional value at risk

Z Mhammedi, B Guedj… - Advances in Neural …, 2020 - proceedings.neurips.cc

Abstract Conditional Value at Risk ($\textsc {CVaR} $) is a``coherent risk measure''which
generalizes expectation (reduced to a boundary parameter setting). Widely used in …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 25 Σχετικά άρθρα Όλες οι 13 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Risk of stochastic systems for temporal logic specifications

L Lindemann, L Jiang, N Matni, GJ Pappas - ACM Transactions on …, 2023 - dl.acm.org

The wide availability of data coupled with the computational advances in artificial
intelligence and machine learning promise to enable many future technologies such as …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 11 Σχετικά άρθρα Όλες οι 8 εκδοχές

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Concentration of risk measures: A Wasserstein distance approach

[PDF][PDF] Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions

Near-minimax-optimal distributional reinforcement learning with a generative model

Stochastic optimization for spectral risk measures

Risk-sensitive reinforcement learning via policy gradient search

Learning by minimizing the sum of ranked range

Sum of ranked range loss for supervised learning

Supervised learning with general risk functionals

STL robustness risk over discrete-time stochastic processes

Pac-bayesian bound for the conditional value at risk

Risk of stochastic systems for temporal logic specifications