Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Distributionally robust model-based reinforcement learning with large state spaces
Three major challenges in reinforcement learning are the complex dynamical systems with
large state spaces, the costly data acquisition processes, and the deviation of real-world …
large state spaces, the costly data acquisition processes, and the deviation of real-world …
Distributionally robust reinforcement learning with interactive data collection: Fundamental hardness and near-optimal algorithm
The sim-to-real gap, which represents the disparity between training and testing
environments, poses a significant challenge in reinforcement learning (RL). A promising …
environments, poses a significant challenge in reinforcement learning (RL). A promising …
[PDF][PDF] DRAGO: Primal-Dual Coupled Variance Reduction for Faster Distributionally Robust Optimization
We consider the penalized distributionally robust optimization (DRO) problem with a closed,
convex uncertainty set, a setting that encompasses learning using f-DRO and spectral/L-risk …
convex uncertainty set, a setting that encompasses learning using f-DRO and spectral/L-risk …
A Primal-Dual Algorithm for Faster Distributionally Robust Optimization
We consider the penalized distributionally robust optimization (DRO) problem with a closed,
convex uncertainty set, a setting that encompasses the $ f $-DRO, Wasserstein-DRO, and …
convex uncertainty set, a setting that encompasses the $ f $-DRO, Wasserstein-DRO, and …
Learning a Single Neuron Robustly to Distributional Shifts and Adversarial Label Noise
We study the problem of learning a single neuron with respect to the $ L_2^ 2$-loss in the
presence of adversarial distribution shifts, where the labels can be arbitrary, and the goal is …
presence of adversarial distribution shifts, where the labels can be arbitrary, and the goal is …
Relative-Translation Invariant Wasserstein Distance
We introduce a new family of distances, relative-translation invariant Wasserstein distances
($ RW_p $), for measuring the similarity of two probability distributions under distribution …
($ RW_p $), for measuring the similarity of two probability distributions under distribution …
Policy Gradient for Robust Markov Decision Processes
We develop a generic policy gradient method with the global optimality guarantee for robust
Markov Decision Processes (MDPs). While policy gradient methods are widely used for …
Markov Decision Processes (MDPs). While policy gradient methods are widely used for …
Wasserstein Distributionally Robust Control and State Estimation for Partially Observable Linear Systems
This paper presents a novel Wasserstein distributionally robust control and state estimation
algorithm for partially observable linear stochastic systems, where the probability …
algorithm for partially observable linear stochastic systems, where the probability …
Distributionally Robust Safety Verification for Markov Decision Processes
In this paper, we propose a distributionally robust safety verification method for Markov
decision processes where only an ambiguous transition kernel is available instead of the …
decision processes where only an ambiguous transition kernel is available instead of the …
Dynamic Programs on Partially Ordered Sets
TJ Sargent, J Stachurski - arxiv preprint arxiv:2308.02148, 2023 - arxiv.org
We introduce a framework that represents a dynamic program as a family of operators acting
on a partially ordered set. We provide an optimality theory based only on order-theoretic …
on a partially ordered set. We provide an optimality theory based only on order-theoretic …