Settling the sample complexity of model-based offline reinforcement learning
Settling the sample complexity of model-based offline reinforcement learning Page 1 The
Annals of Statistics 2024, Vol. 52, No. 1, 233–260 https://doi.org/10.1214/23-AOS2342 © …
Annals of Statistics 2024, Vol. 52, No. 1, 233–260 https://doi.org/10.1214/23-AOS2342 © …
Distributionally robust model-based offline reinforcement learning with near-optimal sample complexity
This paper concerns the central issues of model robustness and sample efficiency in offline
reinforcement learning (RL), which aims to learn to perform decision making from history …
reinforcement learning (RL), which aims to learn to perform decision making from history …
Distributionally robust off-dynamics reinforcement learning: Provable efficiency with linear function approximation
We study off-dynamics Reinforcement Learning (RL), where the policy is trained on a source
domain and deployed to a distinct target domain. We aim to solve this problem via online …
domain and deployed to a distinct target domain. We aim to solve this problem via online …
Seeing is not believing: Robust reinforcement learning against spurious correlation
Robustness has been extensively studied in reinforcement learning (RL) to handle various
forms of uncertainty such as random perturbations, rare events, and malicious attacks. In this …
forms of uncertainty such as random perturbations, rare events, and malicious attacks. In this …
Minimax optimal and computationally efficient algorithms for distributionally robust offline reinforcement learning
Distributionally robust offline reinforcement learning (RL), which seeks robust policy training
against environment perturbation by modeling dynamics uncertainty, calls for function …
against environment perturbation by modeling dynamics uncertainty, calls for function …
Settling the sample complexity of online reinforcement learning
A central issue lying at the heart of online reinforcement learning (RL) is data efficiency.
While a number of recent works achieved asymptotically minimal regret in online RL, the …
While a number of recent works achieved asymptotically minimal regret in online RL, the …
Sample-efficient robust multi-agent reinforcement learning in the face of environmental uncertainty
To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must
maintain robustness against environmental uncertainties. While robust RL has been widely …
maintain robustness against environmental uncertainties. While robust RL has been widely …
Sample complexity of offline distributionally robust linear markov decision processes
In offline reinforcement learning (RL), the absence of active exploration calls for attention on
the model robustness to tackle the sim-to-real gap, where the discrepancy between the …
the model robustness to tackle the sim-to-real gap, where the discrepancy between the …
Distributionally robust model-based reinforcement learning with large state spaces
Three major challenges in reinforcement learning are the complex dynamical systems with
large state spaces, the costly data acquisition processes, and the deviation of real-world …
large state spaces, the costly data acquisition processes, and the deviation of real-world …
Towards minimax optimality of model-based robust reinforcement learning
We study the sample complexity of obtaining an $\epsilon $-optimal policy in\emph {Robust}
discounted Markov Decision Processes (RMDPs), given only access to a generative model …
discounted Markov Decision Processes (RMDPs), given only access to a generative model …