Policy mirror descent for reinforcement learning: Linear convergence, new sampling complexity, and generalized problem classes

G Lan - Mathematical programming, 2023 - Springer
We present new policy mirror descent (PMD) methods for solving reinforcement learning
(RL) problems with either strongly convex or general convex regularizers. By exploring the …

Learning with little mixing

I Ziemann, S Tu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We study square loss in a realizable time-series framework with martingale difference noise.
Our main result is a fast rate excess risk bound which shows that whenever a trajectory …

Safe, learning-based MPC for highway driving under lane-change uncertainty: A distributionally robust approach

M Schuurmans, A Katriniok, C Meissen, HE Tseng… - Artificial Intelligence, 2023 - Elsevier
We present a case study applying learning-based distributionally robust model predictive
control to highway motion planning under stochastic uncertainty of the lane change behavior …

Active learning for nonlinear system identification with guarantees

H Mania, MI Jordan, B Recht - Journal of Machine Learning Research, 2022 - jmlr.org
While the identification of nonlinear dynamical systems is a fundamental building block of
model-based reinforcement learning and feedback control, its sample complexity is only …

Estimating the mixing time of ergodic markov chains

G Wolfer, A Kontorovich - Conference on Learning Theory, 2019 - proceedings.mlr.press
We address the problem of estimating the mixing time $ t_ {\mathsf {mix}} $ of an arbitrary
ergodic finite Markov chain from a single trajectory of length $ m $. The reversible case was …

A general framework for learning-based distributionally robust MPC of Markov jump systems

M Schuurmans, P Patrinos - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org
In this article, we present a data-driven learning model predictive control (MPC) scheme for
chance-constrained Markov jump systems with unknown switching probabilities. Using …

Adaptive data analysis with correlated observations

A Kontorovich, M Sadigurschi… - … on Machine Learning, 2022 - proceedings.mlr.press
The vast majority of the work on adaptive data analysis focuses on the case where the
samples in the dataset are independent. Several approaches and tools have been …