When to stop value iteration: stability and near-optimality versus computation

M Granzotto, R Postoyan, D Nešić… - … for Dynamics and …, 2021 - proceedings.mlr.press
Value iteration (VI) is a ubiquitous algorithm for optimal control, planning, and reinforcement
learning schemes. Under the right assumptions, VI is a vital tool to generate inputs with …