- Academic Search

M Gargiani, A Zanelli… - IEEE Control …, 2022 - ieeexplore.ieee.org

Policy iteration and value iteration are at the core of many (approximate) dynamic
programming methods. For Markov Decision Processes with finite state and action spaces …

Opslaan Citeren Geciteerd door 9 Verwante artikelen Alle 4 versies

[Free GPT-4]

[PDF] arxiv.org

Dynamic programming through the lens of semismooth Newton-type methods (extended version)

M Gargiani, A Zanelli, D Liao-McPherson… - arxiv preprint arxiv …, 2022 - arxiv.org

Policy iteration and value iteration are at the core of many (approximate) dynamic
programming methods. For Markov Decision Processes with finite state and action spaces …

Opslaan Citeren Geciteerd door 3 Verwante artikelen Alle 3 versies HTML-versie

[Free GPT-4]

[PDF] arxiv.org

Inexact GMRES policy iteration for large-scale Markov decision processes

M Gargiani, D Liao-McPherson, A Zanelli, J Lygeros - IFAC-PapersOnLine, 2023 - Elsevier

Policy iteration enjoys a local quadratic rate of contraction, but its iterations are
computationally expensive for Markov decision processes (MDPs) with a large number of …

Opslaan Citeren Geciteerd door 1 Verwante artikelen Alle 4 versies

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Parallel and flexible dynamic programming via the randomized mini-batch operator

Dynamic programming through the lens of semismooth Newton-type methods

Dynamic programming through the lens of semismooth Newton-type methods (extended version)

Inexact GMRES policy iteration for large-scale Markov decision processes