Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning

AD Kara, S Yuksel - arxiv preprint arxiv:2412.06735, 2024 - arxiv.org
In this review/tutorial article, we present recent progress on optimal control of partially
observed Markov Decision Processes (POMDPs). We first present regularity and continuity …

Optimal sequential vector quantization of Markov sources

VS Borkar, SK Mitter, S Tatikonda - SIAM journal on control and optimization, 2001 - SIAM
The problem of sequential vector quantization of a stationary Markov source is cast as an
equivalent stochastic control problem with partial observations. This problem is analyzed …

[BOEK][B] Finite Approximations in discrete-time stochastic control

N Saldi, T Linder, S Yüksel - 2018 - Springer
Control and optimization of dynamical systems in the presence of stochastic uncertainty is a
mature field with a large range of applications. A comprehensive treatment of such problems …

Average Cost Optimality of Partially Observed MDPs: Contraction of Nonlinear Filters and Existence of Optimal Solutions and Approximations

YE Demirci, AD Kara, S Yüksel - SIAM Journal on Control and Optimization, 2024 - SIAM
The average cost optimality is known to be a challenging problem for partially observable
stochastic control, with few results available beyond the finite state, action, and …

An optimal Markovian quantization algorithm for multi-dimensional stochastic control problems

G Pages, H Pham, J Printems - Stochastics and dynamics, 2004 - World Scientific
We propose a probabilistic numerical method based on optimal quantization to solve some
multi-dimensional stochastic control problems that arise, for example, in mathematical …

Markov decision processes with incomplete information and semiuniform feller transition probabilities

EA Feinberg, PO Kasyanov, MZ Zgurovsky - SIAM Journal on Control and …, 2022 - SIAM
This paper deals with control of partially observable discrete-time stochastic systems. It
introduces and studies Markov Decision Processes with Incomplete Information and with …

On near optimality of the set of finite-state controllers for average cost POMDP

H Yu, DP Bertsekas - Mathematics of Operations Research, 2008 - pubsonline.informs.org
We consider the average cost problem for partially observable Markov decision processes
(POMDP) with finite state, observation, and control spaces. We prove that there exists an ε …

Average cost dynamic programming equations for controlled Markov chains with partial observations

VS Borkar - SIAM Journal on Control and Optimization, 2000 - SIAM
The value function for the average cost control of a class of partially observed Markov chains
is derived as the" vanishing discount limit," in a suitable sense, of the value functions for the …

Partially observable risk-sensitive Markov decision processes

N Bäauerle, U Rieder - Mathematics of Operations …, 2017 - pubsonline.informs.org
We consider the problem of minimizing a certainty equivalent of the total or discounted cost
over a finite and an infinite time horizon that is generated by a partially observable Markov …

Equivalent conditions for weak continuity of nonlinear filters

EA Feinberg, PO Kasyanov - Systems & Control Letters, 2023 - Elsevier
This paper studies weak continuity of nonlinear filters. It is well-known that Borel
measurability of transition probabilities for problems with incomplete state observations is …