Decision-theoretic planning: Structural assumptions and computational leverage
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …
decision making, and has been addressed by researchers in many different fields, including …
[PDF][PDF] Point-based value iteration: An anytime algorithm for POMDPs
(PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution
by selecting a small set of representative belief points and then tracking the value and its …
by selecting a small set of representative belief points and then tracking the value and its …
Optimization methods to solve adaptive management problems
Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …
missing. However, urgency and limited resources require that decisions must be made …
Perseus: Randomized point-based value iteration for POMDPs
Partially observable Markov decision processes (POMDPs) form an attractive and principled
framework for agent planning under uncertainty. Point-based approximate techniques for …
framework for agent planning under uncertainty. Point-based approximate techniques for …
Value-function approximations for partially observable Markov decision processes
M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org
Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …
mathematical framework for modeling complex decision and planning problems in …
Heuristic search value iteration for POMDPs
We present a novel POMDP planning algorithm called heuristic search value iteration
(HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret …
(HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret …
The hidden information state model: A practical framework for POMDP-based spoken dialogue management
This paper explains how Partially Observable Markov Decision Processes (POMDPs) can
provide a principled mathematical framework for modelling the inherent uncertainty in …
provide a principled mathematical framework for modelling the inherent uncertainty in …
The Cog project: Building a humanoid robot
To explore issues of developmental structure, physical embodiment, integration of multiple
sensory and motor systems, and social interaction, we have constructed an upper-torso …
sensory and motor systems, and social interaction, we have constructed an upper-torso …
Partially observable Markov decision processes
MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer
For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …
signal, methods based on the Markov decision process (MDP) have had many successes. In …
Anytime point-based approximations for large POMDPs
Abstract The Partially Observable Markov Decision Process has long been recognized as a
rich framework for real-world planning and control problems, especially in robotics. However …
rich framework for real-world planning and control problems, especially in robotics. However …