On the convergence of projective-simulation–based reinforcement learning in Markov decision processes
WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer
In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …
tasks has significantly increased. Many algorithms speeding up supervised and …
[BUCH][B] Markov decision processes in artificial intelligence
Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential
decision problems under uncertainty as well as reinforcement learning problems. Written by …
decision problems under uncertainty as well as reinforcement learning problems. Written by …
Verification of Markov decision processes using learning algorithms
We present a general framework for applying machine-learning algorithms to the verification
of Markov decision processes (MDPs). The primary goal of these techniques is to improve …
of Markov decision processes (MDPs). The primary goal of these techniques is to improve …
[HTML][HTML] Real-time energy management of photovoltaic-assisted electric vehicle charging station by markov decision process
With the rapid development of electric vehicles (EVs), the dramatic rise in the demand for
electricity is creating heavy pressure on local grids. The combination of renewable energy …
electricity is creating heavy pressure on local grids. The combination of renewable energy …
Too many cooks: Bayesian inference for coordinating multi‐agent collaboration
Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating
to solve a single task together and other times dividing it up into sub‐tasks to work on in …
to solve a single task together and other times dividing it up into sub‐tasks to work on in …
Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art
Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …
actions may fail or when operating under resource constraints. An important objective in …
Automated aerial suspended cargo delivery through reinforcement learning
Cargo-bearing unmanned aerial vehicles (UAVs) have tremendous potential to assist
humans by delivering food, medicine, and other supplies. For time-critical cargo delivery …
humans by delivering food, medicine, and other supplies. For time-critical cargo delivery …
Learning swing-free trajectories for UAVs with a suspended load
Attaining autonomous flight is an important task in aerial robotics. Often flight trajectories are
not only subject to unknown system dynamics, but also to specific task constraints. This …
not only subject to unknown system dynamics, but also to specific task constraints. This …
Tight regret bounds for model-based reinforcement learning with greedy policies
State-of-the-art efficient model-based Reinforcement Learning (RL) algorithms typically act
by iteratively solving empirical models, ie, by performing full-planning on Markov Decision …
by iteratively solving empirical models, ie, by performing full-planning on Markov Decision …
A practitioner's guide to MDP model checking algorithms
Abstract Model checking undiscounted reachability and expected-reward properties on
Markov decision processes (MDPs) is key for the verification of systems that act under …
Markov decision processes (MDPs) is key for the verification of systems that act under …