On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer
In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

[BUCH][B] Markov decision processes in artificial intelligence

O Sigaud, O Buffet - 2013 - books.google.com
Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential
decision problems under uncertainty as well as reinforcement learning problems. Written by …

Verification of Markov decision processes using learning algorithms

T Brázdil, K Chatterjee, M Chmelik, V Forejt… - … for Verification and …, 2014 - Springer
We present a general framework for applying machine-learning algorithms to the verification
of Markov decision processes (MDPs). The primary goal of these techniques is to improve …

[HTML][HTML] Real-time energy management of photovoltaic-assisted electric vehicle charging station by markov decision process

Y Wu, J Zhang, A Ravey, D Chrenko, A Miraoui - Journal of Power Sources, 2020 - Elsevier
With the rapid development of electric vehicles (EVs), the dramatic rise in the demand for
electricity is creating heavy pressure on local grids. The combination of renewable energy …

Too many cooks: Bayesian inference for coordinating multi‐agent collaboration

SA Wu, RE Wang, JA Evans… - Topics in Cognitive …, 2021 - Wiley Online Library
Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating
to solve a single task together and other times dividing it up into sub‐tasks to work on in …

Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art

M Steinmetz, J Hoffmann, O Buffet - Journal of Artificial Intelligence …, 2016 - jair.org
Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …

Automated aerial suspended cargo delivery through reinforcement learning

A Faust, I Palunko, P Cruz, R Fierro, L Tapia - Artificial Intelligence, 2017 - Elsevier
Cargo-bearing unmanned aerial vehicles (UAVs) have tremendous potential to assist
humans by delivering food, medicine, and other supplies. For time-critical cargo delivery …

Learning swing-free trajectories for UAVs with a suspended load

A Faust, I Palunko, P Cruz, R Fierro… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Attaining autonomous flight is an important task in aerial robotics. Often flight trajectories are
not only subject to unknown system dynamics, but also to specific task constraints. This …

Tight regret bounds for model-based reinforcement learning with greedy policies

Y Efroni, N Merlis, M Ghavamzadeh… - Advances in Neural …, 2019 - proceedings.neurips.cc
State-of-the-art efficient model-based Reinforcement Learning (RL) algorithms typically act
by iteratively solving empirical models, ie, by performing full-planning on Markov Decision …

A practitioner's guide to MDP model checking algorithms

A Hartmanns, S Junges, T Quatmann… - … Conference on Tools …, 2023 - Springer
Abstract Model checking undiscounted reachability and expected-reward properties on
Markov decision processes (MDPs) is key for the verification of systems that act under …