- Academic Search

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer

In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

Speichern Zitieren Zitiert von: 867 Ähnliche Artikel Alle 16 Versionen

[BUCH][B] Markov decision processes in artificial intelligence

O Sigaud, O Buffet - 2013 - books.google.com

Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential
decision problems under uncertainty as well as reinforcement learning problems. Written by …

Speichern Zitieren Zitiert von: 392 Ähnliche Artikel Alle 7 Versionen Bibliothekssuche

[Free GPT-4]

[PDF] arxiv.org

Verification of Markov decision processes using learning algorithms

T Brázdil, K Chatterjee, M Chmelik, V Forejt… - … for Verification and …, 2014 - Springer

We present a general framework for applying machine-learning algorithms to the verification
of Markov decision processes (MDPs). The primary goal of these techniques is to improve …

Speichern Zitieren Zitiert von: 258 Ähnliche Artikel Alle 17 Versionen

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Real-time energy management of photovoltaic-assisted electric vehicle charging station by markov decision process

Y Wu, J Zhang, A Ravey, D Chrenko, A Miraoui - Journal of Power Sources, 2020 - Elsevier

With the rapid development of electric vehicles (EVs), the dramatic rise in the demand for
electricity is creating heavy pressure on local grids. The combination of renewable energy …

Speichern Zitieren Zitiert von: 91 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[PDF] wiley.com Full View

Too many cooks: Bayesian inference for coordinating multi‐agent collaboration

SA Wu, RE Wang, JA Evans… - Topics in Cognitive …, 2021 - Wiley Online Library

Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating
to solve a single task together and other times dividing it up into sub‐tasks to work on in …

Speichern Zitieren Zitiert von: 122 Ähnliche Artikel Alle 12 Versionen

[Free GPT-4]

[PDF] jair.org

Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art

M Steinmetz, J Hoffmann, O Buffet - Journal of Artificial Intelligence …, 2016 - jair.org

Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …

Speichern Zitieren Zitiert von: 54 Ähnliche Artikel Alle 9 Versionen HTML-Version

[Free GPT-4]

[PDF] sciencedirect.com

Automated aerial suspended cargo delivery through reinforcement learning

A Faust, I Palunko, P Cruz, R Fierro, L Tapia - Artificial Intelligence, 2017 - Elsevier

Cargo-bearing unmanned aerial vehicles (UAVs) have tremendous potential to assist
humans by delivering food, medicine, and other supplies. For time-critical cargo delivery …

Speichern Zitieren Zitiert von: 189 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]

[PDF] academia.edu

Learning swing-free trajectories for UAVs with a suspended load

A Faust, I Palunko, P Cruz, R Fierro… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org

Attaining autonomous flight is an important task in aerial robotics. Often flight trajectories are
not only subject to unknown system dynamics, but also to specific task constraints. This …

Speichern Zitieren Zitiert von: 180 Ähnliche Artikel Alle 14 Versionen

[Free GPT-4]

[PDF] neurips.cc

Tight regret bounds for model-based reinforcement learning with greedy policies

Y Efroni, N Merlis, M Ghavamzadeh… - Advances in Neural …, 2019 - proceedings.neurips.cc

State-of-the-art efficient model-based Reinforcement Learning (RL) algorithms typically act
by iteratively solving empirical models, ie, by performing full-planning on Markov Decision …

Speichern Zitieren Zitiert von: 79 Ähnliche Artikel Alle 10 Versionen HTML-Version

[Free GPT-4]

[PDF] springer.com

A practitioner's guide to MDP model checking algorithms

A Hartmanns, S Junges, T Quatmann… - … Conference on Tools …, 2023 - Springer

Abstract Model checking undiscounted reachability and expected-reward properties on
Markov decision processes (MDPs) is key for the verification of systems that act under …

Speichern Zitieren Zitiert von: 27 Ähnliche Artikel Alle 8 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Bounded real-time dynamic programming: RTDP with monotone upper bounds and performance guarantees

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

[BUCH][B] Markov decision processes in artificial intelligence

Verification of Markov decision processes using learning algorithms

[HTML][HTML] Real-time energy management of photovoltaic-assisted electric vehicle charging station by markov decision process

Too many cooks: Bayesian inference for coordinating multi‐agent collaboration

Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art

Automated aerial suspended cargo delivery through reinforcement learning

Learning swing-free trajectories for UAVs with a suspended load

Tight regret bounds for model-based reinforcement learning with greedy policies

A practitioner's guide to MDP model checking algorithms