- Academic Search

P Dai, DS Weld, J Goldsmith - Journal of Artificial Intelligence Research, 2011 - jair.org

Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …

Cited by: 86

GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning

A Thakur, P Svec, SK Gupta - Robotics and Autonomous Systems, 2012 - Elsevier

This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …

Cited by: 63

Online planning for large markov decision processes with hierarchical decomposition

A Bai, F Wu, X Chen - ACM Transactions on Intelligent Systems and …, 2015 - dl.acm.org

Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …

Cited by: 48

Real-time dynamic programming for Markov decision processes with imprecise probabilities

KV Delgado, LN De Barros, DB Dias, S Sanner - Artificial Intelligence, 2016 - Elsevier

Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …

Cited by: 40

Power flow management in electric vehicles charging station using reinforcement learning

AO Erick, KA Folly - 2020 IEEE Congress on Evolutionary …, 2020 - ieeexplore.ieee.org

This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …

Cited by: 21

Continuous search in constraint programming

A Arbelaez, Y Hamadi, M Sebag - 2010 22nd IEEE International …, 2010 - ieeexplore.ieee.org

This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …

Cited by: 45

Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances

P Svec, M Schwartz, A Thakur… - 2011 IEEE/RSJ …, 2011 - ieeexplore.ieee.org

We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …

Cited by: 48

Lookahead-bounded q-learning

I El Shar, D Jiang - International Conference on Machine …, 2020 - proceedings.mlr.press

We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …

Cited by: 14

Efficient Constraint Generation for Stochastic Shortest Path Problems

J Schmalz, F Trevizan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …

Cited by: 1

Motion Planning for The Estimation of Functions

A Raghavan, G Sartori… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org

We consider the problem of estimation of an unknown real valued function with real valued
input by an agent. The agent exists in 3D Euclidean space. It is able to traverse in a 2D …

Cited by: 2

Bayesian Real-Time Dynamic Programming.

Topological value iteration algorithms
