Topological value iteration algorithms

P Dai, DS Weld, J Goldsmith - Journal of Artificial Intelligence Research, 2011 - jair.org
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …

GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning

A Thakur, P Svec, SK Gupta - Robotics and Autonomous Systems, 2012 - Elsevier
This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …

Online planning for large markov decision processes with hierarchical decomposition

A Bai, F Wu, X Chen - ACM Transactions on Intelligent Systems and …, 2015 - dl.acm.org
Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …

[HTML][HTML] Real-time dynamic programming for Markov decision processes with imprecise probabilities

KV Delgado, LN De Barros, DB Dias, S Sanner - Artificial Intelligence, 2016 - Elsevier
Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …

Power flow management in electric vehicles charging station using reinforcement learning

AO Erick, KA Folly - 2020 IEEE Congress on Evolutionary …, 2020 - ieeexplore.ieee.org
This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …

Continuous search in constraint programming

A Arbelaez, Y Hamadi, M Sebag - 2010 22nd IEEE International …, 2010 - ieeexplore.ieee.org
This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …

Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances

P Svec, M Schwartz, A Thakur… - 2011 IEEE/RSJ …, 2011 - ieeexplore.ieee.org
We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …

Lookahead-bounded q-learning

I El Shar, D Jiang - International Conference on Machine …, 2020 - proceedings.mlr.press
We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …

Efficient Constraint Generation for Stochastic Shortest Path Problems

J Schmalz, F Trevizan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …

Motion Planning for The Estimation of Functions

A Raghavan, G Sartori… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
We consider the problem of estimation of an unknown real valued function with real valued
input by an agent. The agent exists in 3D Euclidean space. It is able to traverse in a 2D …