Topological value iteration algorithms
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …
because it puts the majority of its effort into backing up the entire state space, which turns out …
GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning
This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …
Online planning for large markov decision processes with hierarchical decomposition
Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …
However, exactly solving a large MDP is usually intractable due to the “curse of …
[HTML][HTML] Real-time dynamic programming for Markov decision processes with imprecise probabilities
Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …
planning. However, when applied to many practical problems, the estimates of transition …
Power flow management in electric vehicles charging station using reinforcement learning
This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …
charging station. The charging station is powered by solar PV and is tied to the grid and a …
Continuous search in constraint programming
This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …
user to eventually get their constraint solver achieving a top performance on their problems …
Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances
We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …
Lookahead-bounded q-learning
We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …
convergent variant of Q-learning that seeks to improve the performance of standard Q …
Efficient Constraint Generation for Stochastic Shortest Path Problems
Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …
Motion Planning for The Estimation of Functions
A Raghavan, G Sartori… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
We consider the problem of estimation of an unknown real valued function with real valued
input by an agent. The agent exists in 3D Euclidean space. It is able to traverse in a 2D …
input by an agent. The agent exists in 3D Euclidean space. It is able to traverse in a 2D …