Sqil: Imitation learning via reinforcement learning with sparse rewards
Learning to imitate expert behavior from demonstrations can be challenging, especially in
environments with high-dimensional, continuous observations and unknown dynamics …
environments with high-dimensional, continuous observations and unknown dynamics …
Approximate dynamic programming for ambulance redeployment
MS Maxwell, M Restrepo… - INFORMS Journal …, 2010 - pubsonline.informs.org
We present an approximate dynamic programming approach for making ambulance
redeployment decisions in an emergency medical service system. The primary decision is …
redeployment decisions in an emergency medical service system. The primary decision is …
Application of predictive control strategies to the management of complex networks in the urban water cycle [applications of control]
The management of the urban water cycle (UWC) is a subject of increasing interest because
of its social, economic, and environmental impact. The most important issues include the …
of its social, economic, and environmental impact. The most important issues include the …
Learning Tetris using the noisy cross-entropy method
The cross-entropy method is an efficient and general optimization algorithm. However, its
applicability in reinforcement learning (RL) seems to be limited because it often converges …
applicability in reinforcement learning (RL) seems to be limited because it often converges …
Where do you think you're going?: Inferring beliefs about dynamics from behavior
Inferring intent from observed behavior has been studied extensively within the frameworks
of Bayesian inverse planning and inverse reinforcement learning. These methods infer a …
of Bayesian inverse planning and inverse reinforcement learning. These methods infer a …
Large-scale unit commitment under uncertainty
Abstract The Unit Commitment problem in energy management aims at finding the optimal
productions schedule of a set of generation units while meeting various system-wide …
productions schedule of a set of generation units while meeting various system-wide …
Robust learning-based MPC for nonlinear constrained systems
This paper presents a robust learning-based predictive control strategy for nonlinear
systems subject to both input and output constraints, under the assumption that the model …
systems subject to both input and output constraints, under the assumption that the model …
Research on probabilistic methods for control system design
A novel approach based on probability and randomization has emerged to synergize with
the standard deterministic methods for control of systems with uncertainty. The main …
the standard deterministic methods for control of systems with uncertainty. The main …
Chance-constrained model predictive control for drinking water networks
This paper addresses a chance-constrained model predictive control (CC-MPC) strategy for
the management of drinking water networks (DWNs) based on a finite horizon stochastic …
the management of drinking water networks (DWNs) based on a finite horizon stochastic …
Cutting-set methods for robust convex optimization with pessimizing oracles
We consider a general worst-case robust convex optimization problem, with arbitrary
dependence on the uncertain parameters, which are assumed to lie in some given set of …
dependence on the uncertain parameters, which are assumed to lie in some given set of …