Sqil: Imitation learning via reinforcement learning with sparse rewards

S Reddy, AD Dragan, S Levine - arxiv preprint arxiv:1905.11108, 2019 - arxiv.org
Learning to imitate expert behavior from demonstrations can be challenging, especially in
environments with high-dimensional, continuous observations and unknown dynamics …

Approximate dynamic programming for ambulance redeployment

MS Maxwell, M Restrepo… - INFORMS Journal …, 2010 - pubsonline.informs.org
We present an approximate dynamic programming approach for making ambulance
redeployment decisions in an emergency medical service system. The primary decision is …

Application of predictive control strategies to the management of complex networks in the urban water cycle [applications of control]

C Ocampo-Martinez, V Puig… - IEEE Control …, 2013 - ieeexplore.ieee.org
The management of the urban water cycle (UWC) is a subject of increasing interest because
of its social, economic, and environmental impact. The most important issues include the …

Learning Tetris using the noisy cross-entropy method

I Szita, A Lörincz - Neural computation, 2006 - ieeexplore.ieee.org
The cross-entropy method is an efficient and general optimization algorithm. However, its
applicability in reinforcement learning (RL) seems to be limited because it often converges …

Where do you think you're going?: Inferring beliefs about dynamics from behavior

S Reddy, A Dragan, S Levine - Advances in Neural …, 2018 - proceedings.neurips.cc
Inferring intent from observed behavior has been studied extensively within the frameworks
of Bayesian inverse planning and inverse reinforcement learning. These methods infer a …

Large-scale unit commitment under uncertainty

M Tahanan, W van Ackooij, A Frangioni, F Lacalandra - 4or, 2015 - Springer
Abstract The Unit Commitment problem in energy management aims at finding the optimal
productions schedule of a set of generation units while meeting various system-wide …

Robust learning-based MPC for nonlinear constrained systems

JM Manzano, D Limon, DM de la Peña, JP Calliess - Automatica, 2020 - Elsevier
This paper presents a robust learning-based predictive control strategy for nonlinear
systems subject to both input and output constraints, under the assumption that the model …

Research on probabilistic methods for control system design

GC Calafiore, F Dabbene, R Tempo - Automatica, 2011 - Elsevier
A novel approach based on probability and randomization has emerged to synergize with
the standard deterministic methods for control of systems with uncertainty. The main …

Chance-constrained model predictive control for drinking water networks

JM Grosso, C Ocampo-Martínez, V Puig… - Journal of process …, 2014 - Elsevier
This paper addresses a chance-constrained model predictive control (CC-MPC) strategy for
the management of drinking water networks (DWNs) based on a finite horizon stochastic …

Cutting-set methods for robust convex optimization with pessimizing oracles

A Mutapcic, S Boyd - Optimization Methods & Software, 2009 - Taylor & Francis
We consider a general worst-case robust convex optimization problem, with arbitrary
dependence on the uncertain parameters, which are assumed to lie in some given set of …