Reinforcement learning for control: Performance, stability, and deep approximators

L Buşoniu, T De Bruin, D Tolić, J Kober… - Annual Reviews in …, 2018 - Elsevier
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of
systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain …

Closed-loop and activity-guided optogenetic control

L Grosenick, JH Marshel, K Deisseroth - Neuron, 2015 - cell.com
Advances in optical manipulation and observation of neural activity have set the stage for
widespread implementation of closed-loop and activity-guided optical control of neural …

High-speed finite control set model predictive control for power electronics

B Stellato, T Geyer, PJ Goulart - IEEE Transactions on power …, 2016 - ieeexplore.ieee.org
Common approaches for direct model predictive control (MPC) for current reference tracking
in power electronics suffer from the high computational complexity encountered when …

Global adaptive dynamic programming for continuous-time nonlinear systems

Y Jiang, ZP Jiang - IEEE Transactions on Automatic Control, 2015 - ieeexplore.ieee.org
This paper presents a novel method of global adaptive dynamic programming (ADP) for the
adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing …

Learning convex optimization control policies

A Agrawal, S Barratt, S Boyd… - Learning for Dynamics …, 2020 - proceedings.mlr.press
Many control policies used in applications compute the input or action by solving a convex
optimization problem that depends on the current state and some parameters. Common …

Embedded optimization methods for industrial automatic control

HJ Ferreau, S Almér, R Verschueren, M Diehl, D Frick… - IFAC-PapersOnLine, 2017 - Elsevier
Starting in the late 1970s, optimization-based control has built up an impressive track record
of successful industrial applications, in particular in the petrochemical and process …

Distributed optimization for robot networks: From real-time convex optimization to game-theoretic self-organization

H Jaleel, JS Shamma - Proceedings of the IEEE, 2020 - ieeexplore.ieee.org
Recent advances in sensing, communication, and computing technologies have enabled
the use of multirobot systems for practical applications such as surveillance, area map** …

Performance bounds and suboptimal policies for multi–period investment

S Boyd, MT Mueller, B O'Donoghue… - … and Trends® in …, 2013 - nowpublishers.com
We consider dynamic trading of a portfolio of assets in discrete periods over a finite time
horizon, with arbitrary time-varying distribution of asset returns. The goal is to maximize the …

Robust optimization for MPC

B Houska, ME Villanueva - Handbook of model predictive control, 2019 - Springer
This chapter aims to give a concise overview of numerical methods and algorithms for
implementing robust model predictive control (MPC). We introduce the mathematical …

Approximate dynamic programming for stochastic reachability

N Kariotoglou, S Summers, T Summers… - 2013 European …, 2013 - ieeexplore.ieee.org
In this work we illustrate how approximate dynamic programing can be utilized to address
problems of stochastic reachability in infinite state and control spaces. In particular we focus …