Data-driven methods for building control—A review and promising future directions

ET Maddalena, Y Lian, CN Jones - Control Engineering Practice, 2020 - Elsevier
A review of the heating, ventilation and air-conditioning control problem for buildings is
presented with particular emphasis on its distinguishing features. Next, we not only examine …

Safe model-based reinforcement learning with stability guarantees

F Berkenkamp, M Turchetta… - Advances in neural …, 2017 - proceedings.neurips.cc
Reinforcement learning is a powerful paradigm for learning optimal policies from
experimental data. However, to find optimal policies, most reinforcement learning algorithms …

Safe controller optimization for quadrotors with Gaussian processes

F Berkenkamp, AP Schoellig… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
One of the most fundamental problems when designing controllers for dynamic systems is
the tuning of the controller parameters. Typically, a model of the system is used to obtain an …

Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics

F Berkenkamp, A Krause, AP Schoellig - Machine Learning, 2023 - Springer
Selecting the right tuning parameters for algorithms is a pravelent problem in machine
learning that can significantly affect the performance of algorithms. Data-efficient …

Safe learning and optimization techniques: Towards a survey of the state of the art

Y Kim, R Allmendinger, M López-Ibáñez - International Workshop on the …, 2020 - Springer
Safe learning and optimization deals with learning and optimization problems that avoid, as
much as possible, the evaluation of non-safe input points, which are solutions, policies, or …

Active learning of dynamics for data-driven control using Koopman operators

I Abraham, TD Murphey - IEEE Transactions on Robotics, 2019 - ieeexplore.ieee.org
This paper presents an active learning strategy for robotic systems that takes into account
task information, enables fast learning, and allows control to be readily synthesized by …

Safe exploration in finite markov decision processes with gaussian processes

M Turchetta, F Berkenkamp… - Advances in neural …, 2016 - proceedings.neurips.cc
In classical reinforcement learning agents accept arbitrary short term loss for long term gain
when exploring their environment. This is infeasible for safety critical applications such as …

Stagewise safe bayesian optimization with gaussian processes

Y Sui, V Zhuang, J Burdick… - … conference on machine …, 2018 - proceedings.mlr.press
Enforcing safety is a key aspect of many problems pertaining to sequential decision making
under uncertainty, which require the decisions made at every step to be both informative of …

Learning compositional models of robot skills for task and motion planning

Z Wang, CR Garrett, LP Kaelbling… - … Journal of Robotics …, 2021 - journals.sagepub.com
The objective of this work is to augment the basic abilities of a robot by learning to use
sensorimotor primitives to solve complex long-horizon manipulation problems. This requires …

Automatic LQR tuning based on Gaussian process global optimization

A Marco, P Hennig, J Bohg, S Schaal… - … conference on robotics …, 2016 - ieeexplore.ieee.org
This paper proposes an automatic controller tuning framework based on linear optimal
control combined with Bayesian optimization. With this framework, an initial set of controller …