Taking the human out of the loop: A review of Bayesian optimization

B Shahriari, K Swersky, Z Wang… - Proceedings of the …, 2015 - ieeexplore.ieee.org
Big Data applications are typically associated with systems involving large numbers of
users, massive complex software systems, and large-scale heterogeneous computing and …

A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning

E Brochu, VM Cora, N De Freitas - arxiv preprint arxiv:1012.2599, 2010 - arxiv.org
We present a tutorial on Bayesian optimization, a method of finding the maximum of
expensive cost functions. Bayesian optimization employs the Bayesian technique of setting …

Data-driven method to learning personalized individual semantics to support linguistic multi-attribute decision making

CC Li, Y Dong, H Liang, W Pedrycz, F Herrera - Omega, 2022 - Elsevier
In parallel with the development of information and network technology, large amounts of
data are being generated by the Internet, and data-driven methodologies are now often …

Interactive machine learning for health informatics: when do we need the human-in-the-loop?

A Holzinger - Brain informatics, 2016 - Springer
Abstract Machine learning (ML) is the fastest growing field in computer science, and health
informatics is among the greatest challenges. The goal of ML is to develop algorithms which …

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

RR Griffiths, JM Hernández-Lobato - Chemical science, 2020 - pubs.rsc.org
Automatic Chemical Design is a framework for generating novel molecules with optimized
properties. The original scheme, featuring Bayesian optimization over the latent space of a …

Predictive entropy search for efficient global optimization of black-box functions

JM Hernández-Lobato, MW Hoffman… - Advances in neural …, 2014 - proceedings.neurips.cc
We propose a novel information-theoretic approach for Bayesian optimization called
Predictive Entropy Search (PES). At each iteration, PES selects the next evaluation point that …

Better-than-demonstrator imitation learning via automatically-ranked demonstrations

DS Brown, W Goo, S Niekum - Conference on robot learning, 2020 - proceedings.mlr.press
The performance of imitation learning is typically upper-bounded by the performance of the
demonstrator. While recent empirical results demonstrate that ranked demonstrations allow …

Bayesian optimization in a billion dimensions via random embeddings

Z Wang, F Hutter, M Zoghi, D Matheson… - Journal of Artificial …, 2016 - jair.org
Bayesian optimization techniques have been successfully applied to robotics, planning,
sensor placement, recommendation, advertising, intelligent user interfaces and automatic …

Scaling data-driven robotics with reward sketching and batch reinforcement learning

S Cabi, SG Colmenarejo, A Novikov… - arxiv preprint arxiv …, 2019 - arxiv.org
We present a framework for data-driven robotics that makes use of a large dataset of
recorded robot experience and scales to several tasks using learned reward functions. We …

Learning reward functions by integrating human demonstrations and preferences

M Palan, NC Landolfi, G Shevchuk… - arxiv preprint arxiv …, 2019 - arxiv.org
Our goal is to accurately and efficiently learn reward functions for autonomous robots.
Current approaches to this problem include inverse reinforcement learning (IRL), which …