Bayesian reinforcement learning: A survey
Bayesian methods for machine learning have been widely investigated, yielding principled
methods for incorporating prior information into inference algorithms. In this survey, we …
methods for incorporating prior information into inference algorithms. In this survey, we …
A unified recipe for deriving (time-uniform) PAC-Bayes bounds
We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike
most previous literature on this topic, our bounds are anytime-valid (ie, time-uniform) …
most previous literature on this topic, our bounds are anytime-valid (ie, time-uniform) …
Risk bounds for the majority vote: From a PAC-Bayesian analysis to a learning algorithm
P Germain, A Lacasse, F Laviolette… - ar** deep learning models with uncertainty estimates in the form of set-valued …
[PDF][PDF] Fast rates in statistical and online learning
The speed with which a learning algorithm converges as it is presented with more data is a
central problem in machine learning—a fast rate of convergence means less data is needed …
central problem in machine learning—a fast rate of convergence means less data is needed …
[PDF][PDF] Bayesian nonparametric covariance regression
Capturing predictor-dependent correlations amongst the elements of a multivariate
response vector is fundamental to numerous applied domains, including neuroscience …
response vector is fundamental to numerous applied domains, including neuroscience …
PAC-Bayesian lifelong learning for multi-armed bandits
We present a PAC-Bayesian analysis of lifelong learning. In the lifelong learning problem, a
sequence of learning tasks is observed one-at-a-time, and the goal is to transfer information …
sequence of learning tasks is observed one-at-a-time, and the goal is to transfer information …
PAC-Bayesian soft actor-critic learning
Actor-critic algorithms address the dual goals of reinforcement learning (RL), policy
evaluation and improvement via two separate function approximators. The practicality of this …
evaluation and improvement via two separate function approximators. The practicality of this …
[PDF][PDF] Policy learning for domain selection in an extensible multi-domain spoken dialogue system
This paper proposes a Markov Decision Process and reinforcement learning based
approach for domain selection in a multidomain Spoken Dialogue System built on a …
approach for domain selection in a multidomain Spoken Dialogue System built on a …
PAC-Bayes control: learning policies that provably generalize to novel environments
Our goal is to learn control policies for robots that provably generalize well to novel
environments given a dataset of example environments. The key technical idea behind our …
environments given a dataset of example environments. The key technical idea behind our …