A tutorial on sparse Gaussian processes and variational inference
Gaussian processes (GPs) provide a framework for Bayesian inference that can offer
principled uncertainty estimates for a large range of problems. For example, if we consider …
principled uncertainty estimates for a large range of problems. For example, if we consider …
Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks
Volt-VAR control (VVC) is a critical tool to manage voltage profiles and reactive power flow
in power distribution networks by setting voltage regulating and reactive power …
in power distribution networks by setting voltage regulating and reactive power …
Multi-modal pain intensity assessment based on physiological signals: A deep learning perspective
Traditional pain assessment approaches ranging from self-reporting methods, to
observational scales, rely on the ability of an individual to accurately assess and …
observational scales, rely on the ability of an individual to accurately assess and …
A unified bellman optimality principle combining reward maximization and empowerment
F Leibfried, S Pascual-Diaz… - Advances in Neural …, 2019 - proceedings.neurips.cc
Empowerment is an information-theoretic method that can be used to intrinsically motivate
learning agents. It attempts to maximize an agent's control over the environment by …
learning agents. It attempts to maximize an agent's control over the environment by …
Reinforcement learning with simple sequence priors
In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis--
but this timescale ignores temporal regularities, like repetitions, often present in sequential …
but this timescale ignores temporal regularities, like repetitions, often present in sequential …
Mutual-Information Regularized Multi-Agent Policy Iteration
Despite the success of cooperative multi-agent reinforcement learning algorithms, most of
them focus on a single team composition, which prevents them from being used in more …
them focus on a single team composition, which prevents them from being used in more …
Hierarchically structured task-agnostic continual learning
One notable weakness of current machine learning algorithms is the poor ability of models
to solve new problems without forgetting previously acquired knowledge. The Continual …
to solve new problems without forgetting previously acquired knowledge. The Continual …
Disentangled skill embeddings for reinforcement learning
We propose a novel framework for multi-task reinforcement learning (MTRL). Using a
variational inference formulation, we learn policies that generalize across both changing …
variational inference formulation, we learn policies that generalize across both changing …
Language Model Adaption for Reinforcement Learning with Natural Language Action Space
Reinforcement learning with natural language action space often suffers from the curse of
dimensionality due to the combinatorial nature of the natural language. Previous research …
dimensionality due to the combinatorial nature of the natural language. Previous research …
Planning not to talk: Multiagent systems that are robust to communication loss
In a cooperative multiagent system, a collection of agents executes a joint policy in order to
achieve some common objective. The successful deployment of such systems hinges on the …
achieve some common objective. The successful deployment of such systems hinges on the …