A tutorial on sparse Gaussian processes and variational inference

F Leibfried, V Dutordoir, ST John… - arxiv preprint arxiv …, 2020 - arxiv.org
Gaussian processes (GPs) provide a framework for Bayesian inference that can offer
principled uncertainty estimates for a large range of problems. For example, if we consider …

Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks

Y Gao, N Yu - Applied Energy, 2022 - Elsevier
Volt-VAR control (VVC) is a critical tool to manage voltage profiles and reactive power flow
in power distribution networks by setting voltage regulating and reactive power …

Multi-modal pain intensity assessment based on physiological signals: A deep learning perspective

P Thiam, H Hihn, DA Braun, HA Kestler… - Frontiers in …, 2021 - frontiersin.org
Traditional pain assessment approaches ranging from self-reporting methods, to
observational scales, rely on the ability of an individual to accurately assess and …

A unified bellman optimality principle combining reward maximization and empowerment

F Leibfried, S Pascual-Diaz… - Advances in Neural …, 2019 - proceedings.neurips.cc
Empowerment is an information-theoretic method that can be used to intrinsically motivate
learning agents. It attempts to maximize an agent's control over the environment by …

Reinforcement learning with simple sequence priors

T Saanum, N Éltető, P Dayan… - Advances in Neural …, 2024 - proceedings.neurips.cc
In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis--
but this timescale ignores temporal regularities, like repetitions, often present in sequential …

Mutual-Information Regularized Multi-Agent Policy Iteration

D Ye, Z Lu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Despite the success of cooperative multi-agent reinforcement learning algorithms, most of
them focus on a single team composition, which prevents them from being used in more …

Hierarchically structured task-agnostic continual learning

H Hihn, DA Braun - Machine Learning, 2023 - Springer
One notable weakness of current machine learning algorithms is the poor ability of models
to solve new problems without forgetting previously acquired knowledge. The Continual …

Disentangled skill embeddings for reinforcement learning

JC Petangoda, S Pascual-Diaz, V Adam… - arxiv preprint arxiv …, 2019 - arxiv.org
We propose a novel framework for multi-task reinforcement learning (MTRL). Using a
variational inference formulation, we learn policies that generalize across both changing …

Language Model Adaption for Reinforcement Learning with Natural Language Action Space

J Wang, J Li, X Han, D Ye, Z Lu - … of the 62nd Annual Meeting of …, 2024 - aclanthology.org
Reinforcement learning with natural language action space often suffers from the curse of
dimensionality due to the combinatorial nature of the natural language. Previous research …

Planning not to talk: Multiagent systems that are robust to communication loss

MO Karabag, C Neary, U Topcu - arxiv preprint arxiv:2201.06619, 2022 - arxiv.org
In a cooperative multiagent system, a collection of agents executes a joint policy in order to
achieve some common objective. The successful deployment of such systems hinges on the …