- Academic Search

F Leibfried, V Dutordoir, ST John… - arxiv preprint arxiv …, 2020 - arxiv.org

Gaussian processes (GPs) provide a framework for Bayesian inference that can offer
principled uncertainty estimates for a large range of problems. For example, if we consider …

Speichern Zitieren Zitiert von: 57 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] ucr.edu

Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks

Y Gao, N Yu - Applied Energy, 2022 - Elsevier

Volt-VAR control (VVC) is a critical tool to manage voltage profiles and reactive power flow
in power distribution networks by setting voltage regulating and reactive power …

Speichern Zitieren Zitiert von: 57 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]

[PDF] frontiersin.org

Multi-modal pain intensity assessment based on physiological signals: A deep learning perspective

P Thiam, H Hihn, DA Braun, HA Kestler… - Frontiers in …, 2021 - frontiersin.org

Traditional pain assessment approaches ranging from self-reporting methods, to
observational scales, rely on the ability of an individual to accurately assess and …

Speichern Zitieren Zitiert von: 40 Ähnliche Artikel Alle 7 Versionen Im Cache

[Free GPT-4]

[PDF] neurips.cc

A unified bellman optimality principle combining reward maximization and empowerment

F Leibfried, S Pascual-Diaz… - Advances in Neural …, 2019 - proceedings.neurips.cc

Empowerment is an information-theoretic method that can be used to intrinsically motivate
learning agents. It attempts to maximize an agent's control over the environment by …

Speichern Zitieren Zitiert von: 40 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Reinforcement learning with simple sequence priors

T Saanum, N Éltető, P Dayan… - Advances in Neural …, 2024 - proceedings.neurips.cc

In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis--
but this timescale ignores temporal regularities, like repetitions, often present in sequential …

Speichern Zitieren Zitiert von: 9 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Mutual-Information Regularized Multi-Agent Policy Iteration

D Ye, Z Lu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc

Despite the success of cooperative multi-agent reinforcement learning algorithms, most of
them focus on a single team composition, which prevents them from being used in more …

Speichern Zitieren Ähnliche Artikel HTML-Version

[Free GPT-4]

[PDF] springer.com

Hierarchically structured task-agnostic continual learning

H Hihn, DA Braun - Machine Learning, 2023 - Springer

One notable weakness of current machine learning algorithms is the poor ability of models
to solve new problems without forgetting previously acquired knowledge. The Continual …

Speichern Zitieren Zitiert von: 8 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]

[PDF] arxiv.org

Disentangled skill embeddings for reinforcement learning

JC Petangoda, S Pascual-Diaz, V Adam… - arxiv preprint arxiv …, 2019 - arxiv.org

We propose a novel framework for multi-task reinforcement learning (MTRL). Using a
variational inference formulation, we learn policies that generalize across both changing …

Speichern Zitieren Zitiert von: 20 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] aclanthology.org

Language Model Adaption for Reinforcement Learning with Natural Language Action Space

J Wang, J Li, X Han, D Ye, Z Lu - … of the 62nd Annual Meeting of …, 2024 - aclanthology.org

Reinforcement learning with natural language action space often suffers from the curse of
dimensionality due to the combinatorial nature of the natural language. Previous research …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Planning not to talk: Multiagent systems that are robust to communication loss

MO Karabag, C Neary, U Topcu - arxiv preprint arxiv:2201.06619, 2022 - arxiv.org

In a cooperative multiagent system, a collection of agents executes a joint policy in order to
achieve some common objective. The successful deployment of such systems hinges on the …

Speichern Zitieren Zitiert von: 10 Ähnliche Artikel Alle 7 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Mutual-information regularization in Markov decision processes and actor-critic learning

A tutorial on sparse Gaussian processes and variational inference

Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks

Multi-modal pain intensity assessment based on physiological signals: A deep learning perspective

A unified bellman optimality principle combining reward maximization and empowerment

Reinforcement learning with simple sequence priors

Mutual-Information Regularized Multi-Agent Policy Iteration

Hierarchically structured task-agnostic continual learning

Disentangled skill embeddings for reinforcement learning

Language Model Adaption for Reinforcement Learning with Natural Language Action Space

Planning not to talk: Multiagent systems that are robust to communication loss