Combining behaviors with the successor features keyboard

WC Carvalho, A Saraiva, A Filos… - Advances in neural …, 2023 - proceedings.neurips.cc
Abstract The Option Keyboard (OK) was recently proposed as a method for transferring
behavioral knowledge across tasks. OK transfers knowledge by adaptively combining …

Maximum state entropy exploration using predecessor and successor representations

AK Jain, L Lehnert, I Rish… - Advances in Neural …, 2024 - proceedings.neurips.cc
Animals have a developed ability to explore that aids them in important tasks such as
locating food, exploring for shelter, and finding misplaced items. These exploration skills …

Constrained gpi for zero-shot transfer in reinforcement learning

J Kim, S Park, G Kim - Advances in Neural Information …, 2022 - proceedings.neurips.cc
For zero-shot transfer in reinforcement learning where the reward function varies between
different tasks, the successor features framework has been one of the popular approaches …

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

AK Jain, H Wiltzer, J Farebrother, I Rish… - arxiv preprint arxiv …, 2024 - arxiv.org
In inverse reinforcement learning (IRL), an agent seeks to replicate expert demonstrations
through interactions with the environment. Traditionally, IRL is treated as an adversarial …

An advantage based policy transfer algorithm for reinforcement learning with metrics of transferability

MF Alam, P Naghizadeh, D Hoelzle - arxiv preprint arxiv:2311.06731, 2023 - arxiv.org
Reinforcement learning (RL) can enable sequential decision-making in complex and high-
dimensional environments if the acquisition of a new state-action pair is efficient, ie, when …

Generalization through the lens of learning dynamics

C Lyle - arxiv preprint arxiv:2212.05377, 2022 - arxiv.org
A machine learning (ML) system must learn not only to match the output of a target function
on a training set, but also to generalize to novel situations in order to yield accurate …

Generalizable Agents with Improved Abstractions and Transfer

김재겸 - 2023 - s-space.snu.ac.kr
Many researchers in the field of deep learning have been trying to build agents that perform
a wide range of tasks. Since training on all the possible tasks is often not viable, improving …