Google Scholar

Combining behaviors with the successor features keyboard

WC Carvalho, A Saraiva, A Filos… - Advances in neural …, 2023 - proceedings.neurips.cc

The Option Keyboard (OK) was recently proposed as a method for transferring
behavioral knowledge across tasks. OK transfers knowledge by adaptively combining …

Cited by 6 (8 versions)

[PDF] neurips.cc

Maximum state entropy exploration using predecessor and successor representations

AK Jain, L Lehnert, I Rish… - Advances in Neural …, 2024 - proceedings.neurips.cc

Animals have a developed ability to explore that aids them in important tasks such as
locating food, exploring for shelter, and finding misplaced items. These exploration skills …

Cited by 10 (7 versions)

[PDF] neurips.cc

Constrained GPI for zero-shot transfer in reinforcement learning

J Kim, S Park, G Kim - Advances in Neural Information …, 2022 - proceedings.neurips.cc

For zero-shot transfer in reinforcement learning where the reward function varies between
different tasks, the successor features framework has been one of the popular approaches …

Cited by 5 (4 versions)

[PDF] arxiv.org

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

AK Jain, H Wiltzer, J Farebrother, I Rish… - arXiv preprint arXiv …, 2024 - arxiv.org

In inverse reinforcement learning (IRL), an agent seeks to replicate expert demonstrations
through interactions with the environment. Traditionally, IRL is treated as an adversarial …

2 versions

[PDF] arxiv.org

An advantage based policy transfer algorithm for reinforcement learning with metrics of transferability

MF Alam, P Naghizadeh, D Hoelzle - arXiv preprint arXiv:2311.06731, 2023 - arxiv.org

Reinforcement learning (RL) can enable sequential decision-making in complex and
high-dimensional environments if the acquisition of a new state-action pair is efficient, i.e., when …

[PDF] arxiv.org

Generalization through the lens of learning dynamics

C Lyle - arXiv preprint arXiv:2212.05377, 2022 - arxiv.org

A machine learning (ML) system must learn not only to match the output of a target function
on a training set, but also to generalize to novel situations in order to yield accurate …

4 versions

[PDF] snu.ac.kr

Generalizable Agents with Improved Abstractions and Transfer

김재겸 - 2023 - s-space.snu.ac.kr

Many researchers in the field of deep learning have been trying to build agents that perform
a wide range of tasks. Since training on all the possible tasks is often not viable, improving …

Santu Rana, and Svetha Venkatesh. A new representation of successor features for transfer...

