Μελετητής Google

S Park, O Rybkin, S Levine - arxiv preprint arxiv:2310.08887, 2023 - arxiv.org

Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 31 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] On efficient computation in active inference

A Paul, N Sajid, L Da Costa, A Razi - Expert Systems with Applications, 2024 - Elsevier

Biological agents demonstrate a remarkable proficiency in calibrating appropriate scales of
planning and evaluation when interacting with their environments. It follows logically that …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 7 Σχετικά άρθρα Όλες οι 3 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

A mixture of surprises for unsupervised reinforcement learning

A Zhao, M Lin, Y Li, YJ Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

Unsupervised reinforcement learning aims at learning a generalist policy in a reward-free
manner for fast adaptation to downstream tasks. Most of the existing methods propose to …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 12 Σχετικά άρθρα Όλες οι 7 εκδοχές Προβολή ως HTML

[ΒΙΒΛΙΟ][B] Natural General Intelligence: How understanding the brain can help us build AI

C Summerfield - 2022 - books.google.com

Since the time of Turing, computer scientists have dreamed of building artificial general
intelligence (AGI)-a system that can think, learn and act as humans do. Over recent years …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 34 Σχετικά άρθρα Όλες οι 5 εκδοχές Αναζήτηση βιβλιοθήκης

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Predictable mdp abstraction for unsupervised model-based rl

S Park, S Levine - International Conference on Machine …, 2023 - proceedings.mlr.press

A key component of model-based reinforcement learning (RL) is a dynamics model that
predicts the outcomes of actions. Errors in this predictive model can degrade the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 11 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Accelerating Goal-Conditioned RL Algorithms and Research

M Bortkiewicz, W Pałucki, V Myers, T Dziarmaga… - arxiv preprint arxiv …, 2024 - arxiv.org

Self-supervision has the potential to transform reinforcement learning (RL), paralleling the
breakthroughs it has enabled in other areas of machine learning. While self-supervised …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 2 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

R Wei, N Lambert, A McDonald, A Garcia… - arxiv preprint arxiv …, 2023 - arxiv.org

Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient,
adaptive, and explainable by learning an explicit model of the environment. While the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 10 Σχετικά άρθρα Όλες οι 3 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination

M Benfeghoul, U Zahid, Q Guo, Z Fountas - arxiv preprint arxiv …, 2024 - arxiv.org

In an unfamiliar setting, a model-based reinforcement learning agent can be limited by the
accuracy of its world model. In this work, we present a novel, training-free approach to …

Αποθήκευση Παράθεση Σχετικά άρθρα Όλες οι 2 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning

A Hugessen, RC Castanyer, F Mohamed… - arxiv preprint arxiv …, 2024 - arxiv.org

Both entropy-minimizing and entropy-maximizing (curiosity) objectives for unsupervised
reinforcement learning (RL) have been shown to be effective in different environments …

Αποθήκευση Παράθεση Σχετικά άρθρα Όλες οι 3 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] umontreal.ca

Intrinsic exploration for reinforcement learning beyond rewards

R Creus-Castanyer - 2024 - papyrus.bib.umontreal.ca

In reinforcement learning, a reward function is used to guide the agent's behavior towards
task-specific objectives. However, such extrinsic rewards often fall short in complex …

Αποθήκευση Παράθεση Σχετικά άρθρα Προβολή ως HTML

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Information is power: Intrinsic control via information capture

Metra: Scalable unsupervised rl with metric-aware abstraction

[HTML][HTML] On efficient computation in active inference

A mixture of surprises for unsupervised reinforcement learning

[ΒΙΒΛΙΟ][B] Natural General Intelligence: How understanding the brain can help us build AI

Predictable mdp abstraction for unsupervised model-based rl

Accelerating Goal-Conditioned RL Algorithms and Research

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning

Intrinsic exploration for reinforcement learning beyond rewards