- Academic Search

B Eysenbach, V Myers… - Advances in Neural …, 2025 - proceedings.neurips.cc

Given time series data, how can we answer questions like what will happen in the
future?''and how did we get here?''These sorts of probabilistic inference questions are …

Opslaan Citeren Geciteerd door 15 Verwante artikelen Alle 7 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning: An overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Opslaan Citeren Geciteerd door 1 Verwante artikelen Alle 2 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Opslaan Citeren Geciteerd door 23 Verwante artikelen Alle 6 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Investigating the properties of neural network representations in reinforcement learning

H Wang, E Miahi, M White, MC Machado, Z Abbas… - Artificial Intelligence, 2024 - Elsevier

In this paper we investigate the properties of representations learned by deep reinforcement
learning systems. Much of the early work on representations for reinforcement learning …

Opslaan Citeren Geciteerd door 28 Verwante artikelen Alle 6 versies

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Predictive auxiliary objectives in deep rl mimic learning in the brain

C Fang, KL Stachenfeld - arxiv preprint arxiv:2310.06089, 2023 - arxiv.org

The ability to predict upcoming events has been hypothesized to comprise a key aspect of
natural and machine cognition. This is supported by trends in deep reinforcement learning …

Opslaan Citeren Geciteerd door 10 Verwante artikelen Alle 4 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A unified view on solving objective mismatch in model-based reinforcement learning

R Wei, N Lambert, A McDonald, A Garcia… - arxiv preprint arxiv …, 2023 - arxiv.org

Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient,
adaptive, and explainable by learning an explicit model of the environment. While the …

Opslaan Citeren Geciteerd door 10 Verwante artikelen Alle 3 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Representations and exploration for deep reinforcement learning using singular value decomposition

Y Chandak, S Thakoor, ZD Guo… - International …, 2023 - proceedings.mlr.press

Abstract Representation learning and exploration are among the key challenges for any
deep reinforcement learning agent. In this work, we provide a singular value decomposition …

Opslaan Citeren Geciteerd door 5 Verwante artikelen Alle 7 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Cross-domain policy adaptation by capturing representation mismatch

J Lyu, C Bai, J Yang, Z Lu, X Li - arxiv preprint arxiv:2405.15369, 2024 - arxiv.org

It is vital to learn effective policies that can be transferred to different domains with dynamics
discrepancies in reinforcement learning (RL). In this paper, we consider dynamics …

Opslaan Citeren Geciteerd door 4 Verwante artikelen Alle 8 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Self-predictive universal AI

E Catt, J Grau-Moya, M Hutter… - Advances in …, 2023 - proceedings.neurips.cc

Reinforcement Learning (RL) algorithms typically utilize learning and/or planning
techniques to derive effective policies. The integration of both approaches has proven to be …

Opslaan Citeren Geciteerd door 4 Verwante artikelen Alle 3 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Curiosity in hindsight: Intrinsic exploration in stochastic environments

D Jarrett, C Tallec, F Altché, T Mesnard… - arxiv preprint arxiv …, 2022 - arxiv.org

Consider the problem of exploration in sparse-reward or reward-free environments, such as
in Montezuma's Revenge. In the curiosity-driven paradigm, the agent is rewarded for how …

Opslaan Citeren Geciteerd door 14 Verwante artikelen Alle 7 versies HTML-versie

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Understanding self-predictive learning for reinforcement learning

Inference via interpolation: Contrastive representations provably enable planning and inference

Reinforcement learning: An overview

Bridging state and history representations: Understanding self-predictive rl

[HTML][HTML] Investigating the properties of neural network representations in reinforcement learning

Predictive auxiliary objectives in deep rl mimic learning in the brain

A unified view on solving objective mismatch in model-based reinforcement learning

Representations and exploration for deep reinforcement learning using singular value decomposition

Cross-domain policy adaptation by capturing representation mismatch

Self-predictive universal AI

Curiosity in hindsight: Intrinsic exploration in stochastic environments