- Academic Search

World models and predictive coding for cognitive and developmental robotics: frontiers and challenges

T Taniguchi, S Murata, M Suzuki, D Ognibene… - Advanced …, 2023 - Taylor & Francis

Creating autonomous robots that can actively explore the environment, acquire knowledge
and learn skills continuously is the ultimate achievement envisioned in cognitive and …

Salva Cita Citato da 74 Articoli correlati Tutte e 16 le versioni Ricerca biblioteche

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Optimal goal-reaching reinforcement learning via quasimetric learning

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press

In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

Salva Cita Citato da 37 Articoli correlati Tutte e 8 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] jair.org Full View

Structure in deep reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Salva Cita Citato da 23 Articoli correlati Tutte e 10 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning: An overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Salva Cita Citato da 1 Articoli correlati Tutte e 2 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bridging state and history representations: Understanding self-predictive rl

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

Salva Cita Citato da 23 Articoli correlati Tutte e 6 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning world models with identifiable factorization

Y Liu, B Huang, Z Zhu, H Tian… - Advances in Neural …, 2023 - proceedings.neurips.cc

Extracting a stable and compact representation of the environment is crucial for efficient
reinforcement learning in high-dimensional, noisy, and non-stationary environments …

Salva Cita Citato da 14 Articoli correlati Tutte e 10 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

C Zhu, M Simchowitz, S Gadipudi… - Advances in Neural …, 2023 - proceedings.neurips.cc

Visual model-based RL methods typically encode image observations into low-dimensional
representations in a manner that does not eliminate redundant information. This leaves them …

Salva Cita Citato da 10 Articoli correlati Tutte e 6 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc

Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Salva Cita Citato da 10 Articoli correlati Tutte e 5 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Building minimal and reusable causal state abstractions for reinforcement learning

Z Wang, C Wang, X **ao, Y Zhu, P Stone - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Two desiderata of reinforcement learning (RL) algorithms are the ability to learn from
relatively little experience and the ability to learn policies that generalize to a range of …

Salva Cita Citato da 7 Articoli correlati Tutte e 11 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Guaranteed discovery of control-endogenous latent states with multi-step inverse models

A Lamb, R Islam, Y Efroni, A Didolkar, D Misra… - arxiv preprint arxiv …, 2022 - arxiv.org

In many sequential decision-making tasks, the agent is not able to model the full complexity
of the world, which consists of multitudes of relevant and irrelevant information. For example …

Salva Cita Citato da 20 Articoli correlati Tutte e 3 le versioni Versione HTML

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Denoised mdps: Learning world models better than the world itself

World models and predictive coding for cognitive and developmental robotics: frontiers and challenges

Optimal goal-reaching reinforcement learning via quasimetric learning

Structure in deep reinforcement learning: A survey and open problems

Reinforcement learning: An overview

Bridging state and history representations: Understanding self-predictive rl

Learning world models with identifiable factorization

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

Ignorance is bliss: Robust control via information gating

Building minimal and reusable causal state abstractions for reinforcement learning

Guaranteed discovery of control-endogenous latent states with multi-step inverse models