Google Академик

L Wang, X Zhang, H Su, J Zhu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org

To cope with real-world dynamics, an intelligent system needs to incrementally acquire,
update, accumulate, and exploit knowledge throughout its lifetime. This ability, known as …

Сачувај Цитирај 737 пута наведен Сродни чланци Све верзије (9)

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com

Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Сачувај Цитирај 946 пута наведен Сродни чланци Све верзије (15) Претрага библиотека HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Model-based offline planning

A Argenson, G Dulac-Arnold - arxiv preprint arxiv:2008.05556, 2020 - arxiv.org

Offline learning is a key part of making reinforcement learning (RL) useable in real systems.
Offline RL looks at scenarios where there is data from a system's operation, but no direct …

Сачувај Цитирај 162 пута наведен Сродни чланци Све верзије (4) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Continual world: A robotic benchmark for continual reinforcement learning

M Wołczyk, M Zając, R Pascanu… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract Continual learning (CL)---the ability to continuously learn, building on previously
acquired knowledge---is a natural requirement for long-lived autonomous reinforcement …

Сачувај Цитирај 106 пута наведен Сродни чланци Све верзије (7) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Optimizing for the future in non-stationary mdps

Y Chandak, G Theocharous… - International …, 2020 - proceedings.mlr.press

Most reinforcement learning methods are based upon the key assumption that the transition
dynamics and reward functions are fixed, that is, the underlying Markov decision process is …

Сачувај Цитирај 79 пута наведен Сродни чланци Све верзије (11) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Prediction and control in continual reinforcement learning

N Anand, D Precup - Advances in Neural Information …, 2023 - proceedings.neurips.cc

Temporal difference (TD) learning is often used to update the estimate of the value function
which is used by RL agents to extract useful policies. In this paper, we focus on value …

Сачувај Цитирај 14 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reset-free lifelong learning with skill-space planning

K Lu, A Grover, P Abbeel, I Mordatch - arxiv preprint arxiv:2012.03548, 2020 - arxiv.org

The objective of lifelong reinforcement learning (RL) is to optimize agents which can
continuously adapt and interact in changing environments. However, current RL approaches …

Сачувај Цитирај 49 пута наведен Сродни чланци Све верзије (5) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning skills to patch plans based on inaccurate models

A Lagrassa, S Lee, O Kroemer - 2020 IEEE/RSJ International …, 2020 - ieeexplore.ieee.org

Planners using accurate models can be effective for accomplishing manipulation tasks in the
real world, but are typically highly specialized and require significant fine-tuning to be …

Сачувај Цитирај 13 пута наведен Сродни чланци Све верзије (7)

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Neural-progressive hedging: Enforcing constraints in reinforcement learning with stochastic programming

S Ghosh, L Wynter, SH Lim… - Uncertainty in Artificial …, 2022 - proceedings.mlr.press

We propose a framework, called neural-progressive hedging (NP), that leverages stochastic
programming during the online phase of executing a reinforcement learning (RL) policy. The …

Сачувај Цитирај 2 пута наведен Сродни чланци Све верзије (8) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Uncertainty-sensitive learning and planning with ensembles

P Miłoś, Ł Kuciński, K Czechowski… - arxiv preprint arxiv …, 2019 - arxiv.org

We propose a reinforcement learning framework for discrete environments in which an
agent makes both strategic and tactical decisions. The former manifests itself through the …

Сачувај Цитирај 8 пута наведен Сродни чланци Све верзије (3) HTML верзија

Направи обавештење

Цитирај

Напредна претрага

Сачувано у мојој библиотеци

Adaptive online planning for continual lifelong learning

A comprehensive survey of continual learning: Theory, method and application

Model-based reinforcement learning: A survey

Model-based offline planning

Continual world: A robotic benchmark for continual reinforcement learning

Optimizing for the future in non-stationary mdps

Prediction and control in continual reinforcement learning

Reset-free lifelong learning with skill-space planning

Learning skills to patch plans based on inaccurate models

Neural-progressive hedging: Enforcing constraints in reinforcement learning with stochastic programming

Uncertainty-sensitive learning and planning with ensembles