Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Deep reinforcement learning
SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …
decision strategies. However, in many cases, it is desirable to learn directly from …
SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning
Deep reinforcement learning (DRL) has gained great success by learning directly from high-
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …
Reasoning about physical interactions with object-oriented prediction and planning
Object-based factorizations provide a useful level of abstraction for interacting with the
world. Building explicit object representations, however, often requires supervisory signals …
world. Building explicit object representations, however, often requires supervisory signals …
Human-level reinforcement learning through theory-based modeling, exploration, and planning
PA Tsividis, J Loula, J Burga, N Foss… - arxiv preprint arxiv …, 2021 - arxiv.org
Reinforcement learning (RL) studies how an agent comes to achieve reward in an
environment through interactions over time. Recent advances in machine RL have …
environment through interactions over time. Recent advances in machine RL have …
Unsupervised object interaction learning with counterfactual dynamics models
We present COIL (Counterfactual Object Interaction Learning), a novel way of learning skills
of object interactions on entity-centric environments. The goal is to learn primitive behaviors …
of object interactions on entity-centric environments. The goal is to learn primitive behaviors …
Roll: Visual self-supervised reinforcement learning with object reasoning
Current image-based reinforcement learning (RL) algorithms typically operate on the whole
image without performing object-level reasoning. This leads to inefficient goal sampling and …
image without performing object-level reasoning. This leads to inefficient goal sampling and …
Causal world models by unsupervised deconfounding of physical dynamics
The capability of imagining internally with a mental model of the world is vitally important for
human cognition. If a machine intelligent agent can learn a world model to create a" dream" …
human cognition. If a machine intelligent agent can learn a world model to create a" dream" …
Learning abstract models for strategic exploration and fast reward transfer
Model-based reinforcement learning (RL) is appealing because (i) it enables planning and
thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables …
thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables …
Одновременное планирование и обучение в иерархической системе управления когнитивным агентом
АИ Панов - Автоматика и телемеханика, 2022 - mathnet.ru
Задачи планирования поведения и обучения принятию решений в динамической среде
в системах управления интеллектуальными агентами обычно разделяют и …
в системах управления интеллектуальными агентами обычно разделяют и …
Toybox: a suite of environments for experimental evaluation of deep reinforcement learning
Evaluation of deep reinforcement learning (RL) is inherently challenging. In particular,
learned policies are largely opaque, and hypotheses about the behavior of deep RL agents …
learned policies are largely opaque, and hypotheses about the behavior of deep RL agents …