[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
Deep learning (DL) and reinforcement learning (RL) methods seem to be a part of
indispensable factors to achieve human-level or super-human AI systems. On the other …

Metalearning and neuromodulation

K Doya - Neural networks, 2002 - Elsevier
This paper presents a computational theory on the roles of the ascending neuromodulatory
systems from the viewpoint that they mediate the global signals that regulate the distributed …

Mt-opt: Continuous multi-task robotic reinforcement learning at scale

D Kalashnikov, J Varley, Y Chebotar… - arxiv preprint arxiv …, 2021 - arxiv.org
General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …

Learning agile and dynamic motor skills for legged robots

J Hwangbo, J Lee, A Dosovitskiy, D Bellicoso… - Science Robotics, 2019 - science.org
Legged robots pose one of the greatest challenges in robotics. Dynamic and agile
maneuvers of animals cannot be imitated by existing methods that are crafted by humans. A …

Reinforcement learning in robotics: A survey

J Kober, JA Bagnell, J Peters - The International Journal of …, 2013 - journals.sagepub.com
Reinforcement learning offers to robotics a framework and set of tools for the design of
sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic …

A hierarchy of time-scales and the brain

SJ Kiebel, J Daunizeau, KJ Friston - PLoS computational biology, 2008 - journals.plos.org
In this paper, we suggest that cortical anatomy recapitulates the temporal hierarchy that is
inherent in the dynamics of environmental states. Many aspects of brain function can be …

Multiple model-based reinforcement learning

K Doya, K Samejima, K Katagiri, M Kawato - Neural computation, 2002 - direct.mit.edu
We propose a modular reinforcement learning architecture for nonlinear, nonstationary
control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic …

Hierarchical relative entropy policy search

C Daniel, G Neumann, O Kroemer, J Peters - Journal of Machine Learning …, 2016 - jmlr.org
Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks
that are strongly structured. Such task structures can be exploited by incorporating …

[SÁCH][B] Exploring robotic minds: actions, symbols, and consciousness as self-organizing dynamic phenomena

J Tani - 2016 - books.google.com
In Exploring Robotic Minds: Actions, Symbols, and Consciousness as Self-Organizing
Dynamic Phenomena, Jun Tani sets out to answer an essential and tantalizing question …

Robust reinforcement learning

J Morimoto, K Doya - Neural computation, 2005 - ieeexplore.ieee.org
This letter proposes a new reinforcement learning (RL) paradigm that explicitly takes into
account input disturbance as well as modeling errors. The use of environmental models in …