[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
Deep learning (DL) and reinforcement learning (RL) methods seem to be a part of
indispensable factors to achieve human-level or super-human AI systems. On the other …

Hierarchical motor control in mammals and machines

J Merel, M Botvinick, G Wayne - Nature communications, 2019 - nature.com
Advances in artificial intelligence are stimulating interest in neuroscience. However, most
attention is given to discrete tasks with simple action spaces, such as board games and …

Ase: Large-scale reusable adversarial skill embeddings for physically simulated characters

XB Peng, Y Guo, L Halper, S Levine… - ACM Transactions On …, 2022 - dl.acm.org
The incredible feats of athleticism demonstrated by humans are made possible in part by a
vast repertoire of general-purpose motor skills, acquired through years of practice and …

Amp: Adversarial motion priors for stylized physics-based character control

XB Peng, Z Ma, P Abbeel, S Levine… - ACM Transactions on …, 2021 - dl.acm.org
Synthesizing graceful and life-like behaviors for physically simulated characters has been a
fundamental challenge in computer animation. Data-driven methods that leverage motion …

[HTML][HTML] dm_control: Software and tasks for continuous control

S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez… - Software Impacts, 2020 - Elsevier
The dm_control software package is a collection of Python libraries and task suites for
reinforcement learning agents in an articulated-body simulation. Infrastructure includes a …

A virtual rodent predicts the structure of neural activity across behaviours

D Aldarondo, J Merel, JD Marshall, L Hasenclever… - Nature, 2024 - nature.com
Animals have exquisite control of their bodies, allowing them to perform a diverse range of
behaviours. How such control is implemented by the brain, however, remains unclear …

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Synthesizing diverse human motions in 3d indoor scenes

K Zhao, Y Zhang, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel method for populating 3D indoor scenes with virtual humans that can
navigate in the environment and interact with objects in a realistic manner. Existing …

Accelerating reinforcement learning with learned skill priors

K Pertsch, Y Lee, J Lim - Conference on robot learning, 2021 - proceedings.mlr.press
Intelligent agents rely heavily on prior experience when learning a new task, yet most
modern reinforcement learning (RL) approaches learn every task from scratch. One …

Physics-based character controllers using conditional vaes

J Won, D Gopinath, J Hodgins - ACM Transactions on Graphics (TOG), 2022 - dl.acm.org
High-quality motion capture datasets are now publicly available, and researchers have used
them to create kinematics-based controllers that can generate plausible and diverse human …