[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
Deep learning (DL) and reinforcement learning (RL) methods seem to be a part of
indispensable factors to achieve human-level or super-human AI systems. On the other …

Knowledge-integrated machine learning for materials: lessons from gameplaying and robotics

K Hippalgaonkar, Q Li, X Wang, JW Fisher III… - Nature Reviews …, 2023 - nature.com
As materials researchers increasingly embrace machine-learning (ML) methods, it is natural
to wonder what lessons can be learned from other fields undergoing similar developments …

Daydreamer: World models for physical robot learning

P Wu, A Escontrela, D Hafner… - … on robot learning, 2023 - proceedings.mlr.press
To solve tasks in complex environments, robots need to learn from experience. Deep
reinforcement learning is a common approach to robot learning but requires a large amount …

[HTML][HTML] Robot learning towards smart robotic manufacturing: A review

Z Liu, Q Liu, W Xu, L Wang, Z Zhou - Robotics and Computer-Integrated …, 2022 - Elsevier
Robotic equipment has been playing a central role since the proposal of smart
manufacturing. Since the beginning of the first integration of industrial robots into production …

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Temporal difference learning for model predictive control

N Hansen, X Wang, H Su - arxiv preprint arxiv:2203.04955, 2022 - arxiv.org
Data-driven model predictive control has two key advantages over model-free methods: a
potential for improved sample efficiency through model learning, and better performance as …

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arxiv preprint arxiv:2010.02193, 2020 - arxiv.org
Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Solving rubik's cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arxiv preprint arxiv …, 2019 - arxiv.org
We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …