Convolutional neural networks as a model of the visual system: Past, present, and future

GW Lindsay - Journal of cognitive neuroscience, 2021 - direct.mit.edu
Convolutional neural networks (CNNs) were inspired by early findings in the study of
biological vision. They have since become successful tools in computer vision and state-of …

Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning

J Hua, L Zeng, G Li, Z Ju - Sensors, 2021 - mdpi.com
Dexterous manipulation of the robot is an important part of realizing intelligence, but
manipulators can only perform simple tasks such as sorting and packing in a structured …

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning

M Nakamoto, S Zhai, A Singh… - Advances in …, 2024 - proceedings.neurips.cc
A compelling use case of offline reinforcement learning (RL) is to obtain a policy initialization
from existing datasets followed by fast online fine-tuning with limited interaction. However …

Foundation models for decision making: Problems, methods, and opportunities

S Yang, O Nachum, Y Du, J Wei, P Abbeel… - arxiv preprint arxiv …, 2023 - arxiv.org
Foundation models pretrained on diverse data at scale have demonstrated extraordinary
capabilities in a wide range of vision and language tasks. When such models are deployed …

Behavior Transformers: Cloning modes with one stone

NM Shafiullah, Z Cui… - Advances in neural …, 2022 - proceedings.neurips.cc
While behavior learning has made impressive progress in recent times, it lags behind
computer vision and natural language processing due to its inability to leverage large …

Solving rubik's cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arxiv preprint arxiv …, 2019 - arxiv.org
We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …

robosuite: A modular simulation framework and benchmark for robot learning

Y Zhu, J Wong, A Mandlekar, R Martín-Martín… - arxiv preprint arxiv …, 2020 - arxiv.org
robosuite is a simulation framework for robot learning powered by the MuJoCo physics
engine. It offers a modular design for creating robotic tasks as well as a suite of benchmark …

Learning dexterous in-hand manipulation

OAIM Andrychowicz, B Baker… - … Journal of Robotics …, 2020 - journals.sagepub.com
We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies that
can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The …

[HTML][HTML] dm_control: Software and tasks for continuous control

S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez… - Software Impacts, 2020 - Elsevier
The dm_control software package is a collection of Python libraries and task suites for
reinforcement learning agents in an articulated-body simulation. Infrastructure includes a …