Deep reinforcement learning for robotics: A survey of real-world successes

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Waymax: An accelerated, data-driven simulator for large-scale autonomous driving research

C Gulino, J Fu, W Luo, G Tucker… - Advances in …, 2023 - proceedings.neurips.cc
Simulation is an essential tool to develop and benchmark autonomous vehicle planning
software in a safe and cost-effective manner. However, realistic simulation requires accurate …

Toward general-purpose robots via foundation models: A survey and meta-analysis

Y Hu, Q **e, V Jain, J Francis, J Patrikar… - arxiv preprint arxiv …, 2023 - arxiv.org
Building general-purpose robots that operate seamlessly in any environment, with any
object, and utilizing various skills to complete diverse tasks has been a long-standing goal in …

On bringing robots home

NMM Shafiullah, A Rai, H Etukuru, Y Liu, I Misra… - arxiv preprint arxiv …, 2023 - arxiv.org
Throughout history, we have successfully integrated various machines into our homes.
Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent …

Adaptive mobile manipulation for articulated objects in the open world

H **ong, R Mendonca, K Shaw, D Pathak - arxiv preprint arxiv …, 2024 - arxiv.org
Deploying robots in open-ended unstructured environments such as homes has been a long-
standing research problem. However, robots are often studied only in closed-off lab settings …

Plan-seq-learn: Language model guided rl for solving long horizon robotics tasks

M Dalal, T Chiruvolu, D Chaplot… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) have been shown to be capable of performing high-level
planning for long-horizon robotics tasks, yet existing methods require access to a pre …

Accelerating reinforcement learning with value-conditional state entropy exploration

D Kim, J Shin, P Abbeel, Y Seo - Advances in Neural …, 2023 - proceedings.neurips.cc
A promising technique for exploration is to maximize the entropy of visited state distribution,
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …

How to prompt your robot: A promptbook for manipulation skills with code as policies

MG Arenas, T **ao, S Singh, V Jain… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) have demonstrated the ability to perform semantic
reasoning, planning and write code for robotics tasks. However, most methods rely on pre …

Continuous control with coarse-to-fine reinforcement learning

Y Seo, J Uruç, S James - arxiv preprint arxiv:2407.07787, 2024 - arxiv.org
Despite recent advances in improving the sample-efficiency of reinforcement learning (RL)
algorithms, designing an RL algorithm that can be practically deployed in real-world …

What foundation models can bring for robot learning in manipulation: A survey

D Li, Y **, Y Sun, H Yu, J Shi, X Hao, P Hao… - arxiv preprint arxiv …, 2024 - arxiv.org
The realization of universal robots is an ultimate goal of researchers. However, a key hurdle
in achieving this goal lies in the robots' ability to manipulate objects in their unstructured …