Reinforcement learning algorithms and applications in healthcare and robotics: a comprehensive and systematic review

MNA Al-Hamadani, MA Fadhel, L Alzubaidi, B Harangi - Sensors, 2024 - mdpi.com
Reinforcement learning (RL) has emerged as a dynamic and transformative paradigm in
artificial intelligence, offering the promise of intelligent decision-making in complex and …

Learning fine-grained bimanual manipulation with low-cost hardware

TZ Zhao, V Kumar, S Levine, C Finn - arxiv preprint arxiv:2304.13705, 2023 - arxiv.org
Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously
difficult for robots because they require precision, careful coordination of contact forces, and …

Eureka: Human-level reward design via coding large language models

YJ Ma, W Liang, G Wang, DA Huang, O Bastani… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have excelled as high-level semantic planners for
sequential decision-making tasks. However, harnessing them to learn complex low-level …

Multi-agent reinforcement learning is a sequence modeling problem

M Wen, J Kuba, R Lin, W Zhang… - Advances in …, 2022 - proceedings.neurips.cc
Large sequence models (SM) such as GPT series and BERT have displayed outstanding
performance and generalization capabilities in natural language process, vision and …

Safety gymnasium: A unified safe reinforcement learning benchmark

J Ji, B Zhang, J Zhou, X Pan… - Advances in …, 2023 - proceedings.neurips.cc
Artificial intelligence (AI) systems possess significant potential to drive societal progress.
However, their deployment often faces obstacles due to substantial safety concerns. Safe …

ARCTIC: A dataset for dexterous bimanual hand-object manipulation

Z Fan, O Taheri, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans intuitively understand that inanimate objects do not move by themselves, but that
state changes are typically caused by human manipulation (eg, the opening of a book). This …

3d diffusion policy: Generalizable visuomotor policy learning via simple 3d representations

Y Ze, G Zhang, K Zhang, C Hu, M Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Imitation learning provides an efficient way to teach robots dexterous skills; however,
learning complex skills robustly and generalizablely usually consumes large amounts of …

Toward general-purpose robots via foundation models: A survey and meta-analysis

Y Hu, Q **e, V Jain, J Francis, J Patrikar… - arxiv preprint arxiv …, 2023 - arxiv.org
Building general-purpose robots that operate seamlessly in any environment, with any
object, and utilizing various skills to complete diverse tasks has been a long-standing goal in …

Heterogeneous-agent reinforcement learning

Y Zhong, JG Kuba, X Feng, S Hu, J Ji, Y Yang - Journal of Machine …, 2024 - jmlr.org
The necessity for cooperation among intelligent machines has popularised cooperative multi-
agent reinforcement learning (MARL) in AI research. However, many research endeavours …

Aloha unleashed: A simple recipe for robot dexterity

TZ Zhao, J Tompson, D Driess, P Florence… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent work has shown promising results for learning end-to-end robot policies using
imitation learning. In this work we address the question of how far can we push imitation …