A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

You only demonstrate once: Category-level manipulation from single visual demonstration

B Wen, W Lian, K Bekris, S Schaal - arxiv preprint arxiv:2201.12716, 2022 - arxiv.org
Promising results have been achieved recently in category-level manipulation that
generalizes across object instances. Nevertheless, it often requires expensive real-world …

Learning from demonstration for autonomous generation of robotic trajectory: Status quo and forward-looking overview

W Li, Y Wang, Y Liang, DT Pham - Advanced Engineering Informatics, 2024 - Elsevier
Learning from demonstration (LfD) enables robots to intuitively acquire new skills from
human demonstrations and incrementally evolve robotic intelligence. Given the significance …

[HTML][HTML] A survey of demonstration learning

A Correia, LA Alexandre - Robotics and Autonomous Systems, 2024 - Elsevier
With the fast improvement of machine learning, reinforcement learning (RL) has been used
to automate human tasks in different areas. However, training such agents is difficult and …

Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own

W Ye, Y Zhang, H Weng, X Gu, S Wang… - … Conference on Robot …, 2024 - openreview.net
Reinforcement learning (RL) is a promising approach for solving robotic manipulation tasks.
However, it is challenging to apply the RL algorithms directly in the real world. For one thing …

Opirl: Sample efficient off-policy inverse reinforcement learning via distribution matching

H Hoshino, K Ota, A Kanezaki… - … Conference on Robotics …, 2022 - ieeexplore.ieee.org
Inverse Reinforcement Learning (IRL) is attractive in scenarios where reward engineering
can be tedious. However, prior IRL algorithms use on-policy transitions, which require …

Meta-residual policy learning: Zero-trial robot skill adaptation via knowledge fusion

P Hao, T Lu, S Cui, J Wei, Y Cai… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Adapting the mastered manipulation skill to novel objects is still challenging for robots.
Recent works have attempted to endow the robot with the ability to adapt to unseen tasks by …

Uncalibrated and unmodeled image-based visual servoing of robot manipulators using zeroing neural networks

N Tan, P Yu, W Zheng - IEEE Transactions on Cybernetics, 2022 - ieeexplore.ieee.org
Neural networks have been widely investigated for the control of robot manipulators and
recurrent neural network (RNN) is accepted as a powerful tool for visual servoing. Different …

[HTML][HTML] 基于深度**化学**的机器人操作行为研究综述

陈佳盼, 郑敏华 - 机器人, 2022 - html.rhhz.net
通过梳理, 总结前人的研究, 首先对深度学**和**化学**的基本理论和算法进行介绍,
进而对深度**化学**的流行算法和在机器人操作领域的应用现状进行综述. 最后 …

Foundation reinforcement learning: towards embodied generalist agents with foundation prior assistance

W Ye, Y Zhang, M Wang, S Wang, X Gu, P Abbeel… - 2023 - openreview.net
Recently, people have shown that large-scale pre-training from diverse internet-scale data is
the key to building a generalist model, as witnessed in the natural language processing …