Recent progress in reinforcement learning and adaptive dynamic programming for advanced control applications

D Wang, N Gao, D Liu, J Li… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) has roots in dynamic programming and it is called
adaptive/approximate dynamic programming (ADP) within the control community. This paper …

A review of current state-of-the-art control methods for lower-limb powered prostheses

R Gehlhar, M Tucker, AJ Young, AD Ames - Annual Reviews in Control, 2023 - Elsevier
Lower-limb prostheses aim to restore ambulatory function for individuals with lower-limb
amputations. While the design of lower-limb prostheses is important, this paper focuses on …

Experiment-free exoskeleton assistance via learning in simulation

S Luo, M Jiang, S Zhang, J Zhu, S Yu… - Nature, 2024 - nature.com
Exoskeletons have enormous potential to improve human locomotive performance,–.
However, their development and broad dissemination are limited by the requirement for …

Reinforcement learning for solving the vehicle routing problem

M Nazari, A Oroojlooy, L Snyder… - Advances in neural …, 2018 - proceedings.neurips.cc
We present an end-to-end framework for solving the Vehicle Routing Problem (VRP) using
reinforcement learning. In this approach, we train a single policy model that finds near …

Myoelectric control of robotic lower limb prostheses: a review of electromyography interfaces, control paradigms, challenges and future directions

A Fleming, N Stafford, S Huang, X Hu… - Journal of neural …, 2021 - iopscience.iop.org
Objective. Advanced robotic lower limb prostheses are mainly controlled autonomously.
Although the existing control can assist cyclic movements during locomotion of amputee …

Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning

J Hua, L Zeng, G Li, Z Ju - Sensors, 2021 - mdpi.com
Dexterous manipulation of the robot is an important part of realizing intelligence, but
manipulators can only perform simple tasks such as sorting and packing in a structured …

Trust region policy optimization

J Schulman, S Levine, P Abbeel… - … on machine learning, 2015 - proceedings.mlr.press
In this article, we describe a method for optimizing control policies, with guaranteed
monotonic improvement. By making several approximations to the theoretically-justified …

On human-in-the-loop optimization of human–robot interaction

P Slade, C Atkeson, JM Donelan, H Houdijk… - Nature, 2024 - nature.com
From industrial exoskeletons to implantable medical devices, robots that interact closely with
people are poised to improve every aspect of our lives. Yet designing these systems is very …

Human-in-the-loop optimization of wearable robotic devices to improve human–robot interaction: A systematic review

MA Diaz, M Voß, A Dillen, B Tassignon… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
This article presents a systematic review on wearable robotic devices that use human-in-the-
loop optimization (HILO) strategies to improve human–robot interaction. A total of 46 HILO …

Model-based reinforcement learning control of electrohydraulic position servo systems

Z Yao, X Liang, GP Jiang, J Yao - IEEE/ASME Transactions on …, 2022 - ieeexplore.ieee.org
Even though the unprecedented success of AlphaGo Zero demonstrated reinforcement
learning as a feasible complex problem solver, the research on reinforcement learning …