Transferring policy of deep reinforcement learning from simulation to reality for robotics

H Ju, R Juan, R Gomez, K Nakamura… - Nature Machine …, 2022 - nature.com
Deep reinforcement learning has achieved great success in many fields and has shown
promise in learning robust skills for robot control in recent years. However, sampling …

Fusion dynamical systems with machine learning in imitation learning: A comprehensive overview

Y Hu, FJ Abu-Dakka, F Chen, X Luo, Z Li, A Knoll… - Information …, 2024 - Elsevier
Imitation Learning (IL), also referred to as Learning from Demonstration (LfD), holds
significant promise for capturing expert motor skills through efficient imitation, facilitating …

Socially compliant navigation dataset (scand): A large-scale dataset of demonstrations for social navigation

H Karnan, A Nair, X **ao, G Warnell… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Social navigation is the capability of an autonomous agent, such as a robot, to navigate in a
“socially compliant” manner in the presence of other intelligent agents such as humans. With …

Prompt to transfer: Sim-to-real transfer for traffic signal control with prompt learning

L Da, M Gao, H Mei, H Wei - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Numerous methods are proposed for the Traffic Signal Control (TSC) tasks aiming to provide
efficient transportation and mitigate congestion waste. In recent, promising results have …

Mahalo: Unifying offline reinforcement learning and imitation learning from observations

A Li, B Boots, CA Cheng - International Conference on …, 2023 - proceedings.mlr.press
We study a new paradigm for sequential decision making, called offline policy learning from
observations (PLfO). Offline PLfO aims to learn policies using datasets with substandard …

Simgan: Hybrid simulator identification for domain adaptation via adversarial reinforcement learning

Y Jiang, T Zhang, D Ho, Y Bai, CK Liu… - … on Robotics and …, 2021 - ieeexplore.ieee.org
As learning-based approaches progress towards automating robot controllers design,
transferring learned policies to new domains with different dynamics (eg sim-to-real transfer) …

Causal navigation by continuous-time neural networks

C Vorbach, R Hasani, A Amini… - Advances in Neural …, 2021 - proceedings.neurips.cc
Imitation learning enables high-fidelity, vision-based learning of policies within rich,
photorealistic environments. However, such techniques often rely on traditional discrete-time …

Cross-domain policy adaptation via value-guided data filtering

K Xu, C Bai, X Ma, D Wang, B Zhao… - Advances in …, 2023 - proceedings.neurips.cc
Generalizing policies across different domains with dynamics mismatch poses a significant
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …

Vi-ikd: High-speed accurate off-road navigation using learned visual-inertial inverse kinodynamics

H Karnan, KS Sikand, P Atreya… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
One of the key challenges in high-speed off-road navigation on ground vehicles is that the
kinodynamics of the vehicle-terrain interaction can differ dramatically depending on the …

Mobile: Model-based imitation learning from observation alone

R Kidambi, J Chang, W Sun - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract This paper studies Imitation Learning from Observations alone (ILFO) where the
learner is presented with expert demonstrations that consist only of states visited by an …