Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Transferring policy of deep reinforcement learning from simulation to reality for robotics
H Ju, R Juan, R Gomez, K Nakamura… - Nature Machine …, 2022 - nature.com
Deep reinforcement learning has achieved great success in many fields and has shown
promise in learning robust skills for robot control in recent years. However, sampling …
promise in learning robust skills for robot control in recent years. However, sampling …
Fusion dynamical systems with machine learning in imitation learning: A comprehensive overview
Imitation Learning (IL), also referred to as Learning from Demonstration (LfD), holds
significant promise for capturing expert motor skills through efficient imitation, facilitating …
significant promise for capturing expert motor skills through efficient imitation, facilitating …
Socially compliant navigation dataset (scand): A large-scale dataset of demonstrations for social navigation
Social navigation is the capability of an autonomous agent, such as a robot, to navigate in a
“socially compliant” manner in the presence of other intelligent agents such as humans. With …
“socially compliant” manner in the presence of other intelligent agents such as humans. With …
Prompt to transfer: Sim-to-real transfer for traffic signal control with prompt learning
Numerous methods are proposed for the Traffic Signal Control (TSC) tasks aiming to provide
efficient transportation and mitigate congestion waste. In recent, promising results have …
efficient transportation and mitigate congestion waste. In recent, promising results have …
Mahalo: Unifying offline reinforcement learning and imitation learning from observations
We study a new paradigm for sequential decision making, called offline policy learning from
observations (PLfO). Offline PLfO aims to learn policies using datasets with substandard …
observations (PLfO). Offline PLfO aims to learn policies using datasets with substandard …
Simgan: Hybrid simulator identification for domain adaptation via adversarial reinforcement learning
As learning-based approaches progress towards automating robot controllers design,
transferring learned policies to new domains with different dynamics (eg sim-to-real transfer) …
transferring learned policies to new domains with different dynamics (eg sim-to-real transfer) …
Causal navigation by continuous-time neural networks
Imitation learning enables high-fidelity, vision-based learning of policies within rich,
photorealistic environments. However, such techniques often rely on traditional discrete-time …
photorealistic environments. However, such techniques often rely on traditional discrete-time …
Cross-domain policy adaptation via value-guided data filtering
Generalizing policies across different domains with dynamics mismatch poses a significant
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …
Vi-ikd: High-speed accurate off-road navigation using learned visual-inertial inverse kinodynamics
One of the key challenges in high-speed off-road navigation on ground vehicles is that the
kinodynamics of the vehicle-terrain interaction can differ dramatically depending on the …
kinodynamics of the vehicle-terrain interaction can differ dramatically depending on the …
Mobile: Model-based imitation learning from observation alone
Abstract This paper studies Imitation Learning from Observations alone (ILFO) where the
learner is presented with expert demonstrations that consist only of states visited by an …
learner is presented with expert demonstrations that consist only of states visited by an …