Autonomous driving system: A comprehensive survey

J Zhao, W Zhao, B Deng, Z Wang, F Zhang… - Expert Systems with …, 2024 - Elsevier
Automation is increasingly at the forefront of transportation research, with the potential to
bring fully autonomous vehicles to our roads in the coming years. This comprehensive …

On the use of simulation in robotics: Opportunities, challenges, and suggestions for moving forward

HS Choi, C Crump, C Duriez, A Elmquist… - Proceedings of the …, 2021 - pnas.org
The last five years marked a surge in interest for and use of smart robots, which operate in
dynamic and unstructured environments and might interact with humans. We posit that well …

Deep reinforcement learning in a handful of trials using probabilistic dynamics models

K Chua, R Calandra, R McAllister… - Advances in neural …, 2018 - proceedings.neurips.cc
Abstract Model-based reinforcement learning (RL) algorithms can attain excellent sample
efficiency, but often lag behind the best model-free algorithms in terms of asymptotic …

Tossingbot: Learning to throw arbitrary objects with residual physics

A Zeng, S Song, J Lee, A Rodriguez… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
We investigate whether a robot arm can learn to pick and throw arbitrary rigid objects into
selected boxes quickly and accurately. Throwing has the potential to increase the physical …

Domain randomization for transferring deep neural networks from simulation to the real world

J Tobin, R Fong, A Ray, J Schneider… - 2017 IEEE/RSJ …, 2017 - ieeexplore.ieee.org
Bridging thereality gap'that separates simulated robotics from experiments on hardware
could accelerate robotic research through improved data availability. This paper explores …

Differentiable mpc for end-to-end planning and control

B Amos, I Jimenez, J Sacks… - Advances in neural …, 2018 - proceedings.neurips.cc
We present foundations for using Model Predictive Control (MPC) as a differentiable policy
class for reinforcement learning. This provides one way of leveraging and combining the …

Model-ensemble trust-region policy optimization

T Kurutach, I Clavera, Y Duan, A Tamar… - arxiv preprint arxiv …, 2018 - arxiv.org
Model-free reinforcement learning (RL) methods are succeeding in a growing number of
tasks, aided by recent advances in deep learning. However, they tend to suffer from high …

Survey of model-based reinforcement learning: Applications on robotics

AS Polydoros, L Nalpantidis - Journal of Intelligent & Robotic Systems, 2017 - Springer
Reinforcement learning is an appealing approach for allowing robots to learn new tasks.
Relevant literature reveals a plethora of methods, but at the same time makes clear the lack …

Learning continuous control policies by stochastic value gradients

N Heess, G Wayne, D Silver… - Advances in neural …, 2015 - proceedings.neurips.cc
We present a unified framework for learning continuous control policies
usingbackpropagation. It supports stochastic control by treating stochasticity in theBellman …

Epopt: Learning robust neural network policies using model ensembles

A Rajeswaran, S Ghotra, B Ravindran… - arxiv preprint arxiv …, 2016 - arxiv.org
Sample complexity and safety are major challenges when learning policies with
reinforcement learning for real-world tasks, especially when the policies are represented …