Deep reinforcement learning for robotics: A survey of real-world successes

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning

Y Song, A Romero, M Müller, V Koltun… - Science Robotics, 2023 - science.org
A central question in robotics is how to design a control system for an agile mobile robot.
This paper studies this question systematically, focusing on a challenging setting …

Loss of plasticity in deep continual learning

S Dohare, JF Hernandez-Garcia, Q Lan, P Rahman… - Nature, 2024 - nature.com
Artificial neural networks, deep-learning methods and the backpropagation algorithm form
the foundation of modern machine learning and artificial intelligence. These methods are …

[PDF][PDF] Introduction to reinforcement learning

D Ernst, A Louette - Feuerriegel, S., Hartmann, J., Janiesch, C., and …, 2024 - damien-ernst.be
Examples:• A predictive maintenance agent for industrial equipment that analyzes sensor
data to predict failures before they happen, scheduling maintenance only when needed and …

Actor-critic model predictive control

A Romero, Y Song… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
An open research question in robotics is how to combine the benefits of model-free
reinforcement learning (RL)—known for its strong task performance and flexibility in …

Autonomous drone racing: A survey

D Hanover, A Loquercio, L Bauersfeld… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Over the last decade, the use of autonomous drone systems for surveying, search and
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …

Gensim: Generating robotic simulation tasks via large language models

L Wang, Y Ling, Z Yuan, M Shridhar, C Bao… - arxiv preprint arxiv …, 2023 - arxiv.org
Collecting large amounts of real-world interaction data to train general robotic policies is
often prohibitively expensive, thus motivating the use of simulation data. However, existing …

A guide to artificial intelligence for cancer researchers

R Perez-Lopez, N Ghaffari Laleh, F Mahmood… - Nature Reviews …, 2024 - nature.com
Artificial intelligence (AI) has been commoditized. It has evolved from a specialty resource to
a readily accessible tool for cancer researchers. AI-based tools can boost research …

Diffusion policy policy optimization

AZ Ren, J Lidard, LL Ankile, A Simeonov… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework
including best practices for fine-tuning diffusion-based policies (eg Diffusion Policy) in …