Deep reinforcement learning for robotics: A survey of real-world successes
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
Ai alignment: A comprehensive survey
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
Reaching the limit in autonomous racing: Optimal control versus reinforcement learning
A central question in robotics is how to design a control system for an agile mobile robot.
This paper studies this question systematically, focusing on a challenging setting …
This paper studies this question systematically, focusing on a challenging setting …
Loss of plasticity in deep continual learning
Artificial neural networks, deep-learning methods and the backpropagation algorithm form
the foundation of modern machine learning and artificial intelligence. These methods are …
the foundation of modern machine learning and artificial intelligence. These methods are …
[PDF][PDF] Introduction to reinforcement learning
D Ernst, A Louette - Feuerriegel, S., Hartmann, J., Janiesch, C., and …, 2024 - damien-ernst.be
Examples:• A predictive maintenance agent for industrial equipment that analyzes sensor
data to predict failures before they happen, scheduling maintenance only when needed and …
data to predict failures before they happen, scheduling maintenance only when needed and …
Actor-critic model predictive control
A Romero, Y Song… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
An open research question in robotics is how to combine the benefits of model-free
reinforcement learning (RL)—known for its strong task performance and flexibility in …
reinforcement learning (RL)—known for its strong task performance and flexibility in …
Autonomous drone racing: A survey
Over the last decade, the use of autonomous drone systems for surveying, search and
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …
Gensim: Generating robotic simulation tasks via large language models
Collecting large amounts of real-world interaction data to train general robotic policies is
often prohibitively expensive, thus motivating the use of simulation data. However, existing …
often prohibitively expensive, thus motivating the use of simulation data. However, existing …
A guide to artificial intelligence for cancer researchers
Artificial intelligence (AI) has been commoditized. It has evolved from a specialty resource to
a readily accessible tool for cancer researchers. AI-based tools can boost research …
a readily accessible tool for cancer researchers. AI-based tools can boost research …
Diffusion policy policy optimization
We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework
including best practices for fine-tuning diffusion-based policies (eg Diffusion Policy) in …
including best practices for fine-tuning diffusion-based policies (eg Diffusion Policy) in …