- Academic Search

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org

Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Simpan Kutip Dirujuk 23 kali Artikel terkait 3 versi

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Simpan Kutip Dirujuk 228 kali Artikel terkait 3 versi Versi HTML

[Free GPT-4]
[DeepSeek]

[PDF] science.org

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning

Y Song, A Romero, M Müller, V Koltun… - Science Robotics, 2023 - science.org

A central question in robotics is how to design a control system for an agile mobile robot.
This paper studies this question systematically, focusing on a challenging setting …

Simpan Kutip Dirujuk 148 kali Artikel terkait 9 versi

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Loss of plasticity in deep continual learning

S Dohare, JF Hernandez-Garcia, Q Lan, P Rahman… - Nature, 2024 - nature.com

Artificial neural networks, deep-learning methods and the backpropagation algorithm form
the foundation of modern machine learning and artificial intelligence. These methods are …

Simpan Kutip Dirujuk 51 kali Artikel terkait 2 versi

[Free GPT-4]
[DeepSeek]

[PDF] damien-ernst.be

[PDF][PDF] Introduction to reinforcement learning

D Ernst, A Louette - Feuerriegel, S., Hartmann, J., Janiesch, C., and …, 2024 - damien-ernst.be

Examples:• A predictive maintenance agent for industrial equipment that analyzes sensor
data to predict failures before they happen, scheduling maintenance only when needed and …

Simpan Kutip Dirujuk 6074 kali Artikel terkait 7 versi Pencarian Perpustakaan Versi HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Actor-critic model predictive control

A Romero, Y Song… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

An open research question in robotics is how to combine the benefits of model-free
reinforcement learning (RL)—known for its strong task performance and flexibility in …

Simpan Kutip Dirujuk 42 kali Artikel terkait 5 versi

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Autonomous drone racing: A survey

D Hanover, A Loquercio, L Bauersfeld… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

Over the last decade, the use of autonomous drone systems for surveying, search and
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …

Simpan Kutip Dirujuk 71 kali Artikel terkait 6 versi

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Gensim: Generating robotic simulation tasks via large language models

L Wang, Y Ling, Z Yuan, M Shridhar, C Bao… - arxiv preprint arxiv …, 2023 - arxiv.org

Collecting large amounts of real-world interaction data to train general robotic policies is
often prohibitively expensive, thus motivating the use of simulation data. However, existing …

Simpan Kutip Dirujuk 63 kali Artikel terkait 5 versi Versi HTML

A guide to artificial intelligence for cancer researchers

R Perez-Lopez, N Ghaffari Laleh, F Mahmood… - Nature Reviews …, 2024 - nature.com

Artificial intelligence (AI) has been commoditized. It has evolved from a specialty resource to
a readily accessible tool for cancer researchers. AI-based tools can boost research …

Simpan Kutip Dirujuk 63 kali Artikel terkait 3 versi

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Diffusion policy policy optimization

AZ Ren, J Lidard, LL Ankile, A Simeonov… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework
including best practices for fine-tuning diffusion-based policies (eg Diffusion Policy) in …

Simpan Kutip Dirujuk 18 kali Artikel terkait 4 versi Versi HTML

Buat notifikasi

Kutip

Penelusuran lanjutan

Disimpan ke Koleksi saya

Champion-level drone racing using deep reinforcement learning

Deep reinforcement learning for robotics: A survey of real-world successes

Ai alignment: A comprehensive survey

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning

Loss of plasticity in deep continual learning

[PDF][PDF] Introduction to reinforcement learning

Actor-critic model predictive control

Autonomous drone racing: A survey

Gensim: Generating robotic simulation tasks via large language models

A guide to artificial intelligence for cancer researchers

Diffusion policy policy optimization