- Academic Search

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org

Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Zapisz Cytuj Cytowane przez 23 Powiązane artykuły Wszystkie wersje 3

[Free GPT-4]
[DeepSeek]

[HTML] springer.com

[HTML][HTML] Deep Learning applications for COVID-19

C Shorten, TM Khoshgoftaar, B Furht - Journal of big Data, 2021 - Springer

This survey explores how Deep Learning has battled the COVID-19 pandemic and provides
directions for future research on COVID-19. We cover Deep Learning applications in Natural …

Zapisz Cytuj Cytowane przez 388 Powiązane artykuły Wszystkie wersje 15

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Zapisz Cytuj Cytowane przez 230 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Human-to-robot imitation in the wild

S Bahl, A Gupta, D Pathak - arxiv preprint arxiv:2207.09450, 2022 - arxiv.org

We approach the problem of learning by watching humans in the wild. While traditional
approaches in Imitation and Reinforcement Learning are promising for learning in the real …

Zapisz Cytuj Cytowane przez 142 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Few-shot preference learning for human-in-the-loop rl

DJ Hejna III, D Sadigh - Conference on Robot Learning, 2023 - proceedings.mlr.press

While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …

Zapisz Cytuj Cytowane przez 92 Powiązane artykuły Wszystkie wersje 6 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Inverse preference learning: Preference-based rl without a reward function

J Hejna, D Sadigh - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Reward functions are difficult to design and often hard to align with human intent. Preference-
based Reinforcement Learning (RL) algorithms address these problems by learning reward …

Zapisz Cytuj Cytowane przez 47 Powiązane artykuły Wszystkie wersje 9 Wersja HTML

[Free GPT-4]
[DeepSeek]

[HTML] informs.org

Global optimality guarantees for policy gradient methods

J Bhandari, D Russo - Operations Research, 2024 - pubsonline.informs.org

Policy gradients methods apply to complex, poorly understood, control problems by
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …

Zapisz Cytuj Cytowane przez 287 Powiązane artykuły Wszystkie wersje 7

[Free GPT-4]
[DeepSeek]

[PDF] mdpi.com

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com

Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Zapisz Cytuj Cytowane przez 113 Powiązane artykuły Wszystkie wersje 8 Kopia

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Genloco: Generalized locomotion controllers for quadrupedal robots

G Feng, H Zhang, Z Li, XB Peng… - … on Robot Learning, 2023 - proceedings.mlr.press

Recent years have seen a surge in commercially-available and affordable quadrupedal
robots, with many of these platforms being actively used in research and industry. As the …

Zapisz Cytuj Cytowane przez 57 Powiązane artykuły Wszystkie wersje 9 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] cranfield.ac.uk

Model-free reinforcement learning from expert demonstrations: a survey

J Ramírez, W Yu, A Perrusquía - Artificial Intelligence Review, 2022 - Springer

Reinforcement learning from expert demonstrations (RLED) is the intersection of imitation
learning with reinforcement learning that seeks to take advantage of these two learning …

Zapisz Cytuj Cytowane przez 100 Powiązane artykuły Wszystkie wersje 5

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

The ingredients of real-world robotic reinforcement learning

Deep reinforcement learning for robotics: A survey of real-world successes

[HTML][HTML] Deep Learning applications for COVID-19

Ai alignment: A comprehensive survey

Human-to-robot imitation in the wild

Few-shot preference learning for human-in-the-loop rl

Inverse preference learning: Preference-based rl without a reward function

Global optimality guarantees for policy gradient methods

Hierarchical reinforcement learning: A survey and open research challenges

Genloco: Generalized locomotion controllers for quadrupedal robots

Model-free reinforcement learning from expert demonstrations: a survey