- Academic Search

Deep reinforcement learning for cyber security

TT Nguyen, VJ Reddi - IEEE Transactions on Neural Networks …, 2021 - ieeexplore.ieee.org

The scale of Internet-connected systems has increased considerably, and these systems are
being exposed to cyberattacks more than ever. The complexity and dynamics of …

Cited by 561 (16 versions)

Reinforcement learning-based physical cross-layer security and privacy in 6G

X Lu, L ** a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

Cited by 548 (2 versions)

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arXiv preprint arXiv …, 2022 - arxiv.org

Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

Cited by 1007 (4 versions)

Mastering visual continuous control: Improved data-augmented reinforcement learning

D Yarats, R Fergus, A Lazaric, L Pinto - arXiv preprint arXiv:2107.09645, 2021 - arxiv.org

We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …

Cited by 353 (4 versions)

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

D Yarats, I Kostrikov, R Fergus - International conference on …, 2021 - openreview.net

We propose a simple data augmentation technique that can be applied to standard
model-free reinforcement learning algorithms, enabling robust learning directly from pixels without …

Cited by 489 (6 versions)

Solving Rubik's Cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arXiv preprint arXiv …, 2019 - arxiv.org

We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …

Cited by 1274 (7 versions)

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc

Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Cited by 352 (9 versions)

Learning latent dynamics for planning from pixels

D Hafner, T Lillicrap, I Fischer… - International …, 2019 - proceedings.mlr.press

Planning has been very successful for control tasks with known environment dynamics. To
leverage planning in unknown environments, the agent needs to learn the dynamics from …

Cited by 1716 (10 versions)

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

I Kostrikov, D Yarats, R Fergus - arXiv preprint arXiv:2004.13649, 2020 - arxiv.org

We propose a simple data augmentation technique that can be applied to standard
model-free reinforcement learning algorithms, enabling robust learning directly from pixels without …

Cited by 433 (3 versions)

Distributed distributional deterministic policy gradients

Deep reinforcement learning for cyber security
