Μελετητής Google

M Ahsan, KE Nygard, R Gomes… - … of Cybersecurity and …, 2022 - mdpi.com

Machine learning is of rising importance in cybersecurity. The primary objective of applying
machine learning in cybersecurity is to make the process of malware detection more …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 173 Σχετικά άρθρα Όλες οι 9 εκδοχές Προσωρινά αποθηκευμένη

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reconfigurable intelligent surfaces: Principles and opportunities

Y Liu, X Liu, X Mu, T Hou, J Xu… - … surveys & tutorials, 2021 - ieeexplore.ieee.org

Reconfigurable intelligent surfaces (RISs), also known as intelligent reflecting surfaces
(IRSs), or large intelligent surfaces (LISs), 1 have received significant attention for their …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 1428 Σχετικά άρθρα Όλες οι 7 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Discovering faster matrix multiplication algorithms with reinforcement learning

A Fawzi, M Balog, A Huang, T Hubert… - Nature, 2022 - nature.com

Improving the efficiency of algorithms for fundamental computations can have a widespread
impact, as it can affect the overall speed of a large amount of computations. Matrix …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 676 Σχετικά άρθρα Όλες οι 11 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Offline reinforcement learning with implicit q-learning

I Kostrikov, A Nair, S Levine - arxiv preprint arxiv:2110.06169, 2021 - arxiv.org

Offline reinforcement learning requires reconciling two conflicting aims: learning a policy that
improves over the behavior policy that collected the dataset, while at the same time …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 887 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 746 Σχετικά άρθρα Όλες οι 7 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 1803 Σχετικά άρθρα Όλες οι 13 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Idql: Implicit q-learning as an actor-critic method with diffusion policies

P Hansen-Estruch, I Kostrikov, M Janner… - arxiv preprint arxiv …, 2023 - arxiv.org

Effective offline RL methods require properly handling out-of-distribution actions. Implicit Q-
learning (IQL) addresses this by training a Q-function using only dataset actions through a …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 122 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arxiv preprint arxiv:2010.02193, 2020 - arxiv.org

Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 951 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Conservative q-learning for offline reinforcement learning

A Kumar, A Zhou, G Tucker… - Advances in neural …, 2020 - proceedings.neurips.cc

Effectively leveraging large, previously collected datasets in reinforcement learn-ing (RL) is
a key challenge for large-scale real-world applications. Offline RL algorithms promise to …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 2082 Σχετικά άρθρα Όλες οι 10 εκδοχές Προβολή ως HTML

Autonomous navigation of stratospheric balloons using reinforcement learning

MG Bellemare, S Candido, PS Castro, J Gong… - Nature, 2020 - nature.com

Efficiently navigating a superpressure balloon in the stratosphere requires the integration of
a multitude of cues, such as wind speed and solar elevation, and the process is complicated …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 437 Σχετικά άρθρα Όλες οι 6 εκδοχές

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Distributional reinforcement learning with quantile regression

[HTML][HTML] Cybersecurity threats and their mitigation approaches using Machine Learning—A Review

Reconfigurable intelligent surfaces: Principles and opportunities

Discovering faster matrix multiplication algorithms with reinforcement learning

Offline reinforcement learning with implicit q-learning

Deep reinforcement learning at the edge of the statistical precipice

Decision transformer: Reinforcement learning via sequence modeling

Idql: Implicit q-learning as an actor-critic method with diffusion policies

Mastering atari with discrete world models

Conservative q-learning for offline reinforcement learning

Autonomous navigation of stratospheric balloons using reinforcement learning