Μελετητής Google

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer

Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 395 Σχετικά άρθρα Όλες οι 21 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Scalar reward is not enough: A response to silver, singh, precup and sutton (2021)

P Vamplew, BJ Smith, J Källström, G Ramos… - Autonomous Agents and …, 2022 - Springer

The recent paper “Reward is Enough” by Silver, Singh, Precup and Sutton posits that the
concept of reward maximisation is sufficient to underpin all intelligence, both natural and …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 92 Σχετικά άρθρα Όλες οι 17 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Explainable reinforcement learning for broad-xai: a conceptual framework and survey

R Dazeley, P Vamplew, F Cruz - Neural Computing and Applications, 2023 - Springer

Broad-XAI moves away from interpreting individual decisions based on a single datum and
aims to provide integrated explanations from multiple machine learning algorithms into a …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 67 Σχετικά άρθρα Όλες οι 9 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Self-organizing maps for storage and transfer of knowledge in reinforcement learning

T George Karimpanal, R Bouffanais - Adaptive Behavior, 2019 - journals.sagepub.com

The idea of reusing or transferring information from previously learned tasks (source tasks)
for the learning of new tasks (target tasks) has the potential to significantly improve the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 100 Σχετικά άρθρα Όλες οι 9 εκδοχές

Multi-objective deep reinforcement learning for emergency scheduling in a water distribution network

C Hu, Q Wang, W Gong, X Yan - Memetic Computing, 2022 - Springer

In recent years, water contamination incidents have happened frequently, causing serious
losses and impacts on society. Therefore, how to quickly respond to emergency pollution …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 20 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] cispa.saarland

Detection of threats to IoT devices using scalable VPN-forwarded honeypots

A Tambe, YL Aung, R Sridharan, M Ochoa… - Proceedings of the …, 2019 - dl.acm.org

Attacks on Internet of Things (IoT) devices, exploiting inherent vulnerabilities, have
intensified over the last few years. Recent large-scale attacks, such as Persirai, Hakai, etc …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 45 Σχετικά άρθρα Όλες οι 15 εκδοχές

An application of multi-objective reinforcement learning for efficient model-free control of canals deployed with IoT networks

T Ren, J Niu, J Cui, Z Ouyang, X Liu - Journal of Network and Computer …, 2021 - Elsevier

Canals have been widely constructed to deliver water from rich areas to poor areas to ease
water shortages. Efficient controlling of canals is essential for high-performance water …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 17 Σχετικά άρθρα

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Goal-conditioned offline reinforcement learning through state space partitioning

M Wang, Y **, G Montana - Machine Learning, 2024 - Springer

Offline reinforcement learning (RL) aims to create policies for sequential decision-making
using exclusively offline datasets. This presents a significant challenge, especially when …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 5 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

Experience replay using transition sequences

TG Karimpanal, R Bouffanais - Frontiers in neurorobotics, 2018 - frontiersin.org

Experience replay is one of the most commonly used approaches to improve the sample
efficiency of reinforcement learning algorithms. In this work, we propose an approach to …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 20 Σχετικά άρθρα Όλες οι 13 εκδοχές Προσωρινά αποθηκευμένη

Scalable VPN-forwarded honeypots: Dataset and threat intelligence insights

YL Aung, HH Tiang, H Wijaya, M Ochoa… - Sixth Annual Industrial …, 2020 - dl.acm.org

After distributed denial-of-service attacks by the Mirai malware in 2016, large-scale attacks
exploiting IoT devices raise significant security concerns for the stakeholders involved. The …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 8 Σχετικά άρθρα

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

A practical guide to multi-objective reinforcement learning and planning

Scalar reward is not enough: A response to silver, singh, precup and sutton (2021)

Explainable reinforcement learning for broad-xai: a conceptual framework and survey

Self-organizing maps for storage and transfer of knowledge in reinforcement learning

Multi-objective deep reinforcement learning for emergency scheduling in a water distribution network

Detection of threats to IoT devices using scalable VPN-forwarded honeypots

An application of multi-objective reinforcement learning for efficient model-free control of canals deployed with IoT networks

Goal-conditioned offline reinforcement learning through state space partitioning

Experience replay using transition sequences

Scalable VPN-forwarded honeypots: Dataset and threat intelligence insights