Google Académico

A tutorial on ultrareliable and low-latency communications in 6G: Integrating domain knowledge into deep learning

C She, C Sun, Z Gu, Y Li, C Yang… - Proceedings of the …, 2021 - ieeexplore.ieee.org

As one of the key communication scenarios in the fifth-generation and also the sixth-
generation (6G) mobile communication networks, ultrareliable and low-latency …

Guardar Citar Citado por 366 Artículos relacionados Las 6 versiones

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Evolutionary dynamics of multi-agent learning: A survey

D Bloembergen, K Tuyls, D Hennes… - Journal of Artificial …, 2015 - jair.org

The interaction of multiple autonomous agents gives rise to highly dynamic and
nondeterministic environments, contributing to the complexity in applications such as …

Guardar Citar Citado por 380 Artículos relacionados Las 9 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning-based computation offloading for IoT devices with energy harvesting

M Min, L **ao, Y Chen, P Cheng, D Wu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Internet of Things (IoT) devices can apply mobile edge computing (MEC) and energy
harvesting (EH) to provide high-level experiences for computational intensive applications …

Guardar Citar Citado por 596 Artículos relacionados Las 6 versiones

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

HH Chang, H Song, Y Yi, J Zhang… - IEEE Internet of Things …, 2018 - ieeexplore.ieee.org

Dynamic spectrum access (DSA) is regarded as an effective and efficient technology to
share radio spectrum among different networks. As a secondary user (SU), a DSA device …

Guardar Citar Citado por 235 Artículos relacionados Las 7 versiones

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org

In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …

Guardar Citar Citado por 583 Artículos relacionados Las 13 versiones

Reinforcement learning-based NOMA power allocation in the presence of smart jamming

L **ao, Y Li, C Dai, H Dai… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

Nonorthogonal multiple access (NOMA) systems are vulnerable to jamming attacks,
especially smart jammers who apply programmable and smart radio devices such as …

Guardar Citar Citado por 244 Artículos relacionados Las 3 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Robust multi-agent reinforcement learning with state uncertainty

S He, S Han, S Su, S Han, S Zou, F Miao - arxiv preprint arxiv:2307.16212, 2023 - arxiv.org

In real-world multi-agent reinforcement learning (MARL) applications, agents may not have
perfect state information (eg, due to inaccurate measurement or malicious attacks), which …

Guardar Citar Citado por 42 Artículos relacionados Las 5 versiones Versión en HTML

Cloud-based malware detection game for mobile devices with offloading

L **ao, Y Li, X Huang, XJ Du - IEEE Transactions on Mobile …, 2017 - ieeexplore.ieee.org

As accurate malware detection on mobile devices requires fast process of a large number of
application traces, cloud-based malware detection can utilize the data sharing and powerful …

Guardar Citar Citado por 219 Artículos relacionados Las 4 versiones

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

A deterministic improved Q-learning for path planning of a mobile robot

A Konar, IG Chakraborty, SJ Singh… - … on Systems, Man …, 2013 - ieeexplore.ieee.org

This paper provides a new deterministic Q-learning with a presumed knowledge about the
distance from the current state to both the next state and the goal. This knowledge is …

Guardar Citar Citado por 324 Artículos relacionados Las 8 versiones Búsqueda de bibliotecas

[Free GPT-4]
[DeepSeek]

[PDF] case.edu

[PDF][PDF] Classes of multiagent q-learning dynamics with epsilon-greedy exploration

M Wunder, ML Littman, M Babes - Proceedings of the 27th …, 2010 - engr.case.edu

Q-learning in single-agent environments is known to converge in the limit given sufficient
exploration. The same algorithm has been applied, with some success, in multiagent …

Guardar Citar Citado por 239 Artículos relacionados Las 10 versiones Versión en HTML

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Dynamic analysis of multiagent Q-learning with ε-greedy exploration

A tutorial on ultrareliable and low-latency communications in 6G: Integrating domain knowledge into deep learning

Evolutionary dynamics of multi-agent learning: A survey

Learning-based computation offloading for IoT devices with energy harvesting

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

Reinforcement learning-based NOMA power allocation in the presence of smart jamming

Robust multi-agent reinforcement learning with state uncertainty

Cloud-based malware detection game for mobile devices with offloading

A deterministic improved Q-learning for path planning of a mobile robot

[PDF][PDF] Classes of multiagent q-learning dynamics with epsilon-greedy exploration