A tutorial on ultrareliable and low-latency communications in 6G: Integrating domain knowledge into deep learning
As one of the key communication scenarios in the fifth-generation and also the sixth-
generation (6G) mobile communication networks, ultrareliable and low-latency …
generation (6G) mobile communication networks, ultrareliable and low-latency …
Evolutionary dynamics of multi-agent learning: A survey
The interaction of multiple autonomous agents gives rise to highly dynamic and
nondeterministic environments, contributing to the complexity in applications such as …
nondeterministic environments, contributing to the complexity in applications such as …
Learning-based computation offloading for IoT devices with energy harvesting
Internet of Things (IoT) devices can apply mobile edge computing (MEC) and energy
harvesting (EH) to provide high-level experiences for computational intensive applications …
harvesting (EH) to provide high-level experiences for computational intensive applications …
Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach
Dynamic spectrum access (DSA) is regarded as an effective and efficient technology to
share radio spectrum among different networks. As a secondary user (SU), a DSA device …
share radio spectrum among different networks. As a secondary user (SU), a DSA device …
Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems
In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …
agents that learn by reinforcement must overcome several difficulties to manage to …
Reinforcement learning-based NOMA power allocation in the presence of smart jamming
Nonorthogonal multiple access (NOMA) systems are vulnerable to jamming attacks,
especially smart jammers who apply programmable and smart radio devices such as …
especially smart jammers who apply programmable and smart radio devices such as …
Robust multi-agent reinforcement learning with state uncertainty
In real-world multi-agent reinforcement learning (MARL) applications, agents may not have
perfect state information (eg, due to inaccurate measurement or malicious attacks), which …
perfect state information (eg, due to inaccurate measurement or malicious attacks), which …
Cloud-based malware detection game for mobile devices with offloading
As accurate malware detection on mobile devices requires fast process of a large number of
application traces, cloud-based malware detection can utilize the data sharing and powerful …
application traces, cloud-based malware detection can utilize the data sharing and powerful …
A deterministic improved Q-learning for path planning of a mobile robot
A Konar, IG Chakraborty, SJ Singh… - … on Systems, Man …, 2013 - ieeexplore.ieee.org
This paper provides a new deterministic Q-learning with a presumed knowledge about the
distance from the current state to both the next state and the goal. This knowledge is …
distance from the current state to both the next state and the goal. This knowledge is …
[PDF][PDF] Classes of multiagent q-learning dynamics with epsilon-greedy exploration
Q-learning in single-agent environments is known to converge in the limit given sufficient
exploration. The same algorithm has been applied, with some success, in multiagent …
exploration. The same algorithm has been applied, with some success, in multiagent …