A tutorial on ultrareliable and low-latency communications in 6G: Integrating domain knowledge into deep learning

C She, C Sun, Z Gu, Y Li, C Yang… - Proceedings of the …, 2021 - ieeexplore.ieee.org
As one of the key communication scenarios in the fifth-generation and also the sixth-
generation (6G) mobile communication networks, ultrareliable and low-latency …

Evolutionary dynamics of multi-agent learning: A survey

D Bloembergen, K Tuyls, D Hennes… - Journal of Artificial …, 2015 - jair.org
The interaction of multiple autonomous agents gives rise to highly dynamic and
nondeterministic environments, contributing to the complexity in applications such as …

Learning-based computation offloading for IoT devices with energy harvesting

M Min, L **ao, Y Chen, P Cheng, D Wu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Internet of Things (IoT) devices can apply mobile edge computing (MEC) and energy
harvesting (EH) to provide high-level experiences for computational intensive applications …

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

HH Chang, H Song, Y Yi, J Zhang… - IEEE Internet of Things …, 2018 - ieeexplore.ieee.org
Dynamic spectrum access (DSA) is regarded as an effective and efficient technology to
share radio spectrum among different networks. As a secondary user (SU), a DSA device …

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org
In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …

Reinforcement learning-based NOMA power allocation in the presence of smart jamming

L **ao, Y Li, C Dai, H Dai… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Nonorthogonal multiple access (NOMA) systems are vulnerable to jamming attacks,
especially smart jammers who apply programmable and smart radio devices such as …

Robust multi-agent reinforcement learning with state uncertainty

S He, S Han, S Su, S Han, S Zou, F Miao - arxiv preprint arxiv:2307.16212, 2023 - arxiv.org
In real-world multi-agent reinforcement learning (MARL) applications, agents may not have
perfect state information (eg, due to inaccurate measurement or malicious attacks), which …

Cloud-based malware detection game for mobile devices with offloading

L **ao, Y Li, X Huang, XJ Du - IEEE Transactions on Mobile …, 2017 - ieeexplore.ieee.org
As accurate malware detection on mobile devices requires fast process of a large number of
application traces, cloud-based malware detection can utilize the data sharing and powerful …

A deterministic improved Q-learning for path planning of a mobile robot

A Konar, IG Chakraborty, SJ Singh… - … on Systems, Man …, 2013 - ieeexplore.ieee.org
This paper provides a new deterministic Q-learning with a presumed knowledge about the
distance from the current state to both the next state and the goal. This knowledge is …

[PDF][PDF] Classes of multiagent q-learning dynamics with epsilon-greedy exploration

M Wunder, ML Littman, M Babes - Proceedings of the 27th …, 2010 - engr.case.edu
Q-learning in single-agent environments is known to converge in the limit given sufficient
exploration. The same algorithm has been applied, with some success, in multiagent …