Game of drones: Intelligent online decision making of multi-uav confrontation

D Liu, Q Zong, X Zhang, R Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to the characteristics of the small size and low cost of unmanned aerial vehicles (UAVs),
Multi-UAV confrontation will play an important role in future wars. The Multi-UAV …

Pmac: Personalized multi-agent communication

X Meng, Y Tan - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Communication plays a crucial role in information sharing within the field of multi-agent
reinforcement learning (MARL). However, how to transmit information that meets individual …

AMARL: An attention-based multiagent reinforcement learning approach to the min-max multiple traveling salesmen problem

H Gao, X Zhou, X Xu, Y Lan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In recent years, the multiple traveling salesmen problem (MTSP or multiple TSP) has
received increasing research interest and one of its main applications is coordinated …

Graph attention-based reinforcement learning for trajectory design and resource assignment in multi-UAV assisted communication

Z Feng, D Wu, M Huang, C Yuen - IEEE Internet of Things …, 2024 - ieeexplore.ieee.org
In the multiple unmanned aerial vehicle (UAV)-assisted downlink communication, it is
challenging for UAV base stations (UAV BSs) to realize trajectory design and resource …

NVIF: Neighboring variational information flow for cooperative large-scale multiagent reinforcement learning

J Chai, Y Zhu, D Zhao - IEEE Transactions on Neural Networks …, 2023 - ieeexplore.ieee.org
Communication-based multiagent reinforcement learning (MARL) has shown promising
results in promoting cooperation by enabling agents to exchange information. However, the …

Multiagent trust region policy optimization

H Li, H He - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
We extend trust region policy optimization (TRPO) to cooperative multiagent reinforcement
learning (MARL) for partially observable Markov games (POMGs). We show that the policy …

Optimal tracking neuro-control of continuous stirred tank reactor systems: A dynamic event-driven approach

X Yang, Y Zhou - IEEE Transactions on Artificial Intelligence, 2023 - ieeexplore.ieee.org
It is often challenging to design an optimal tracking controller for the continuous stirred tank
reactor (CSTR) system due to its nonlinear nature and physical limitations. This article …

Optimal consensus control for continuous-time linear multiagent systems: A dynamic event-triggered approach

H Zhang, A Wang, W Ji, J Qiu… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
This article investigates the optimal consensus problem for general linear multiagent
systems (MASs) via a dynamic event-triggered approach. First, a modified interaction-related …

When to switch: planning and learning for partially observable multi-agent pathfinding

A Skrynnik, A Andreychuk, K Yakovlev… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multi-agent pathfinding (MAPF) is a problem that involves finding a set of non-conflicting
paths for a set of agents confined to a graph. In this work, we study a MAPF setting, where …

Multiagent reinforcement learning with graphical mutual information maximization

S Ding, W Du, L Ding, J Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Communication learning is an important research direction in the multiagent reinforcement
learning (MARL) domain. Graph neural networks (GNNs) can aggregate the information of …