Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

C Wang, X Zhang, Z Yang, M Bashir… - Frontiers in Marine …, 2023 - frontiersin.org
Reinforcement learning (RL) has shown superior performance in solving sequential
decision problems. In recent years, RL is gradually being used to solve unmanned driving …

Task allocation of multiple unmanned aerial vehicles based on deep transfer reinforcement learning

Y Yin, Y Guo, Q Su, Z Wang - Drones, 2022 - mdpi.com
With the development of UAV technology, the task allocation problem of multiple UAVs is
remarkable, but most of these existing heuristic methods are easy to fall into the problem of …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Diagnostic accuracy and potential covariates of artificial intelligence for diagnosing orthopedic fractures: a systematic literature review and meta-analysis

X Zhang, Y Yang, YW Shen, KR Zhang, Z Jiang… - European …, 2022 - Springer
Objectives To systematically quantify the diagnostic accuracy and identify potential
covariates affecting the performance of artificial intelligence (AI) in diagnosing orthopedic …

Using fuzzy logic to learn abstract policies in large-scale multiagent reinforcement learning

J Li, H Shi, KS Hwang - IEEE Transactions on Fuzzy Systems, 2022 - ieeexplore.ieee.org
Large-scale multiagent reinforcement learning requires huge computation and space costs,
and the too-long execution process makes it hard to train policies for agents. This work …

Autonomous air combat decision‐making of UAV based on parallel self‐play reinforcement learning

B Li, J Huang, S Bai, Z Gan, S Liang… - CAAI Transactions …, 2023 - Wiley Online Library
Aiming at addressing the problem of manoeuvring decision‐making in UAV air combat, this
study establishes a one‐to‐one air combat model, defines missile attack areas, and uses the …

Manoeuvre decision‐making of unmanned aerial vehicles in air combat based on an expert actor‐based soft actor critic algorithm

B Li, S Bai, S Liang, R Ma, E Neretin… - CAAI Transactions on …, 2023 - Wiley Online Library
The demand for autonomous motion control of unmanned aerial vehicles in air combat is
boosted as taking the initiative in combat appears more and more crucial. Unmanned aerial …

Safe adaptive policy transfer reinforcement learning for distributed multiagent control

B Du, W **e, Y Li, Q Yang, W Zhang… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Multiagent reinforcement learning (RL) training is usually difficult and time-consuming due to
mutual interference among agents. Safety concerns make an already difficult training …

A Multiagent Cooperative Learning System With Evolution of Social Roles

Y Hou, M Sun, Y Zeng, YS Ong, Y **… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Recent developments in reinforcement learning (RL) have been able to derive optimal
policies for sophisticated and capable agents, and shown to achieve human-level …

Fast transfer learning method using random layer freezing and feature refinement strategy

W Zhang, Y Yang, T Akilan, QMJ Wu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recently, Moore-Penrose inverse (MPI)-based parameter fine-tuning of fully connected (FC)
layers in pretrained deep convolutional neural networks (DCNNs) has emerged within the …