On transforming reinforcement learning with transformers: The development trajectory

S Hu, L Shen, Y Zhang, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …

Generative ai for self-adaptive systems: State of the art and research roadmap

J Li, M Zhang, N Li, D Weyns, Z **, K Tei - ACM Transactions on …, 2024 - dl.acm.org
Self-adaptive systems (SASs) are designed to handle changes and uncertainties through a
feedback loop with four core functionalities: monitoring, analyzing, planning, and execution …

Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions

J Chen, B Ganguly, Y Xu, Y Mei, T Lan… - arxiv preprint arxiv …, 2024 - arxiv.org
Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …

Transformer in reinforcement learning for decision-making: a survey

W Yuan, J Chen, S Chen, D Feng, Z Hu, P Li… - Frontiers of Information …, 2024 - Springer
Reinforcement learning (RL) has become a dominant decision-making paradigm and has
achieved notable success in many real-world applications. Notably, deep neural networks …

RiskQ: risk-sensitive multi-agent reinforcement learning value factorization

S Shen, C Ma, C Li, W Liu, Y Fu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Multi-agent systems are characterized by environmental uncertainty, varying policies of
agents, and partial observability, which result in significant risks. In the context of Multi-Agent …

Sample-efficient multiagent reinforcement learning with reset replay

Y Yang, G Chen, HAO Jianye… - Forty-first International …, 2024 - openreview.net
The popularity of multiagent reinforcement learning (MARL) is growing rapidly with the
demand for real-world tasks that require swarm intelligence. However, a noticeable …

An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks

G Papadopoulos, A Kontogiannis… - arxiv preprint arxiv …, 2025 - arxiv.org
Multi-Agent Reinforcement Learning (MARL) has recently emerged as a significant area of
research. However, MARL evaluation often lacks systematic diversity, hindering a …

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

A Sagirova, Y Kuratov, M Burtsev - arxiv preprint arxiv:2501.13200, 2025 - arxiv.org
Multi-agent reinforcement learning (MARL) demonstrates significant progress in solving
cooperative and competitive multi-agent problems in various environments. One of the …

Boosting value decomposition via unit-wise attentive state representation for cooperative multi-agent reinforcement learning

Q Zhao, Y Zhu, Z Liu, Z Wang, C Chen - arxiv preprint arxiv:2305.07182, 2023 - arxiv.org
In cooperative multi-agent reinforcement learning (MARL), the environmental stochasticity
and uncertainties will increase exponentially when the number of agents increases, which …

[PDF][PDF] Addressing Permutation Challenges in Multi-Agent Reinforcement Learning

S Hazra, P Dasgupta, S Dey - … of the 23rd International Conference on …, 2024 - ifaamas.org
ABSTRACT In Reinforcement Learning, deep neural networks play a crucial role, especially
in Multi-Agent Systems. Owing to information from multiple sources, the challenge lies in …