Resilient multi-agent reinforcement learning with adversarial value decomposition

T Phan, L Belzner, T Gabor, A Sedlmeier… - Proceedings of the …, 2021 - ojs.aaai.org
We focus on resilience in cooperative multi-agent systems, where agents can change their
behavior due to udpates or failures of hardware and software components. Current state-of …

Vast: Value function factorization with variable agent sub-teams

T Phan, F Ritz, L Belzner, P Altmann… - Advances in …, 2021 - proceedings.neurips.cc
Value function factorization (VFF) is a popular approach to cooperative multi-agent
reinforcement learning in order to learn local value functions from global rewards. However …

What happened next? Using deep learning to value defensive actions in football event-data

C Merhej, RJ Beal, T Matthews… - Proceedings of the 27th …, 2021 - dl.acm.org
Objectively quantifying the value of player actions in football (soccer) is a challenging
problem. To date, studies in football analytics have mainly focused on the attacking side of …

[PDF][PDF] Learning and testing resilience in cooperative multi-agent systems

T Phan, T Gabor, A Sedlmeier, F Ritz… - Proceedings of the 19th …, 2020 - ifaamas.org
State-of-the-art multi-agent reinforcement learning has achieved remarkable success in
recent years. The success has been mainly based on the assumption that all teammates …

Learning to Participate through Trading of Reward Shares

M Kölle, T Matheis, P Altmann, K Schmid - arxiv preprint arxiv:2301.07416, 2023 - arxiv.org
Enabling autonomous agents to act cooperatively is an important step to integrate artificial
intelligence in our daily lives. While some methods seek to stimulate cooperation by letting …

Memory bounded open-loop planning in large pomdps using thompson sampling

T Phan, L Belzner, M Kiermeier, M Friedrich… - Proceedings of the …, 2019 - ojs.aaai.org
State-of-the-art approaches to partially observable planning like POMCP are based on
stochastic tree search. While these approaches are computationally efficient, they may still …

Model-free distributed reinforcement learning state estimation of a dynamical system using integral value functions

B Salamat, G Elsbacher, AM Tonello… - IEEE Open Journal of …, 2023 - ieeexplore.ieee.org
One of the challenging problems in sensor network systems is to estimate and track the state
of a target point mass with unknown dynamics. Recent improvements in deep learning (DL) …

Distributed policy iteration for scalable approximation of cooperative multi-agent policies

T Phan, K Schmid, L Belzner, T Gabor, S Feld… - arxiv preprint arxiv …, 2019 - arxiv.org
Decision making in multi-agent systems (MAS) is a great challenge due to enormous state
and joint action spaces as well as uncertainty, making centralized control generally …

SAT-MARL: Specification aware training in multi-agent reinforcement learning

F Ritz, T Phan, R Müller, T Gabor, A Sedlmeier… - arxiv preprint arxiv …, 2020 - arxiv.org
A characteristic of reinforcement learning is the ability to develop unforeseen strategies
when solving problems. While such strategies sometimes yield superior performance, they …

Specification aware multi-agent reinforcement learning

F Ritz, T Phan, R Müller, T Gabor, A Sedlmeier… - … Conference on Agents …, 2021 - Springer
Engineering intelligent industrial systems is challenging due to high complexity and
uncertainty with respect to domain dynamics and multiple agents. If industrial systems act …