Resilient multi-agent reinforcement learning with adversarial value decomposition
We focus on resilience in cooperative multi-agent systems, where agents can change their
behavior due to udpates or failures of hardware and software components. Current state-of …
behavior due to udpates or failures of hardware and software components. Current state-of …
Vast: Value function factorization with variable agent sub-teams
Value function factorization (VFF) is a popular approach to cooperative multi-agent
reinforcement learning in order to learn local value functions from global rewards. However …
reinforcement learning in order to learn local value functions from global rewards. However …
What happened next? Using deep learning to value defensive actions in football event-data
Objectively quantifying the value of player actions in football (soccer) is a challenging
problem. To date, studies in football analytics have mainly focused on the attacking side of …
problem. To date, studies in football analytics have mainly focused on the attacking side of …
[PDF][PDF] Learning and testing resilience in cooperative multi-agent systems
State-of-the-art multi-agent reinforcement learning has achieved remarkable success in
recent years. The success has been mainly based on the assumption that all teammates …
recent years. The success has been mainly based on the assumption that all teammates …
Learning to Participate through Trading of Reward Shares
Enabling autonomous agents to act cooperatively is an important step to integrate artificial
intelligence in our daily lives. While some methods seek to stimulate cooperation by letting …
intelligence in our daily lives. While some methods seek to stimulate cooperation by letting …
Memory bounded open-loop planning in large pomdps using thompson sampling
State-of-the-art approaches to partially observable planning like POMCP are based on
stochastic tree search. While these approaches are computationally efficient, they may still …
stochastic tree search. While these approaches are computationally efficient, they may still …
Model-free distributed reinforcement learning state estimation of a dynamical system using integral value functions
One of the challenging problems in sensor network systems is to estimate and track the state
of a target point mass with unknown dynamics. Recent improvements in deep learning (DL) …
of a target point mass with unknown dynamics. Recent improvements in deep learning (DL) …
Distributed policy iteration for scalable approximation of cooperative multi-agent policies
Decision making in multi-agent systems (MAS) is a great challenge due to enormous state
and joint action spaces as well as uncertainty, making centralized control generally …
and joint action spaces as well as uncertainty, making centralized control generally …
SAT-MARL: Specification aware training in multi-agent reinforcement learning
A characteristic of reinforcement learning is the ability to develop unforeseen strategies
when solving problems. While such strategies sometimes yield superior performance, they …
when solving problems. While such strategies sometimes yield superior performance, they …
Specification aware multi-agent reinforcement learning
Engineering intelligent industrial systems is challenging due to high complexity and
uncertainty with respect to domain dynamics and multiple agents. If industrial systems act …
uncertainty with respect to domain dynamics and multiple agents. If industrial systems act …