Imitating human behaviour with diffusion models

T Pearce, T Rashid, A Kanervisto, D Bignell… - arxiv preprint arxiv …, 2023 - arxiv.org
Diffusion models have emerged as powerful generative models in the text-to-image domain.
This paper studies their application as observation-to-action models for imitating human …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers

L Yuan, Z Zhang, K Xue, H Yin, F Chen… - Proceedings of the …, 2023 - ojs.aaai.org
Abstract Cooperative Multi-agent Reinforcement Learning (CMARL) has shown to be
promising for many real-world applications. Previous works mainly focus on improving …

Cooperation on the fly: Exploring language agents for ad hoc teamwork in the avalon game

Z Shi, M Fang, S Zheng, S Deng, L Chen… - arxiv preprint arxiv …, 2023 - arxiv.org
Multi-agent collaboration with Large Language Models (LLMs) demonstrates proficiency in
basic tasks, yet its efficiency in more complex scenarios remains unexplored. In gaming …

Generating diverse cooperative agents by learning incompatible policies

R Charakorn, P Manoonpong… - … Conference on Learning …, 2023 - openreview.net
Training a robust cooperative agent requires diverse partner agents. However, obtaining
those agents is difficult. Previous works aim to learn diverse behaviors by changing the state …

Decision making in open agent systems

A Eck, LK Soh, P Doshi - AI Magazine, 2023 - Wiley Online Library
In many real‐world applications of AI, the set of actors and tasks are not constant, but
instead change over time. Robots tasked with suppressing wildfires eventually run out of …

Fast teammate adaptation in the presence of sudden policy change

Z Zhang, L Yuan, L Li, K Xue, C Jia… - Uncertainty in …, 2023 - proceedings.mlr.press
Cooperative multi-agent reinforcement learning (MARL), where agents coordinates with
teammate (s) for a shared goal, may sustain non-stationary caused by the policy change of …

Byzantine robust cooperative multi-agent reinforcement learning as a bayesian game

S Li, J Guo, J **u, R Xu, X Yu, J Wang, A Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
In this study, we explore the robustness of cooperative multi-agent reinforcement learning (c-
MARL) against Byzantine failures, where any agent can enact arbitrary, worst-case actions …

Heterogeneous multi-agent zero-shot coordination by coevolution

K Xue, Y Wang, C Guan, L Yuan, H Fu… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Generating agents that can achieve zero-shot coordination (ZSC) with unseen partners is a
new challenge in cooperative multi-agent reinforcement learning (MARL). Recently, some …

Concept--An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

C Huang, P Qin, Y Deng, W Lei, J Lv… - arxiv preprint arxiv …, 2024 - arxiv.org
The conversational recommendation system (CRS) has been criticized regarding its user
experience in real-world scenarios, despite recent significant progress achieved in …