Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Mastering complex control in moba games with deep reinforcement learning

D Ye, Z Liu, M Sun, B Shi, P Zhao, H Wu, H Yu… - Proceedings of the AAAI …, 2020 - aaai.org
We study the reinforcement learning problem of complex action control in the Multi-player
Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state …

Towards playing full moba games with deep reinforcement learning

D Ye, G Chen, W Zhang, S Chen… - Advances in …, 2020 - proceedings.neurips.cc
MOBA games, eg, Honor of Kings, League of Legends, and Dota 2, pose grand challenges
to AI systems such as multi-agent, enormous state-action space, complex action control, etc …

Language agent tree search unifies reasoning acting and planning in language models

A Zhou, K Yan, M Shlapentokh-Rothman… - arxiv preprint arxiv …, 2023 - arxiv.org
While large language models (LLMs) have demonstrated impressive performance on a
range of decision-making tasks, they rely on simple acting processes and fall short of broad …

A survey of deep reinforcement learning in video games

K Shao, Z Tang, Y Zhu, N Li, D Zhao - arxiv preprint arxiv:1912.10944, 2019 - arxiv.org
Deep reinforcement learning (DRL) has made great achievements since proposed.
Generally, DRL agents receive high-dimensional inputs at each step, and make actions …

Exponentially weighted imitation learning for batched historical data

Q Wang, J **ong, L Han, H Liu… - Advances in Neural …, 2018 - proceedings.neurips.cc
We consider deep policy learning with only batched historical trajectories. The main
challenge of this problem is that the learner no longer has a simulator or``environment …

Honor of kings arena: an environment for generalization in competitive reinforcement learning

H Wei, J Chen, X Ji, H Qin, M Deng… - Advances in …, 2022 - proceedings.neurips.cc
This paper introduces Honor of Kings Arena, a reinforcement learning (RL) environment
based on the Honor of Kings, one of the world's most popular games at present. Compared …

Deep reinforcement learning based video games: A review

KA ElDahshan, H Farouk… - 2022 2nd International …, 2022 - ieeexplore.ieee.org
Video game development is getting increasingly effective as AI paradigms advance. Deep
Reinforcement Learning (DRL) is a promising artificial intelligence (AI) approach. It …

Supervised learning achieves human-level performance in moba games: A case study of honor of kings

D Ye, G Chen, P Zhao, F Qiu, B Yuan… - … on Neural Networks …, 2020 - ieeexplore.ieee.org
We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI) program
that achieves human-level performance in playing multiplayer online battle arena (MOBA) …