Honor of kings arena: an environment for generalization in competitive reinforcement learning

H Wei, J Chen, X Ji, H Qin, M Deng… - Advances in …, 2022 - proceedings.neurips.cc
This paper introduces Honor of Kings Arena, a reinforcement learning (RL) environment
based on the Honor of Kings, one of the world's most popular games at present. Compared …

Juewu-mc: Playing minecraft with sample-efficient hierarchical reinforcement learning

Z Lin, J Li, J Shi, D Ye, Q Fu, W Yang - arxiv preprint arxiv:2112.04907, 2021 - arxiv.org
Learning rational behaviors in open-world games like Minecraft remains to be challenging
for Reinforcement Learning (RL) research due to the compound challenge of partial …

Pretraining in deep reinforcement learning: A survey

Z **e, Z Lin, J Li, S Li, D Ye - arxiv preprint arxiv:2211.03959, 2022 - arxiv.org
The past few years have seen rapid progress in combining reinforcement learning (RL) with
deep learning. Various breakthroughs ranging from games to robotics have spurred the …

A Survey on Self-play Methods in Reinforcement Learning

R Zhang, Z Xu, C Ma, C Yu, WW Tu, S Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Self-play, characterized by agents' interactions with copies or past versions of itself, has
recently gained prominence in reinforcement learning. This paper first clarifies the …

Interpretable real-time win prediction for honor of kings—a popular mobile MOBA esport

Z Yang, Z Pan, Y Wang, D Cai, S Shi… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
With the rapid prevalence and explosive development of Multiplayer Online Battle Arena
electronic sports (MOBA esports), much research effort has been devoted to automatically …

CTDS: Centralized Teacher With Decentralized Student for Multiagent Reinforcement Learning

J Zhao, X Hu, M Yang, W Zhou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Due to the partial observability and communication constraints in many multiagent
reinforcement learning (MARL) tasks, centralized training with decentralized execution …

Revisiting discrete soft actor-critic

H Zhou, Z Lin, J Li, Q Fu, W Yang, D Ye - arxiv preprint arxiv:2209.10081, 2022 - arxiv.org
We study the adaption of soft actor-critic (SAC) from continuous action space to discrete
action space. We revisit vanilla SAC and provide an in-depth understanding of its Q value …

A concise review of intelligent game agent

H Li, X Pang, B Sun, K Liu - Entertainment Computing, 2024 - Elsevier
Intelligent game agents are crafted using AI technologies to mimic player behavior and
make decisions autonomously. Over the past decades, the scope of intelligent agents has …

BPCoach: Exploring Hero Drafting in Professional MOBA Tournaments via Visual Analytics

S Liu, R Ma, C Zhao, Z Li, J **ao, Q Li - … of the ACM on Human-Computer …, 2024 - dl.acm.org
Hero drafting for multiplayer online arena (MOBA) games is crucial because drafting directly
affects the outcome of a match. Both sides take turns to" ban"/" pick" a hero from a roster of …

Learning diverse policies in moba games via macro-goals

Y Gao, B Shi, X Du, L Wang, G Chen… - Advances in …, 2021 - proceedings.neurips.cc
Recently, many researchers have made successful progress in building the AI systems for
MOBA-game-playing with deep reinforcement learning, such as on Dota 2 and Honor of …