Multi-agent reinforcement learning: A selective overview of theories and algorithms
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …
has registered tremendous success in solving various sequential decision-making problems …
Model-based reinforcement learning: A survey
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …
optimization, is an important challenge in artificial intelligence. Two key approaches to this …
Mastering complex control in moba games with deep reinforcement learning
We study the reinforcement learning problem of complex action control in the Multi-player
Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state …
Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state …
Towards playing full moba games with deep reinforcement learning
MOBA games, eg, Honor of Kings, League of Legends, and Dota 2, pose grand challenges
to AI systems such as multi-agent, enormous state-action space, complex action control, etc …
to AI systems such as multi-agent, enormous state-action space, complex action control, etc …
Language agent tree search unifies reasoning acting and planning in language models
While large language models (LLMs) have demonstrated impressive performance on a
range of decision-making tasks, they rely on simple acting processes and fall short of broad …
range of decision-making tasks, they rely on simple acting processes and fall short of broad …
A survey of deep reinforcement learning in video games
Deep reinforcement learning (DRL) has made great achievements since proposed.
Generally, DRL agents receive high-dimensional inputs at each step, and make actions …
Generally, DRL agents receive high-dimensional inputs at each step, and make actions …
Exponentially weighted imitation learning for batched historical data
We consider deep policy learning with only batched historical trajectories. The main
challenge of this problem is that the learner no longer has a simulator or``environment …
challenge of this problem is that the learner no longer has a simulator or``environment …
Honor of kings arena: an environment for generalization in competitive reinforcement learning
This paper introduces Honor of Kings Arena, a reinforcement learning (RL) environment
based on the Honor of Kings, one of the world's most popular games at present. Compared …
based on the Honor of Kings, one of the world's most popular games at present. Compared …
Deep reinforcement learning based video games: A review
KA ElDahshan, H Farouk… - 2022 2nd International …, 2022 - ieeexplore.ieee.org
Video game development is getting increasingly effective as AI paradigms advance. Deep
Reinforcement Learning (DRL) is a promising artificial intelligence (AI) approach. It …
Reinforcement Learning (DRL) is a promising artificial intelligence (AI) approach. It …
Supervised learning achieves human-level performance in moba games: A case study of honor of kings
We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI) program
that achieves human-level performance in playing multiplayer online battle arena (MOBA) …
that achieves human-level performance in playing multiplayer online battle arena (MOBA) …