Pytorch: An imperative style, high-performance deep learning library

A Paszke, S Gross, F Massa, A Lerer… - Advances in neural …, 2019 - proceedings.neurips.cc
Deep learning frameworks have often focused on either usability or speed, but not both.
PyTorch is a machine learning library that shows that these two goals are in fact compatible …

Grandmaster level in StarCraft II using multi-agent reinforcement learning

O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu… - nature, 2019 - nature.com
Many real-world applications require artificial agents to compete and coordinate with other
agents in complex environments. As a step** stone to this goal, the domain of StarCraft …

A survey of opponent modeling in adversarial domains

S Nashed, S Zilberstein - Journal of Artificial Intelligence Research, 2022 - jair.org
Opponent modeling is the ability to use prior knowledge and observations in order to predict
the behavior of an opponent. This survey presents a comprehensive overview of existing …

Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning

RZ Liu, H Guo, X Ji, Y Yu, ZJ Pang… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Injecting human knowledge is an effective way to accelerate reinforcement learning (RL).
However, these methods are underexplored. This article presents our discovery that an …

Alternate inference-decision reinforcement learning with generative adversarial inferring for bridge bidding

J Wang, S Wang, T Xu - Neural Computing and Applications, 2024 - Springer
Contract bridge is a competitive-cooperative multiplayer game. In the bidding phase, the
decision-making process is complex, given the extensive range of inaccessible information …

Light-weight probing of unsupervised representations for reinforcement learning

W Zhang, A GX-Chen, V Sobal, Y LeCun… - arxiv preprint arxiv …, 2022 - arxiv.org
Unsupervised visual representation learning offers the opportunity to leverage large corpora
of unlabeled trajectories to form useful visual representations, which can benefit the training …

[HTML][HTML] Proactive self-exploration: Leveraging information sharing and predictive modelling for anticipating and countering adversaries

SS Hashmi, HK Dam, MB Chhetri, AV Uzunov… - Expert Systems with …, 2025 - Elsevier
In contested environments, self-exploration is a self-* property that serves as a proactive
mechanism for countering threats posed by adversaries. Specifically, self-exploration …

Human-autonomy teaming and explainable ai capabilities in rts games

C Lucero, C Izumigawa, K Frederiksen, L Nans… - … 2020, Held as Part of the …, 2020 - Springer
Real-time strategy games often times mimic the appearance and feel of military-like
command and control systems. Artificial intelligence and machine learning research enjoys …

[PDF][PDF] Playing Card-Based RTS Games with Deep Reinforcement Learning.

T Liu, Z Zheng, H Li, K Bian, L Song - IJCAI, 2019 - ijcai.org
Game AI is of great importance as games are simulations of reality. Recent research on
game AI has shown much progress in various kinds of games, such as console games …

[PDF][PDF] Simple is better: Training an end-to-end contract bridge bidding agent without human knowledge

Q Gong, Y Jiang, Y Tian - Real-world Sequential Decision …, 2019 - realworld-sdm.github.io
Contract bridge is a multi-player imperfectinformation game where one partnership
collaborate with each other to compete against the other partnership. The game consists of …