General board game playing for education and research in generic AI game learning

W Konen - 2019 IEEE conference on Games (CoG), 2019 - ieeexplore.ieee.org
We present a new general board game (GBG) playing and learning framework. GBG defines
the common interfaces for board games, game states and their AI agents. It allows one to run …

Mastering 2048 with delayed temporal coherence learning, multistage weight promotion, redundant encoding, and carousel sha**

W Jaśkowski - IEEE Transactions on Games, 2017 - ieeexplore.ieee.org
2048 is an engaging single-player nondeterministic video puzzle game, which, thanks to the
simple rules and hard-tomaster gameplay, has gained massive popularity in recent years …

Multistage temporal difference learning for 2048-like games

KH Yeh, IC Wu, CH Hsueh, CC Chang… - … Intelligence and AI …, 2016 - ieeexplore.ieee.org
Szubert and Jaśkowski successfully used temporal difference (TD) learning together with n-
tuple networks for playing the game 2048. However, we observed a phenomenon that the …

Dynamic random distribution learning rate for neural networks training

X Hu, S Wen, HK Lam - Applied Soft Computing, 2022 - Elsevier
The learning rate is the most crucial hyper-parameter of a neural network that has a
significant impact on its performance. In this article, a novel learning rate setting idea termed …

[PDF][PDF] A history of meta-gradient: Gradient methods for meta-learning

RS Sutton - arxiv preprint arxiv:2202.09701, 2022 - arxiv.org
arxiv:2202.09701v1 [cs.LG] 20 Feb 2022 A History of Meta-gradient: Gradient Methods for
Meta-learning Page 1 arxiv:2202.09701v1 [cs.LG] 20 Feb 2022 A History of Meta-gradient …

[PDF][PDF] Alpha-beta pruning in mini-max algorithm–an optimized approach for a connect-4 game

R Nasa, R Didwania, S Maji, V Kumar - Int. Res. J. Eng. Technol, 2018 - academia.edu
More than six decades after the term Artificial Intelligence was coined by John McCarthy to
describe intelligent behavior displayed by machines, finally the technology enabled world …

AlphaZero-inspired game learning: Faster training by using MCTS only at test time

J Scheiermann, W Konen - IEEE Transactions on Games, 2022 - ieeexplore.ieee.org
Recently, the seminal algorithms AlphaGo and AlphaZero have started a new era in game
learning and deep reinforcement learning. While the achievements of AlphaGo and …

Temporal difference learning with eligibility traces for the game connect four

M Thill, S Bagheri, P Koch… - 2014 IEEE Conference on …, 2014 - ieeexplore.ieee.org
Systems that learn to play board games are often trained by self-play on the basis of
temporal difference (TD) learning. Successful examples include Tesauro's well known TD …

[PDF][PDF] Reinforcement learning for board games: The temporal difference algorithm

W Konen - … Intelligence, Optimization and Data Mining), TH Köln …, 2015 - researchgate.net
This technical report shows how the ideas of reinforcement learning (RL) and temporal
difference (TD) learning can be applied to board games. This report collects the main ideas …

[PDF][PDF] Temporal difference learning methods with automatic step-size adaption for strategic board games: Connect-4 and Dots-and-Boxes

M Thill - Cologne University of Applied Sciences Masters thesis, 2015 - gm.th-koeln.de
Abstract Machine learning tasks for board games which rely solely on self-play methods
remain rather challenging up till today. The perhaps most impressive breakthrough in this …