General board game playing for education and research in generic AI game learning
W Konen - 2019 IEEE conference on Games (CoG), 2019 - ieeexplore.ieee.org
We present a new general board game (GBG) playing and learning framework. GBG defines
the common interfaces for board games, game states and their AI agents. It allows one to run …
the common interfaces for board games, game states and their AI agents. It allows one to run …
Mastering 2048 with delayed temporal coherence learning, multistage weight promotion, redundant encoding, and carousel sha**
W Jaśkowski - IEEE Transactions on Games, 2017 - ieeexplore.ieee.org
2048 is an engaging single-player nondeterministic video puzzle game, which, thanks to the
simple rules and hard-tomaster gameplay, has gained massive popularity in recent years …
simple rules and hard-tomaster gameplay, has gained massive popularity in recent years …
Multistage temporal difference learning for 2048-like games
Szubert and Jaśkowski successfully used temporal difference (TD) learning together with n-
tuple networks for playing the game 2048. However, we observed a phenomenon that the …
tuple networks for playing the game 2048. However, we observed a phenomenon that the …
Dynamic random distribution learning rate for neural networks training
The learning rate is the most crucial hyper-parameter of a neural network that has a
significant impact on its performance. In this article, a novel learning rate setting idea termed …
significant impact on its performance. In this article, a novel learning rate setting idea termed …
[PDF][PDF] A history of meta-gradient: Gradient methods for meta-learning
RS Sutton - arxiv preprint arxiv:2202.09701, 2022 - arxiv.org
arxiv:2202.09701v1 [cs.LG] 20 Feb 2022 A History of Meta-gradient: Gradient Methods for
Meta-learning Page 1 arxiv:2202.09701v1 [cs.LG] 20 Feb 2022 A History of Meta-gradient …
Meta-learning Page 1 arxiv:2202.09701v1 [cs.LG] 20 Feb 2022 A History of Meta-gradient …
[PDF][PDF] Alpha-beta pruning in mini-max algorithm–an optimized approach for a connect-4 game
R Nasa, R Didwania, S Maji, V Kumar - Int. Res. J. Eng. Technol, 2018 - academia.edu
More than six decades after the term Artificial Intelligence was coined by John McCarthy to
describe intelligent behavior displayed by machines, finally the technology enabled world …
describe intelligent behavior displayed by machines, finally the technology enabled world …
AlphaZero-inspired game learning: Faster training by using MCTS only at test time
J Scheiermann, W Konen - IEEE Transactions on Games, 2022 - ieeexplore.ieee.org
Recently, the seminal algorithms AlphaGo and AlphaZero have started a new era in game
learning and deep reinforcement learning. While the achievements of AlphaGo and …
learning and deep reinforcement learning. While the achievements of AlphaGo and …
Temporal difference learning with eligibility traces for the game connect four
M Thill, S Bagheri, P Koch… - 2014 IEEE Conference on …, 2014 - ieeexplore.ieee.org
Systems that learn to play board games are often trained by self-play on the basis of
temporal difference (TD) learning. Successful examples include Tesauro's well known TD …
temporal difference (TD) learning. Successful examples include Tesauro's well known TD …
[PDF][PDF] Reinforcement learning for board games: The temporal difference algorithm
W Konen - … Intelligence, Optimization and Data Mining), TH Köln …, 2015 - researchgate.net
This technical report shows how the ideas of reinforcement learning (RL) and temporal
difference (TD) learning can be applied to board games. This report collects the main ideas …
difference (TD) learning can be applied to board games. This report collects the main ideas …
[PDF][PDF] Temporal difference learning methods with automatic step-size adaption for strategic board games: Connect-4 and Dots-and-Boxes
M Thill - Cologne University of Applied Sciences Masters thesis, 2015 - gm.th-koeln.de
Abstract Machine learning tasks for board games which rely solely on self-play methods
remain rather challenging up till today. The perhaps most impressive breakthrough in this …
remain rather challenging up till today. The perhaps most impressive breakthrough in this …