W Konen - 2019 IEEE conference on Games (CoG), 2019 - ieeexplore.ieee.org

We present a new general board game (GBG) playing and learning framework. GBG defines
the common interfaces for board games, game states and their AI agents. It allows one to run …

Cited by 32 (11 versions)

Mastering 2048 with delayed temporal coherence learning, multistage weight promotion, redundant encoding, and carousel shaping

W Jaśkowski - IEEE Transactions on Games, 2017 - ieeexplore.ieee.org

2048 is an engaging single-player nondeterministic video puzzle game, which, thanks to the
simple rules and hard-to-master gameplay, has gained massive popularity in recent years …

Cited by 44 (3 versions)

Multistage temporal difference learning for 2048-like games

KH Yeh, IC Wu, CH Hsueh, CC Chang… - … Intelligence and AI …, 2016 - ieeexplore.ieee.org

Szubert and Jaśkowski successfully used temporal difference (TD) learning together with
n-tuple networks for playing the game 2048. However, we observed a phenomenon that the …

Cited by 41 (7 versions)

Dynamic random distribution learning rate for neural networks training

X Hu, S Wen, HK Lam - Applied Soft Computing, 2022 - Elsevier

The learning rate is the most crucial hyper-parameter of a neural network that has a
significant impact on its performance. In this article, a novel learning rate setting idea termed …

Cited by 10 (3 versions)

A history of meta-gradient: Gradient methods for meta-learning

RS Sutton - arXiv preprint arXiv:2202.09701, 2022 - arxiv.org

arXiv:2202.09701v1 [cs.LG], 20 Feb 2022.

Cited by 10 (2 versions)

Alpha-beta pruning in mini-max algorithm–an optimized approach for a connect-4 game

R Nasa, R Didwania, S Maji, V Kumar - Int. Res. J. Eng. Technol, 2018 - academia.edu

More than six decades after the term Artificial Intelligence was coined by John McCarthy to
describe intelligent behavior displayed by machines, finally the technology enabled world …

Cited by 22

AlphaZero-inspired game learning: Faster training by using MCTS only at test time

J Scheiermann, W Konen - IEEE Transactions on Games, 2022 - ieeexplore.ieee.org

Recently, the seminal algorithms AlphaGo and AlphaZero have started a new era in game
learning and deep reinforcement learning. While the achievements of AlphaGo and …

Cited by 5 (4 versions)

Temporal difference learning with eligibility traces for the game connect four

M Thill, S Bagheri, P Koch… - 2014 IEEE Conference on …, 2014 - ieeexplore.ieee.org

Systems that learn to play board games are often trained by self-play on the basis of
temporal difference (TD) learning. Successful examples include Tesauro's well known TD …

Cited by 27 (11 versions)

Reinforcement learning for board games: The temporal difference algorithm

W Konen - … Intelligence, Optimization and Data Mining), TH Köln …, 2015 - researchgate.net

This technical report shows how the ideas of reinforcement learning (RL) and temporal
difference (TD) learning can be applied to board games. This report collects the main ideas …

Cited by 14 (2 versions)

Temporal difference learning methods with automatic step-size adaption for strategic board games: Connect-4 and Dots-and-Boxes

M Thill - Cologne University of Applied Sciences Masters thesis, 2015 - gm.th-koeln.de

Abstract Machine learning tasks for board games which rely solely on self-play methods
remain rather challenging up till today. The perhaps most impressive breakthrough in this …

Cited by 13 (3 versions)
