A survey of monte carlo tree search methods

CB Browne, E Powley, D Whitehouse… - … Intelligence and AI …, 2012 - ieeexplore.ieee.org
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the
precision of tree search with the generality of random sampling. It has received considerable …

Monte-Carlo tree search and rapid action value estimation in computer Go

S Gelly, D Silver - Artificial Intelligence, 2011 - Elsevier
A new paradigm for search, based on Monte-Carlo simulation, has revolutionised the
performance of computer Go programs. In this article we describe two extensions to the …

Fuego—an open-source framework for board games and Go engine based on Monte Carlo tree search

M Enzenberger, M Müller, B Arneson… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Fuego is both an open-source software framework and a state-of-the-art program that plays
the game of Go. The framework supports develo** game engines for full-information two …

Texplore: real-time sample-efficient reinforcement learning for robots

T Hester, P Stone - Machine learning, 2013 - Springer
The use of robots in society could be expanded by using reinforcement learning (RL) to
allow robots to learn and adapt to new situations online. RL is a paradigm for learning …

[PDF][PDF] Monte-carlo tree search

GMJBC Chaslot - 2010 - cris.maastrichtuniversity.nl
This thesis studies the use of Monte-Carlo simulations for tree-search problems. The Monte-
Carlo technique we investigate is Monte-Carlo Tree Search (MCTS). It is a best-first search …

The computational intelligence of MoGo revealed in Taiwan's computer Go tournaments

CS Lee, MH Wang, G Chaslot, JB Hoock… - … Intelligence and AI …, 2009 - ieeexplore.ieee.org
In order to promote computer Go and stimulate further development and research in the
field, the event activities, Computational Intelligence Forum and World 9 \,*\, 9 Computer Go …

Gossip-based distributed stochastic bandit algorithms

B Szorenyi, R Busa-Fekete, I Hegedus… - International …, 2013 - proceedings.mlr.press
The multi-armed bandit problem has attracted remarkable attention in the machine learning
community and many efficient algorithms have been proposed to handle the so-called …

Continuous lunches are free plus the design of optimal optimization algorithms

A Auger, O Teytaud - Algorithmica, 2010 - Springer
This paper analyses extensions of No-Free-Lunch (NFL) theorems to countably infinite and
uncountable infinite domains and investigates the design of optimal optimization algorithms …

Reinforcement learning and simulation-based search in computer Go

D Silver - 2009 - era.library.ualberta.ca
Learning and planning are two fundamental problems in artificial intelligence. The learning
problem can be tackled by reinforcement learning methods, such as temporal-difference …

Current frontiers in computer Go

A Rimmel, O Teytaud, CS Lee, SJ Yen… - … Intelligence and AI …, 2010 - ieeexplore.ieee.org
This paper presents the recent technical advances in Monte Carlo tree search (MCTS) for
the game of Go, shows the many similarities and the rare differences between the current …