Monte Carlo tree search: A review of recent modifications and applications
Abstract Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-
playing bots or solving sequential decision problems. The method relies on intelligent tree …
playing bots or solving sequential decision problems. The method relies on intelligent tree …
A survey of monte carlo tree search methods
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the
precision of tree search with the generality of random sampling. It has received considerable …
precision of tree search with the generality of random sampling. It has received considerable …
Progressive strategies for Monte-Carlo tree search
Monte-Carlo Tree Search (MCTS) is a new best-first search guided by the results of Monte-
Carlo simulations. In this article, we introduce two progressive strategies for MCTS, called …
Carlo simulations. In this article, we introduce two progressive strategies for MCTS, called …
Parallel monte-carlo tree search
Abstract Monte-Carlo Tree Search (MCTS) is a new best-first search method that started a
revolution in the field of Computer Go. Parallelizing MCTS is an important way to increase …
revolution in the field of Computer Go. Parallelizing MCTS is an important way to increase …
Episodic exploration for deep deterministic policies: An application to starcraft micromanagement tasks
We consider scenarios from the real-time strategy game StarCraft as new benchmarks for
reinforcement learning algorithms. We propose micromanagement tasks, which present the …
reinforcement learning algorithms. We propose micromanagement tasks, which present the …
The combinatorial multi-armed bandit problem and its application to real-time strategy games
S Ontanón - Proceedings of the AAAI Conference on Artificial …, 2013 - ojs.aaai.org
Game tree search in games with large branching factors is a notoriously hard problem. In
this paper, we address this problem with a new sampling strategy for Monte Carlo Tree …
this paper, we address this problem with a new sampling strategy for Monte Carlo Tree …
A monte-carlo aixi approximation
This paper introduces a principled approach for the design of a scalable general
reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a …
reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a …
Making friends on the fly: Cooperating with new teammates
Robots are being deployed in an increasing variety of environments for longer periods of
time. As the number of robots grows, they will increasingly need to interact with other robots …
time. As the number of robots grows, they will increasingly need to interact with other robots …
UCT for tactical assault battles in real-time strategy games
RK Balla - 2009 - ir.library.oregonstate.edu
We consider the problem of tactical assault planning in real-time strategy games where a
team of friendly agents must launch an assault on an enemy. This problem offers many …
team of friendly agents must launch an assault on an enemy. This problem offers many …
[HTML][HTML] POMDP-based control of workflows for crowdsourcing
Crowdsourcing, outsourcing of tasks to a crowd of unknown people (“workers”) in an open
call, is rapidly rising in popularity. It is already being heavily used by numerous employers …
call, is rapidly rising in popularity. It is already being heavily used by numerous employers …