BI-DIRECTIONAL MONTE CARLO TREE SEARCH
This paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Dire...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
UKM Press
2021-06-01
|
Series: | Asia-Pacific Journal of Information Technology and Multimedia |
Subjects: | |
Online Access: | https://www.ukm.my/apjitm/view.php?id=199 |
id |
doaj-8743658040d14ce6be66fc39fbe36e1d |
---|---|
record_format |
Article |
spelling |
doaj-8743658040d14ce6be66fc39fbe36e1d2021-06-10T13:24:27ZengUKM PressAsia-Pacific Journal of Information Technology and Multimedia2289-21922021-06-0110011726https://doi.org/10.17576/apjitm-2021-1001-02BI-DIRECTIONAL MONTE CARLO TREE SEARCHKristian SpoererThis paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Directional MCTS is tested on 8-Puzzle and Pancakes Problem, two single-agent search problems, which allow control over the optimal solution length d and average branching factor b respectively. Preliminary results indicate that enhancing Monte Carlo Tree Search by making it Bi-Directional speeds up the search. The speedup of Bi-directional MCTS grows with increasing the problem size, in terms of both optimal solution length d and also branching factor b. Furthermore, Bi-Directional Search has been applied to a Reinforcement Learning algorithm. It is hoped that the speed enhancement of Bi-directional Monte Carlo Tree Search will also apply to other planning problems.https://www.ukm.my/apjitm/view.php?id=199single agent searchmonte carlo tree searchbi-directional search8-puzzlepancakes problemreinforcement learning. |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kristian Spoerer |
spellingShingle |
Kristian Spoerer BI-DIRECTIONAL MONTE CARLO TREE SEARCH Asia-Pacific Journal of Information Technology and Multimedia single agent search monte carlo tree search bi-directional search 8-puzzle pancakes problem reinforcement learning. |
author_facet |
Kristian Spoerer |
author_sort |
Kristian Spoerer |
title |
BI-DIRECTIONAL MONTE CARLO TREE SEARCH |
title_short |
BI-DIRECTIONAL MONTE CARLO TREE SEARCH |
title_full |
BI-DIRECTIONAL MONTE CARLO TREE SEARCH |
title_fullStr |
BI-DIRECTIONAL MONTE CARLO TREE SEARCH |
title_full_unstemmed |
BI-DIRECTIONAL MONTE CARLO TREE SEARCH |
title_sort |
bi-directional monte carlo tree search |
publisher |
UKM Press |
series |
Asia-Pacific Journal of Information Technology and Multimedia |
issn |
2289-2192 |
publishDate |
2021-06-01 |
description |
This paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Directional MCTS is tested on 8-Puzzle and Pancakes Problem, two single-agent search problems, which allow control over the optimal solution length d and average branching factor b respectively. Preliminary results indicate that enhancing Monte Carlo Tree Search by making it Bi-Directional speeds up the search. The speedup of Bi-directional MCTS grows with increasing the problem size, in terms of both optimal solution length d and also branching factor b. Furthermore, Bi-Directional Search has been applied to a Reinforcement Learning algorithm. It is hoped that the speed enhancement of Bi-directional Monte Carlo Tree Search will also apply to other planning problems. |
topic |
single agent search monte carlo tree search bi-directional search 8-puzzle pancakes problem reinforcement learning. |
url |
https://www.ukm.my/apjitm/view.php?id=199 |
work_keys_str_mv |
AT kristianspoerer bidirectionalmontecarlotreesearch |
_version_ |
1721384823097917440 |