BI-DIRECTIONAL MONTE CARLO TREE SEARCH

This paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Dire...

Full description

Bibliographic Details
Main Author: Kristian Spoerer
Format: Article
Language:English
Published: UKM Press 2021-06-01
Series:Asia-Pacific Journal of Information Technology and Multimedia
Subjects:
Online Access:https://www.ukm.my/apjitm/view.php?id=199
id doaj-8743658040d14ce6be66fc39fbe36e1d
record_format Article
spelling doaj-8743658040d14ce6be66fc39fbe36e1d2021-06-10T13:24:27ZengUKM PressAsia-Pacific Journal of Information Technology and Multimedia2289-21922021-06-0110011726https://doi.org/10.17576/apjitm-2021-1001-02BI-DIRECTIONAL MONTE CARLO TREE SEARCHKristian SpoererThis paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Directional MCTS is tested on 8-Puzzle and Pancakes Problem, two single-agent search problems, which allow control over the optimal solution length d and average branching factor b respectively. Preliminary results indicate that enhancing Monte Carlo Tree Search by making it Bi-Directional speeds up the search. The speedup of Bi-directional MCTS grows with increasing the problem size, in terms of both optimal solution length d and also branching factor b. Furthermore, Bi-Directional Search has been applied to a Reinforcement Learning algorithm. It is hoped that the speed enhancement of Bi-directional Monte Carlo Tree Search will also apply to other planning problems.https://www.ukm.my/apjitm/view.php?id=199single agent searchmonte carlo tree searchbi-directional search8-puzzlepancakes problemreinforcement learning.
collection DOAJ
language English
format Article
sources DOAJ
author Kristian Spoerer
spellingShingle Kristian Spoerer
BI-DIRECTIONAL MONTE CARLO TREE SEARCH
Asia-Pacific Journal of Information Technology and Multimedia
single agent search
monte carlo tree search
bi-directional search
8-puzzle
pancakes problem
reinforcement learning.
author_facet Kristian Spoerer
author_sort Kristian Spoerer
title BI-DIRECTIONAL MONTE CARLO TREE SEARCH
title_short BI-DIRECTIONAL MONTE CARLO TREE SEARCH
title_full BI-DIRECTIONAL MONTE CARLO TREE SEARCH
title_fullStr BI-DIRECTIONAL MONTE CARLO TREE SEARCH
title_full_unstemmed BI-DIRECTIONAL MONTE CARLO TREE SEARCH
title_sort bi-directional monte carlo tree search
publisher UKM Press
series Asia-Pacific Journal of Information Technology and Multimedia
issn 2289-2192
publishDate 2021-06-01
description This paper describes a new algorithm called Bi-Directional Monte Carlo Tree Search. The essential idea of Bi-directional Monte Carlo Tree Search is to run an MCTS forwards from the start state, and simultaneously run an MCTS backwards from the goal state, and stop when the two searches meet. Bi-Directional MCTS is tested on 8-Puzzle and Pancakes Problem, two single-agent search problems, which allow control over the optimal solution length d and average branching factor b respectively. Preliminary results indicate that enhancing Monte Carlo Tree Search by making it Bi-Directional speeds up the search. The speedup of Bi-directional MCTS grows with increasing the problem size, in terms of both optimal solution length d and also branching factor b. Furthermore, Bi-Directional Search has been applied to a Reinforcement Learning algorithm. It is hoped that the speed enhancement of Bi-directional Monte Carlo Tree Search will also apply to other planning problems.
topic single agent search
monte carlo tree search
bi-directional search
8-puzzle
pancakes problem
reinforcement learning.
url https://www.ukm.my/apjitm/view.php?id=199
work_keys_str_mv AT kristianspoerer bidirectionalmontecarlotreesearch
_version_ 1721384823097917440