Exploring the Effect of Different Numbers of Convolutional Filters and Training Loops on the Performance of AlphaZero

Exploring the Effect of Different Numbers of Convolutional Filters and Training Loops on the Performance of AlphaZero

In this work, the algorithm used by AlphaZero is adapted for dots and boxes, a two-player game. This algorithm is explored using different numbers of convolutional filters and training loops, in order to better understand the effect these parameters have on the learning of the player. Different boar...

Full description

Bibliographic Details
Main Author:	Prince, Jared
Format:	Others
Published:	TopSCHOLAR® 2018
Subjects:	Monte Carlo tree search neural network dots and boxes Other Computer Sciences Robotics Theory and Algorithms
Online Access:	https://digitalcommons.wku.edu/theses/3087 https://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=4090&context=theses

Similar Items

Improving Monte Carlo Tree Search with Artificial Neural Networks without Heuristics
by: Alba Cotarelo, et al.
Published: (2021-02-01)

AlphaZero with Input Convex Neural Networks
by: Zhang, Shuyuan
Published: (2020)

Playing the Game of Risk with an AlphaZero Agent
by: Blomqvist, Erik
Published: (2020)

A Software Framework for AlphaZero-Like Applications
by: Li, Wei, et al.
Published: (2018)

PARALLEL MACHINE SCHEDULING WITH MONTE CARLO TREE SEARCH
by: Anita Agárdi, et al.
Published: (2021-04-01)

Deep Reinforcement LearningA case study of AlphaZero
by: Mattisson, Fredrik
Published: (2021)

Mastering the Game of Gomoku without Human Knowledge
by: Wang, Yuan
Published: (2018)

An improved relief feature selection algorithm based on Monte-Carlo tree search
by: Jianyang Zheng, et al.
Published: (2019-01-01)

An Entropy-Guided Monte Carlo Tree Search Approach for Generating Optimal Container Loading Layouts
by: Richard Cant, et al.
Published: (2018-11-01)

Information Distribution in Multi-Robot Systems: Generic, Utility-Aware Optimization Middleware
by: Michał Barciś, et al.
Published: (2021-07-01)

Learning to Play the Chess Variant Crazyhouse Above World Champion Level With Deep Neural Networks and Human Data
by: Johannes Czech, et al.
Published: (2020-04-01)

Efficient Searching With MCTS and Imitation Learning: A Case Study in Pommerman
by: Hailan Yang, et al.
Published: (2021-01-01)

A Student Attendance Management Method Based on Crowdsensing in Classroom Environment
by: Zhigang Gao, et al.
Published: (2021-01-01)

AiZynthFinder: a fast, robust and flexible open-source software for retrosynthetic planning
by: Samuel Genheden, et al.
Published: (2020-11-01)

RNA inverse folding using Monte Carlo tree search
by: Xiufeng Yang, et al.
Published: (2017-11-01)

ChemTS: an efficient python library for de novo molecular generation
by: Xiufeng Yang, et al.
Published: (2017-12-01)

A Path Planning Method Based on Improved Single Player-Monte Carlo Tree Search
by: Yu-Wei Xia, et al.
Published: (2020-01-01)

BI-DIRECTIONAL MONTE CARLO TREE SEARCH
by: Kristian Spoerer
Published: (2021-06-01)

Solving Games and All That
by: Saffidine, Abdallah
Published: (2013)

Reinforcement Learning-Based Motion Planning for Automatic Parking System
by: Jiren Zhang, et al.
Published: (2020-01-01)

Domain independent enhancements to Monte Carlo tree search for eurogames
by: Bergh, Peter
Published: (2020)

MONTE CARLO BENZETİMİNİN BİR KARAR PROBLEMİNE UYGULANMASI
by: Çiğdem ALABAŞ, et al.
Published: (2001-01-01)

MONTE CARLO BENZETİMİNİN BİR KARAR PROBLEMİNE UYGULANMASI
by: Çiğdem ALABAŞ, et al.
Published: (2001-01-01)

MONTE CARLO BENZETİMİNİN BİR KARAR PROBLEMİNE UYGULANMASI
by: Çiğdem ALABAŞ, et al.
Published: (2001-01-01)

MONTE CARLO BENZETİMİNİN BİR KARAR PROBLEMİNE UYGULANMASI
by: Çiğdem Alabaş, et al.
Published: (2001-01-01)

AlphaZero to Alpha Hero : A pre-study on Additional Tree Sampling within Self-Play Reinforcement Learning
by: Carlsson, Fredrik, et al.
Published: (2019)

Introduction of statistics in optimization
by: Teytaud, Fabien
Published: (2011)

Adversarial Game Playing Using Monte Carlo Tree Search
by: Sista, Subrahmanya Srivathsava
Published: (2016)

A Reinforcement Learning Scheme for Active Multi-Debris Removal Mission Planning With Modified Upper Confidence Bound Tree Search
by: Jianan Yang, et al.
Published: (2020-01-01)

HotSpot: Anomaly Localization for Additive KPIs With Multi-Dimensional Attributes
by: Yongqian Sun, et al.
Published: (2018-01-01)

Exploration of the Use of the Kinetic Monte Carlo Method in Simulation of Quantum Dot Growth
by: Ramsey, James J.
Published: (2011)

KINETIC MONTE CARLO SIMULATION OF BINARY ALLOYS
by: Marshall, Timothy Craig
Published: (2018)

RESOURCE CONSTRAINT COOPERATIVE GAME WITH MONTE CARLO TREE SEARCH
by: Cheng, Chee Chian
Published: (2016)

Monte Carlo Tree Search-Based Recursive Algorithm for Feature Selection in High-Dimensional Datasets
by: Muhammad Umar Chaudhry, et al.
Published: (2020-09-01)

MDTS: automatic complex materials design using Monte Carlo tree search
by: Thaer M. Dieb, et al.
Published: (2017-12-01)

Hybrid Policy Learning for Multi-Agent Pathfinding
by: Alexey Skrynnik, et al.
Published: (2021-01-01)

Feature Selection for High Dimensional Data Using Monte Carlo Tree Search
by: Muhammad Umar Chaudhry, et al.
Published: (2018-01-01)

Parallel Go on CUDA with Monte Carlo Tree Search
by: Zhou, Jun
Published: (2013)

Introduction of statistics in optimization
by: Teytaud, Fabien
Published: (2011)

Possible Impacts of Climate Change on Potential Tree Plant Forms of a Mountain Region in Central Taiwan
by: Biing-T. Guan, et al.
Published: (2003-12-01)