情報処理学会研究報告 IPSJ SIG Technical Report Vol.2010-GI-24 No /6/25 UCT UCT UCT UCB A new UCT search method using position evaluation function an
|
|
- Arlene Laureen Bridges
- 5 years ago
- Views:
Transcription
1 UCT UCT UCT UCB A new UCT search method using position evaluation function and its evaluation by Othello Shota Maehara, 1 Tsuyoshi Hashimoto 2 and Yasuyuki Kobayashi 1 The Monte Carlo tree search, particularly UCT, gains a great success and is being widely studied as a game tree search method. However UCT using position evaluation function has been studied because of the difficulty of calculating position evaluation function the game of GO, the main target of UCT research. We propose a new method that adds position evaluation function to the UCB value in this paper. It is implemented for the game of Othello that is relatively easy to make position evaluation function and experiments are performed. The results show the overwhelming ability of proposed method and its effectiveness is verified ) UCT 2) UCT 1 Graduate School of Science and Engineering, University of Shimane 2 Department of Information Engineering, College of Technology of Matsue 1 M.Buro Logistello 1997 UCB 3) 4)5) 1)2)6) UCT Amazons 7)8) LOA 9) 2 Nested Monte-Carlo Morpion Solitaire 10)11) 1 c 2010 Information
2 UCT UCB UCT 2. UCB UCB Upper Confidence Bound 3) UCB UCB i X i n c X i + c (1) UCB UCB1 3) (1) UCB UCB UCT(UCB applied to Trees) 2) UCT UCB )13) Amazons 8) UCT 13) MINMAX 7) MINMAX Amazons 4)5) UCB 4. UCT UCB UCB UCB UCB UCB UCT+ i i E i E i (1) UCB (X i + E i) + c (2) UCT+ (2) UCT+ (1) UCT 2 c 2010 Information
3 Sum1 Sum2 (2) 2 Score Sum1 Sum i Sum1 i Sum2 i Score i UCT+ (3) E = (X i + Sum1i Sum2i, c = Sum1i Sum2i ) (3) (2) E i c UCT+ 1 15) 14) Sum1 Sum2 Score (+1) ( 1) (0) 1 A 1 Sum1 A Sum2 Score Sum1 Sum2 Score Score (2) X i Sum1 Sum2 E i 5.2 UCT scrzebra 1 S S=40 40 S=50 (g,1),-10, (h,1),14, (h,2),-10, (h,3),0 (h,7),2, 2 5 (h,1) (h,7) c 2010 Information
4 Table 3 3 (%) Right answer percentage by simple method (Playout=2000) % UCT (%) Table 1 Right answer percentage by UCT. Playout= Playout= UCT+ (%) Table 2 Right answer percentage by UCT+. Playout= Playout= ( 0.5) OS Debian Linux5.0.3 CPU Pentium4 3.2GHz 512MB C UCB UCT UCT+ c UCT UCT 1 3 UCT UCT+ UCB UCT Amazons 2 UCB 8. UCB Logistello 1) Coulom, R.: Efficient selectivity and backup operators in monte-carlo tree search, Proceedings of the 5th International Conference on Computers and Games, Turin, Italy (2006). 2) Kocsis L. and Szepesvari, C.: Bandit based Monte-Carlo Planning Proceedings of the 15th European Conference on Machine Learning pp (2006). 3) Auer, P., Cesa-Bianchi, N. and Fischer, P.: Finite time Analysis of the Multi-armed Bandit Problem, Machine Learning, Vol. 47, pp c 2010 Information
5 256 (2002). 4),, :, Proceedings of The 11th Game Programming Workshop pp (2006). 5), :, 13 pp.1 8 (2008). 6) Gelly, S., Wang, Y., Munos, R. and Teytaud, O.: Modifications of UCT with Patterns in Monte-Carlo Go, Technical Report RR-6062, INRIA (2006). 7) Lorentz, R.: Amazons Discover Monte Carlo Computers and Games, Lecture Notes in Computer Science, Vol. 5131, pp.13 24, (2008). 8) Julien Kloetzer, Hiroyuki Iida and Bruno Bouzy: Playing Amazons Endgames, ICGA Journal, To be appear, 9) Winands, M.H.M. and Bjornsson, Y. (2010): Evaluation Function Based Monte-Carlo LOA, In Advances in Computer Games (ACG 2009), Lecture Notes in Computer Science (LNCS 6048), pp c Springer, Berlin Heidelberg. 10) Tristan Cazenave: Nested Monte-Carlo Search, IJCAI2009, pp (2009). 11) Nested Monte-Carlo AMAF, Vol.2010, No.7, 2009-GI-23, pp.1 7 (2010). 12) Coulom, R.: Computing Elo Ratings of Move Patterns in the Game of Go, In Computer Game Workshop, Amsterdam, The Netherlands (2007). 13) :, Vol.2009, No.27, 2009-GI-21, pp (2009). 14) : 15) :, 17 (2005). 5 c 2010 Information
A Bandit Approach for Tree Search
A An Example in Computer-Go Department of Statistics, University of Michigan March 27th, 2008 A 1 Bandit Problem K-Armed Bandit UCB Algorithms for K-Armed Bandit Problem 2 Classical Tree Search UCT Algorithm
More informationA Parallel Monte-Carlo Tree Search Algorithm
A Parallel Monte-Carlo Tree Search Algorithm Tristan Cazenave and Nicolas Jouandeau LIASD, Université Paris 8, 93526, Saint-Denis, France cazenave@ai.univ-paris8.fr n@ai.univ-paris8.fr Abstract. Monte-Carlo
More informationAvailable online at ScienceDirect. Procedia Computer Science 62 (2015 ) 31 38
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 62 (2015 ) 31 38 The 2015 International Conference on Soft Computing and Software Engineering (SCSE 2015) Analysis of a
More informationScore Bounded Monte-Carlo Tree Search
Score Bounded Monte-Carlo Tree Search Tristan Cazenave and Abdallah Saffidine LAMSADE Université Paris-Dauphine Paris, France cazenave@lamsade.dauphine.fr Abdallah.Saffidine@gmail.com Abstract. Monte-Carlo
More informationEarly Playout Termination in MCTS
Early Playout Termination in MCTS Richard Lorentz (B) Department of Computer Science, California State University, Northridge, CA 91330-8281, USA lorentz@csun.edu Abstract. Many researchers view mini-max
More informationA Study of UCT and its Enhancements in an Artificial Game
A Study of UCT and its Enhancements in an Artificial Game David Tom and Martin Müller Department of Computing Science, University of Alberta, Edmonton, Canada, T6G 2E8 {dtom, mmueller}@cs.ualberta.ca Abstract.
More informationCombining Final Score with Winning Percentage by Sigmoid Function in Monte-Carlo Simulations
Combining Final Score with Winning Percentage by Sigmoid Function in Monte-Carlo Simulations Kazutomo SHIBAHARA Yoshiyuki KOTANI Abstract Monte-Carlo method recently has produced good results in Go. Monte-Carlo
More informationExploration exploitation in Go: UCT for Monte-Carlo Go
Exploration exploitation in Go: UCT for Monte-Carlo Go Sylvain Gelly(*) and Yizao Wang(*,**) (*)TAO (INRIA), LRI, UMR (CNRS - Univ. Paris-Sud) University of Paris-Sud, Orsay, France sylvain.gelly@lri.fr
More informationMonte-Carlo Tree Search Enhancements for Havannah
Monte-Carlo Tree Search Enhancements for Havannah Jan A. Stankiewicz, Mark H.M. Winands, and Jos W.H.M. Uiterwijk Department of Knowledge Engineering, Maastricht University j.stankiewicz@student.maastrichtuniversity.nl,
More informationMonte-Carlo Tree Search for the Simultaneous Move Game Tron
Monte-Carlo Tree Search for the Simultaneous Move Game Tron N.G.P. Den Teuling June 27, 2011 Abstract Monte-Carlo Tree Search (MCTS) has been successfully applied to many games, particularly in Go. In
More informationBuilding Opening Books for 9 9 Go Without Relying on Human Go Expertise
Journal of Computer Science 8 (10): 1594-1600, 2012 ISSN 1549-3636 2012 Science Publications Building Opening Books for 9 9 Go Without Relying on Human Go Expertise 1 Keh-Hsun Chen and 2 Peigang Zhang
More informationPlayout Search for Monte-Carlo Tree Search in Multi-Player Games
Playout Search for Monte-Carlo Tree Search in Multi-Player Games J. (Pim) A.M. Nijssen and Mark H.M. Winands Games and AI Group, Department of Knowledge Engineering, Faculty of Humanities and Sciences,
More informationGeneralized Rapid Action Value Estimation
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015) Generalized Rapid Action Value Estimation Tristan Cazenave LAMSADE - Universite Paris-Dauphine Paris,
More informationMonte-Carlo Tree Search and Minimax Hybrids
Monte-Carlo Tree Search and Minimax Hybrids Hendrik Baier and Mark H.M. Winands Games and AI Group, Department of Knowledge Engineering Faculty of Humanities and Sciences, Maastricht University Maastricht,
More informationMonte-Carlo Tree Search and Minimax Hybrids with Heuristic Evaluation Functions
Monte-Carlo Tree Search and Minimax Hybrids with Heuristic Evaluation Functions Hendrik Baier and Mark H.M. Winands Games and AI Group, Department of Knowledge Engineering Faculty of Humanities and Sciences,
More informationAdding expert knowledge and exploration in Monte-Carlo Tree Search
Adding expert knowledge and exploration in Monte-Carlo Tree Search Guillaume Chaslot, Christophe Fiter, Jean-Baptiste Hoock, Arpad Rimmel, Olivier Teytaud To cite this version: Guillaume Chaslot, Christophe
More informationApplication of UCT Search to the Connection Games of Hex, Y, *Star, and Renkula!
Application of UCT Search to the Connection Games of Hex, Y, *Star, and Renkula! Tapani Raiko and Jaakko Peltonen Helsinki University of Technology, Adaptive Informatics Research Centre, P.O. Box 5400,
More informationComparing UCT versus CFR in Simultaneous Games
Comparing UCT versus CFR in Simultaneous Games Mohammad Shafiei Nathan Sturtevant Jonathan Schaeffer Computing Science Department University of Alberta {shafieik,nathanst,jonathan}@cs.ualberta.ca Abstract
More informationαβ-based Play-outs in Monte-Carlo Tree Search
αβ-based Play-outs in Monte-Carlo Tree Search Mark H.M. Winands Yngvi Björnsson Abstract Monte-Carlo Tree Search (MCTS) is a recent paradigm for game-tree search, which gradually builds a gametree in a
More informationOn the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms
On the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms Fabien Teytaud, Olivier Teytaud To cite this version: Fabien Teytaud, Olivier Teytaud. On the Huge Benefit of Decisive Moves
More informationNested Monte-Carlo Search
Nested Monte-Carlo Search Tristan Cazenave LAMSADE Université Paris-Dauphine Paris, France cazenave@lamsade.dauphine.fr Abstract Many problems have a huge state space and no good heuristic to order moves
More informationRevisiting Monte-Carlo Tree Search on a Normal Form Game: NoGo
Revisiting Monte-Carlo Tree Search on a Normal Form Game: NoGo C.-W. Chou, Olivier Teytaud, Shi-Jim Yen To cite this version: C.-W. Chou, Olivier Teytaud, Shi-Jim Yen. Revisiting Monte-Carlo Tree Search
More informationGame-Tree Properties and MCTS Performance
Game-Tree Properties and MCTS Performance Hilmar Finnsson and Yngvi Björnsson School of Computer Science Reykjavík University, Iceland {hif,yngvi}@ru.is Abstract In recent years Monte-Carlo Tree Search
More informationGO for IT. Guillaume Chaslot. Mark Winands
GO for IT Guillaume Chaslot Jaap van den Herik Mark Winands (UM) (UvT / Big Grid) (UM) Partnership for Advanced Computing in EUROPE Amsterdam, NH Hotel, Industrial Competitiveness: Europe goes HPC Krasnapolsky,
More informationEnhancements for Monte-Carlo Tree Search in Ms Pac-Man
Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Tom Pepels Mark H.M. Winands Abstract In this paper enhancements for the Monte-Carlo Tree Search (MCTS) framework are investigated to play Ms Pac-Man.
More informationUCD : Upper Confidence bound for rooted Directed acyclic graphs
UCD : Upper Confidence bound for rooted Directed acyclic graphs Abdallah Saffidine a, Tristan Cazenave a, Jean Méhat b a LAMSADE Université Paris-Dauphine Paris, France b LIASD Université Paris 8 Saint-Denis
More informationComparison of Monte Carlo Tree Search Methods in the Imperfect Information Card Game Cribbage
Comparison of Monte Carlo Tree Search Methods in the Imperfect Information Card Game Cribbage Richard Kelly and David Churchill Computer Science Faculty of Science Memorial University {richard.kelly, dchurchill}@mun.ca
More informationA Comparative Study of Solvers in Amazons Endgames
A Comparative Study of Solvers in Amazons Endgames Julien Kloetzer, Hiroyuki Iida, and Bruno Bouzy Abstract The game of Amazons is a fairly young member of the class of territory-games. The best Amazons
More informationEnhancements for Monte-Carlo Tree Search in Ms Pac-Man
Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Tom Pepels June 19, 2012 Abstract In this paper enhancements for the Monte-Carlo Tree Search (MCTS) framework are investigated to play Ms Pac-Man.
More informationON THE TACTICAL AND STRATEGIC BEHAVIOUR OF MCTS WHEN BIASING RANDOM SIMULATIONS
On the tactical and strategic behaviour of MCTS when biasing random simulations 67 ON THE TACTICAL AND STATEGIC BEHAVIOU OF MCTS WHEN BIASING ANDOM SIMULATIONS Fabien Teytaud 1 Julien Dehos 2 Université
More informationPruning playouts in Monte-Carlo Tree Search for the game of Havannah
Pruning playouts in Monte-Carlo Tree Search for the game of Havannah Joris Duguépéroux, Ahmad Mazyad, Fabien Teytaud, Julien Dehos To cite this version: Joris Duguépéroux, Ahmad Mazyad, Fabien Teytaud,
More information43.1 Introduction. Foundations of Artificial Intelligence Introduction Monte-Carlo Methods Monte-Carlo Tree Search. 43.
May 6, 20 3. : Introduction 3. : Introduction Malte Helmert University of Basel May 6, 20 3. Introduction 3.2 3.3 3. Summary May 6, 20 / 27 May 6, 20 2 / 27 Board Games: Overview 3. : Introduction Introduction
More informationCreating a Havannah Playing Agent
Creating a Havannah Playing Agent B. Joosten August 27, 2009 Abstract This paper delves into the complexities of Havannah, which is a 2-person zero-sum perfectinformation board game. After determining
More informationTree Parallelization of Ary on a Cluster
Tree Parallelization of Ary on a Cluster Jean Méhat LIASD, Université Paris 8, Saint-Denis France, jm@ai.univ-paris8.fr Tristan Cazenave LAMSADE, Université Paris-Dauphine, Paris France, cazenave@lamsade.dauphine.fr
More informationThe Combinatorial Multi-Armed Bandit Problem and Its Application to Real-Time Strategy Games
Proceedings of the Ninth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment The Combinatorial Multi-Armed Bandit Problem and Its Application to Real-Time Strategy Games Santiago
More informationSimulation-Based Approach to General Game Playing
Simulation-Based Approach to General Game Playing Hilmar Finnsson and Yngvi Björnsson School of Computer Science Reykjavík University, Iceland {hif,yngvi}@ru.is Abstract The aim of General Game Playing
More informationAdversarial Reasoning: Sampling-Based Search with the UCT algorithm. Joint work with Raghuram Ramanujan and Ashish Sabharwal
Adversarial Reasoning: Sampling-Based Search with the UCT algorithm Joint work with Raghuram Ramanujan and Ashish Sabharwal Upper Confidence bounds for Trees (UCT) n The UCT algorithm (Kocsis and Szepesvari,
More informationProbability of Potential Model Pruning in Monte-Carlo Go
Available online at www.sciencedirect.com Procedia Computer Science 6 (211) 237 242 Complex Adaptive Systems, Volume 1 Cihan H. Dagli, Editor in Chief Conference Organized by Missouri University of Science
More informationBandit Algorithms Continued: UCB1
Bandit Algorithms Continued: UCB1 Noel Welsh 09 November 2010 Noel Welsh () Bandit Algorithms Continued: UCB1 09 November 2010 1 / 18 Annoucements Lab is busy Wednesday afternoon from 13:00 to 15:00 (Some)
More informationSufficiency-Based Selection Strategy for MCTS
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Sufficiency-Based Selection Strategy for MCTS Stefan Freyr Gudmundsson and Yngvi Björnsson School of Computer Science
More informationMonte Carlo Tree Search in a Modern Board Game Framework
Monte Carlo Tree Search in a Modern Board Game Framework G.J.B. Roelofs Januari 25, 2012 Abstract This article describes the abstraction required for a framework capable of playing multiple complex modern
More informationVirtual Global Search: Application to 9x9 Go
Virtual Global Search: Application to 9x9 Go Tristan Cazenave LIASD Dept. Informatique Université Paris 8, 93526, Saint-Denis, France cazenave@ai.univ-paris8.fr Abstract. Monte-Carlo simulations can be
More informationImproving Monte Carlo Tree Search Policies in StarCraft via Probabilistic Models Learned from Replay Data
Proceedings, The Twelfth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE-16) Improving Monte Carlo Tree Search Policies in StarCraft via Probabilistic Models Learned
More informationRecent Progress in Computer Go. Martin Müller University of Alberta Edmonton, Canada
Recent Progress in Computer Go Martin Müller University of Alberta Edmonton, Canada 40 Years of Computer Go 1960 s: initial ideas 1970 s: first serious program - Reitman & Wilcox 1980 s: first PC programs,
More informationFuego An Open-source Framework for Board Games and Go Engine Based on Monte-Carlo Tree Search
Fuego An Open-source Framework for Board Games and Go Engine Based on Monte-Carlo Tree Search Markus Enzenberger Martin Müller May 1, 2009 Abstract Fuego is an open-source software framework for developing
More informationUpper Confidence Trees with Short Term Partial Information
Author manuscript, published in "EvoGames 2011 6624 (2011) 153-162" DOI : 10.1007/978-3-642-20525-5 Upper Confidence Trees with Short Term Partial Information Olivier Teytaud 1 and Sébastien Flory 2 1
More informationAdversarial Search (I)
Adversarial Search (I) Instructor: Tsung-Che Chiang tcchiang@ieee.org Department of Computer Science and Information Engineering National Taiwan Normal University Artificial Intelligence, Spring, 2010
More informationAdversarial Search (I)
Adversarial Search (I) Instructor: Tsung-Che Chiang tcchiang@ieee.org Department of Computer Science and Information Engineering National Taiwan Normal University Artificial Intelligence, Spring, 2010
More informationOld-fashioned Computer Go vs Monte-Carlo Go
Old-fashioned Computer Go vs Monte-Carlo Go Bruno Bouzy Paris Descartes University, France CIG07 Tutorial April 1 st 2007 Honolulu, Hawaii 1 Outline Computer Go (CG) overview Rules of the game History
More informationMONTE-CARLO TWIXT. Janik Steinhauer. Master Thesis 10-08
MONTE-CARLO TWIXT Janik Steinhauer Master Thesis 10-08 Thesis submitted in partial fulfilment of the requirements for the degree of Master of Science of Artificial Intelligence at the Faculty of Humanities
More informationMonte Carlo Tree Search and AlphaGo. Suraj Nair, Peter Kundzicz, Kevin An, Vansh Kumar
Monte Carlo Tree Search and AlphaGo Suraj Nair, Peter Kundzicz, Kevin An, Vansh Kumar Zero-Sum Games and AI A player s utility gain or loss is exactly balanced by the combined gain or loss of opponents:
More informationMonte Carlo Tree Search. Simon M. Lucas
Monte Carlo Tree Search Simon M. Lucas Outline MCTS: The Excitement! A tutorial: how it works Important heuristics: RAVE / AMAF Applications to video games and real-time control The Excitement Game playing
More informationTowards Human-Competitive Game Playing for Complex Board Games with Genetic Programming
Towards Human-Competitive Game Playing for Complex Board Games with Genetic Programming Denis Robilliard, Cyril Fonlupt To cite this version: Denis Robilliard, Cyril Fonlupt. Towards Human-Competitive
More informationCurrent Frontiers in Computer Go
Current Frontiers in Computer Go Arpad Rimmel, Olivier Teytaud, Chang-Shing Lee, Shi-Jim Yen, Mei-Hui Wang, Shang-Rong Tsai To cite this version: Arpad Rimmel, Olivier Teytaud, Chang-Shing Lee, Shi-Jim
More informationThesis : Improvements and Evaluation of the Monte-Carlo Tree Search Algorithm. Arpad Rimmel
Thesis : Improvements and Evaluation of the Monte-Carlo Tree Search Algorithm Arpad Rimmel 15/12/2009 ii Contents Acknowledgements Citation ii ii 1 Introduction 1 1.1 Motivations............................
More informationNOTE 6 6 LOA IS SOLVED
234 ICGA Journal December 2008 NOTE 6 6 LOA IS SOLVED Mark H.M. Winands 1 Maastricht, The Netherlands ABSTRACT Lines of Action (LOA) is a two-person zero-sum game with perfect information; it is a chess-like
More informationMonte Carlo Methods for the Game Kingdomino
Monte Carlo Methods for the Game Kingdomino Magnus Gedda, Mikael Z. Lagerkvist, and Martin Butler Tomologic AB Stockholm, Sweden Email: firstname.lastname@tomologic.com arxiv:187.4458v2 [cs.ai] 15 Jul
More informationNested Monte Carlo Search for Two-player Games
Nested Monte Carlo Search for Two-player Games Tristan Cazenave LAMSADE Université Paris-Dauphine cazenave@lamsade.dauphine.fr Abdallah Saffidine Michael Schofield Michael Thielscher School of Computer
More informationNeural Networks Learning the Concept of Influence in Go
Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference Neural Networks Learning the Concept of Influence in Go Gabriel Machado Santos, Rita Maria Silva
More informationBy David Anderson SZTAKI (Budapest, Hungary) WPI D2009
By David Anderson SZTAKI (Budapest, Hungary) WPI D2009 1997, Deep Blue won against Kasparov Average workstation can defeat best Chess players Computer Chess no longer interesting Go is much harder for
More informationComputing Elo Ratings of Move Patterns in the Game of Go
Computing Elo Ratings of Move Patterns in the Game of Go Rémi Coulom To cite this veion: Rémi Coulom Computing Elo Ratings of Move Patterns in the Game of Go van den Herik, H Jaap and Mark Winands and
More informationThe Grand Challenge of Computer Go: Monte Carlo Tree Search and Extensions
The Grand Challenge of Computer Go: Monte Carlo Tree Search and Extensions Sylvain Gelly, Marc Schoenauer, Michèle Sebag, Olivier Teytaud, Levente Kocsis, David Silver, Csaba Szepesvari To cite this version:
More informationENHANCED REALIZATION PROBABILITY SEARCH
New Mathematics and Natural Computation c World Scientific Publishing Company ENHANCED REALIZATION PROBABILITY SEARCH MARK H.M. WINANDS MICC-IKAT Games and AI Group, Faculty of Humanities and Sciences
More informationHeuristic Move Pruning in Monte Carlo Tree Search for the Strategic Card Game Lords of War
Heuristic Move Pruning in Monte Carlo Tree Search for the Strategic Card Game Lords of War Nick Sephton, Peter I. Cowling, Edward Powley, and Nicholas H. Slaven York Centre for Complex Systems Analysis,
More informationMONTE CARLO TREE SEARCH (MCTS) is a method
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 4, NO. 1, MARCH 2012 1 A Survey of Monte Carlo Tree Search Methods Cameron B. Browne, Member, IEEE, Edward Powley, Member, IEEE, Daniel
More informationUsing Genetic Programming to Evolve Heuristics for a Monte Carlo Tree Search Ms Pac-Man Agent
Using Genetic Programming to Evolve Heuristics for a Monte Carlo Tree Search Ms Pac-Man Agent Atif M. Alhejali, Simon M. Lucas School of Computer Science and Electronic Engineering University of Essex
More informationOptimizing UCT for Settlers of Catan
Optimizing UCT for Settlers of Catan Gabriel Rubin Bruno Paz Felipe Meneguzzi Pontifical Catholic University of Rio Grande do Sul, Computer Science Department, Brazil A BSTRACT Settlers of Catan is one
More informationGame Algorithms Go and MCTS. Petr Baudiš, 2011
Game Algorithms Go and MCTS Petr Baudiš, 2011 Outline What is Go and why is it interesting Possible approaches to solving Go Monte Carlo and UCT Enhancing the MC simulations Enhancing the tree search Automatic
More informationImplementation of Upper Confidence Bounds for Trees (UCT) on Gomoku
Implementation of Upper Confidence Bounds for Trees (UCT) on Gomoku Guanlin Zhou (gz2250), Nan Yu (ny2263), Yanqing Dai (yd2369), Yingtao Zhong (yz3276) 1. Introduction: Reinforcement Learning for Gomoku
More informationMove Prediction in Go Modelling Feature Interactions Using Latent Factors
Move Prediction in Go Modelling Feature Interactions Using Latent Factors Martin Wistuba and Lars Schmidt-Thieme University of Hildesheim Information Systems & Machine Learning Lab {wistuba, schmidt-thieme}@ismll.de
More informationEvoMCTS: Enhancing MCTS-Based Players through Genetic Programming
EvoMCTS: Enhancing MCTS-Based Players through Genetic Programming Amit Benbassat Computer Science Department Ben-Gurion University of the Negev Beer-Sheva, Israel Email: amitbenb@cs.bgu.ac.il Moshe Sipper
More informationA Multi Armed Bandit Formulation of Cognitive Spectrum Access
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationMore on games (Ch )
More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to modify the minimax algorithm to prune only bad searches (i.e. alpha-beta pruning) This rule of checking
More informationEfficient Selectivity and Backup Operators in Monte-Carlo Tree Search
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search Rémi Coulom To cite this version: Rémi Coulom. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search. Paolo Ciancarini
More informationA Monte Carlo Approach for Football Play Generation
A Monte Carlo Approach for Football Play Generation Kennard Laviers School of EECS U. of Central Florida Orlando, FL klaviers@eecs.ucf.edu Gita Sukthankar School of EECS U. of Central Florida Orlando,
More informationUCT for Tactical Assault Planning in Real-Time Strategy Games
Proceedings of the Twenty-First International Joint Conference on Artificial Intelligence (IJCAI-09) UCT for Tactical Assault Planning in Real-Time Strategy Games Radha-Krishna Balla and Alan Fern School
More informationArtificial Intelligence for Games
Artificial Intelligence for Games Bruno Bouzy, Tristan Cazenave, Vincent Corruble, and Olivier Teytaud Abstract This chapter presents the classical alpha-beta algorithm and several variants, Monte Carlo
More informationTRIAL-BASED HEURISTIC TREE SEARCH FOR FINITE HORIZON MDPS. Thomas Keller and Malte Helmert Presented by: Ryan Berryhill
TRIAL-BASED HEURISTIC TREE SEARCH FOR FINITE HORIZON MDPS Thomas Keller and Malte Helmert Presented by: Ryan Berryhill Outline Motivation Background THTS framework THTS algorithms Results Motivation Advances
More informationThe Parameterized Poker Squares EAAI NSG Challenge
The Parameterized Poker Squares EAAI NSG Challenge What is the EAAI NSG Challenge? Goal: a fun way to encourage good, faculty-mentored undergraduate research experiences that includes an option for peer-reviewed
More informationEvolutionary MCTS for Multi-Action Adversarial Games
Evolutionary MCTS for Multi-Action Adversarial Games Hendrik Baier Digital Creativity Labs University of York York, UK hendrik.baier@york.ac.uk Peter I. Cowling Digital Creativity Labs University of York
More informationTheory of Computer Games
Theory of Computer Games Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Goal Course name: Theory of Computer Games Prerequisite: Computer Programming, and Data Structure and
More informationCS 387: GAME AI BOARD GAMES
CS 387: GAME AI BOARD GAMES 5/28/2015 Instructor: Santiago Ontañón santi@cs.drexel.edu Class website: https://www.cs.drexel.edu/~santi/teaching/2015/cs387/intro.html Reminders Check BBVista site for the
More informationInvestigations with Monte Carlo Tree Search for finding better multivariate Horner schemes
Investigations with Monte Carlo Tree Search for finding better multivariate Horner schemes H. Jaap van den Herik, Jan Kuipers, 2 Jos A.M. Vermaseren 2, and Aske Plaat Tilburg University, Tilburg center
More informationAN MCTS AGENT FOR EINSTEIN WÜRFELT NICHT! Emanuel Oster. Master Thesis DKE 15-19
AN MCTS AGENT FOR EINSTEIN WÜRFELT NICHT! Emanuel Oster Master Thesis DKE 15-19 Thesis submitted in partial fulfilment of the requirements for the degree of Master of Science of Artificial Intelligence
More informationAddressing NP-Complete Puzzles with Monte-Carlo Methods 1
Addressing NP-Complete Puzzles with Monte-Carlo Methods 1 Maarten P.D. Schadd and Mark H.M. Winands H. Jaap van den Herik and Huib Aldewereld 2 Abstract. NP-complete problems are a challenging task for
More informationAI, AlphaGo and computer Hex
a math and computing story computing.science university of alberta 2018 march thanks Computer Research Hex Group Michael Johanson, Yngvi Björnsson, Morgan Kan, Nathan Po, Jack van Rijswijck, Broderick
More informationComputer Go: from the Beginnings to AlphaGo. Martin Müller, University of Alberta
Computer Go: from the Beginnings to AlphaGo Martin Müller, University of Alberta 2017 Outline of the Talk Game of Go Short history - Computer Go from the beginnings to AlphaGo The science behind AlphaGo
More informationCombinatorial games: from theoretical solving to AI algorithms
Combinatorial games: from theoretical solving to AI algorithms Eric Duchene To cite this version: Eric Duchene. Combinatorial games: from theoretical solving to AI algorithms. SUM, Sep 2016, NIce, France.
More informationAdvanced Game AI. Level 6 Search in Games. Prof Alexiei Dingli
Advanced Game AI Level 6 Search in Games Prof Alexiei Dingli MCTS? MCTS Based upon Selec=on Expansion Simula=on Back propaga=on Enhancements The Mul=- Armed Bandit Problem At each step pull one arm Noisy/random
More informationMonte-Carlo Game Tree Search: Advanced Techniques
Monte-Carlo Game Tree Search: Advanced Techniques Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Abstract Adding new ideas to the pure Monte-Carlo approach for computer Go.
More informationMonte Carlo Go Has a Way to Go
Haruhiro Yoshimoto Department of Information and Communication Engineering University of Tokyo, Japan hy@logos.ic.i.u-tokyo.ac.jp Monte Carlo Go Has a Way to Go Kazuki Yoshizoe Graduate School of Information
More informationTTIC 31230, Fundamentals of Deep Learning David McAllester, April AlphaZero
TTIC 31230, Fundamentals of Deep Learning David McAllester, April 2017 AlphaZero 1 AlphaGo Fan (October 2015) AlphaGo Defeats Fan Hui, European Go Champion. 2 AlphaGo Lee (March 2016) 3 AlphaGo Zero vs.
More informationCS 380: ARTIFICIAL INTELLIGENCE MONTE CARLO SEARCH. Santiago Ontañón
CS 380: ARTIFICIAL INTELLIGENCE MONTE CARLO SEARCH Santiago Ontañón so367@drexel.edu Recall: Adversarial Search Idea: When there is only one agent in the world, we can solve problems using DFS, BFS, ID,
More informationMultiple Tree for Partially Observable Monte-Carlo Tree Search
Multiple Tree for Partially Observable Monte-Carlo Tree Search David Auger To cite this version: David Auger. Multiple Tree for Partially Observable Monte-Carlo Tree Search. 2011. HAL
More informationSymbolic Classification of General Two-Player Games
Symbolic Classification of General Two-Player Games Stefan Edelkamp and Peter Kissmann Technische Universität Dortmund, Fakultät für Informatik Otto-Hahn-Str. 14, D-44227 Dortmund, Germany Abstract. In
More informationCS229 Project: Building an Intelligent Agent to play 9x9 Go
CS229 Project: Building an Intelligent Agent to play 9x9 Go Shawn Hu Abstract We build an AI to autonomously play the board game of Go at a low amateur level. Our AI uses the UCT variation of Monte Carlo
More informationSmall and large MCTS playouts applied to Chinese Dark Chess stochastic game
Small and large MCTS playouts applied to Chinese Dark Chess stochastic game Nicolas Jouandeau 1 and Tristan Cazenave 2 1 LIASD, Université de Paris 8, France n@ai.univ-paris8.fr 2 LAMSADE, Université Paris-Dauphine,
More informationUsing Neural Network and Monte-Carlo Tree Search to Play the Game TEN
Using Neural Network and Monte-Carlo Tree Search to Play the Game TEN Weijie Chen Fall 2017 Weijie Chen Page 1 of 7 1. INTRODUCTION Game TEN The traditional game Tic-Tac-Toe enjoys people s favor. Moreover,
More informationBlunder Cost in Go and Hex
Advances in Computer Games: 13th Intl. Conf. ACG 2011; Tilburg, Netherlands, Nov 2011, H.J. van den Herik and A. Plaat (eds.), Springer-Verlag Berlin LNCS 7168, 2012, pp 220-229 Blunder Cost in Go and
More informationMonte Carlo Tree Search and Related Algorithms for Games
25 Monte Carlo Tree Search and Related Algorithms for Games Nathan R. Sturtevant 25.1 Introduction 25.2 Background 25.3 Algorithm 1: Online UCB1 25.4 Algorithm 2: Regret Matching 25.5 Algorithm 3: Offline
More information