Search CORE

7 research outputs found

Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups

Author: Lanctot Marc
Pepels Tom
Sturtevant Nathan R.
Winands Mark H. M.
Publication venue
Publication date: 01/01/2014
Field of study

Monte Carlo Tree Search (MCTS) has improved the performance of game engines in domains such as Go, Hex, and general game playing. MCTS has been shown to outperform classic alpha-beta search in games where good heuristic evaluations are difficult to obtain. In recent years, combining ideas from traditional minimax search in MCTS has been shown to be advantageous in some domains, such as Lines of Action, Amazons, and Breakthrough. In this paper, we propose a new way to use heuristic evaluations to guide the MCTS search by storing the two sources of information, estimated win rates and heuristic evaluations, separately. Rather than using the heuristic evaluations to replace the playouts, our technique backs them up implicitly during the MCTS simulations. These minimax values are then used to guide future simulations. We show that using implicit minimax backups leads to stronger play performance in Kalah, Breakthrough, and Lines of Action.Comment: 24 pages, 7 figures, 9 tables, expanded version of paper presented at IEEE Conference on Computational Intelligence and Games (CIG) 2014 conferenc

arXiv.org e-Print Archive

CiteSeerX

Maastricht University Research Portal

Crossref

REPORT ON THE CIG'12 MS PAC-MAN VS GHOSTS COMPETITION 2012 IEEE Conference on Computational Intelligence in Games (CIG'12) Granada, Spain 11-14 September, 2012

Author: Pepels Tom
Publication venue: 'IOS Press'
Publication date: 01/12/2012
Field of study

Maastricht University Research Portal

Enhancements for Monte-Carlo Tree Search in Ms Pac-Man

Author: Pepels Tom
Winands Mark H M
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

Abstract—In this paper enhancements for the Monte-Carlo Tree Search (MCTS) framework are investigated to play Ms Pac-Man. MCTS is used to find an optimal path for an agent at each turn, determining the move to make based on randomised simulations. Ms Pac-Man is a real-time arcade game, in which the protagonist has several independent goals but no conclusive terminal state. Unlike games such as Chess or Go there is no state in which the player wins the game. Furthermore, the Pac-Man agent has to compete with a range of different ghost agents, hence limited assumptions can be made about the opponent’s behaviour. In order to expand the capabilities of existing MCTS agents, five enhancements are discussed: 1) a variable depth tree, 2) playout strategies for the ghost-team and Pac-Man, 3) including long-term goals in scoring, 4) endgame tactics, and 5) a Last-Good-Reply policy for memorising rewarding moves during playouts. An average performance gain of 40,962 points, compared to the average score of the top scoring Pac-Man agent during the CIG’11, is achieved by employing these methods. I

Maastricht University Research Portal

CiteSeerX

Real-Time Monte Carlo Tree Search in Ms Pac-Man

Author: Lanctot Marc
Pepels Tom
Winands Mark H. M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2014
Field of study

Maastricht University Research Portal

Sequential halving for partially observable games

Author: Cazenave Tristan
Pepels Tom
Winands Mark H M
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

This paper investigates Sequential Halving as a selection policy in the following four partially observable games: Go Fish, Lost Cities, Phantom Domineering, and Phantom Go. Additionally, H-MCTS is studied, which uses Sequential Halving at the root of the search tree, and UCB elsewhere. Experimental results reveal that H-MCTS performs the best in Go Fish, whereas its performance is on par in Lost Cities and Phantom Domineering. Sequential Halving as a flat Monte-Carlo Search appears to be the stronger technique in Phantom Go

Maastricht University Research Portal

CiteSeerX

Quality-based Rewards for Monte-Carlo Tree Search Simulations

Author: Lanctot Marc
Pepels Tom
Tak Mandy J. W.
Winands Mark H. M.
Publication venue: 'IOS Press'
Publication date: 01/01/2014
Field of study

Maastricht University Research Portal

Strategien der Internationalisierung

Author: Ah Keng Kau
Alain Verbeke
Albrecht Söllner
Alfred D. Chandler
Amir Manzoor
Andrew J. Rohm
Astrid Jagenberg
Bernd Camphausen
Bernd W. Wirtz
Bruce D. Keillor
Bruce D. Keillor
C. P. Rao
Christine K. Volkmann
David Luna
Dianne Cyr
Dirk Holtbrügge
Farhad Analoui
Franklin R. Root
Franz X. Bea
Gerhard Girmscheid
Gerrit Heinemann
Gunter Stahr
Guo Zhang
Günter Wöhe
Hadi S. Alhorr
Hans-Dieter Haas
Heinrich J. Rommelfanger
Herbert Strunz
Heribert Meffert
Holger Luczak
Igal Ayal
Ingolf Bamberger
James B. Quinn
Jean J. Boddewyn
Johann Engelhard
Jürgen Nebel
Klaus Backhaus
Lisa Fröhlich
Lutz Krafft
Manfred Fuchs
Manfred Perlitz
Marhild von Behr
Marieke de Mooij
Mario Glowik
Mark Brown
Martin K. Welge
Michael Kutschker
Michael Lewitt
Michael Schenk
Naveen Donthu
Nitish Singh
Nitish Singh
Nitish Singh
Norbert Kailer
Patrick D. Lynch
Peter Loevenich
Predeep K. Korgaonkar
Qi Wang
Ralph Berndt
Ralph Berndt
Reinhard Meckl
Robert D. Buzzell
Rolf F. Scholl
Rudolf R. Sinkovits
Ruth Melzer-Ridinger
Stefan Busch
Stefan Schmid
Steven J. Simon
Susan P. Douglas
Sönke Albers
Tamer Cavusgil
Tobias Kollmann
Tom Duncan
Ulrich Krystek
Waltraud Martius
Warren J. Keagan
Werner Kirsch
Werner Pepels
Wolfgang Hammes
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref