30 research outputs found

    Distributed Nested Rollout Policy for Same Game

    Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search heuristic for puzzles and other optimization problems. It achieves state-of-the-art performance on several games, including SameGame. In this paper, we design several parallel and distributed NRPA-based search techniques, and we provide a number of experimental insights about their execution. Finally, we use our best implementation to discover improved scores for 15 of the 20 standard SameGame boards.
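
    To make the nesting-plus-adaptation structure concrete, here is a minimal single-threaded NRPA sketch (not the paper's parallel or distributed variants). It assumes a SameGame-like `State` class with clone(), terminal(), legal_moves(), play(), score(), and a hashable code(move) key for (state, move) pairs; all of these names are illustrative.

```python
import math
import random

# Minimal NRPA sketch (single-threaded). `State` is an assumed interface:
# clone(), terminal(), legal_moves(), play(move), score(), code(move).

def playout(state, policy):
    """One rollout: sample each move with softmax weights from `policy`."""
    sequence = []
    while not state.terminal():
        moves = state.legal_moves()
        weights = [math.exp(policy.get(state.code(m), 0.0)) for m in moves]
        move = random.choices(moves, weights=weights)[0]
        sequence.append(move)
        state.play(move)
    return state.score(), sequence

def adapt(policy, root, sequence, alpha=1.0):
    """Shift the policy toward the best sequence found so far."""
    state = root.clone()
    for best in sequence:
        moves = state.legal_moves()
        z = sum(math.exp(policy.get(state.code(m), 0.0)) for m in moves)
        for m in moves:  # subtract the softmax probability of each move
            g = math.exp(policy.get(state.code(m), 0.0)) / z
            policy[state.code(m)] = policy.get(state.code(m), 0.0) - alpha * g
        policy[state.code(best)] = policy.get(state.code(best), 0.0) + alpha
        state.play(best)

def nrpa(level, root, policy, iterations=100):
    """Each level runs `iterations` searches of the level below and adapts."""
    if level == 0:
        return playout(root.clone(), policy)
    best_score, best_seq = float("-inf"), []
    pol = dict(policy)  # each level adapts its own copy of the policy
    for _ in range(iterations):
        score, seq = nrpa(level - 1, root, pol, iterations)
        if score > best_score:
            best_score, best_seq = score, seq
        adapt(pol, root, best_seq)
    return best_score, best_seq
```

    Parallel variants distribute the iterations of a level across workers; how playouts and policy updates are then shared is the kind of design space the paper explores.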

    Investigating the Limits of Monte-Carlo Tree Search Methods in Computer Go

    Monte-Carlo Tree Search for the Physical Travelling Salesman Problem

    The significant success of MCTS in recent years, particularly in the game of Go, has led to the application of MCTS to numerous other domains. In an ongoing effort to better understand the performance of MCTS in open-ended real-time video games, we apply MCTS to the Physical Travelling Salesman Problem (PTSP). We discuss different approaches to tailor MCTS to this particular problem domain and subsequently identify and attempt to overcome some of the apparent shortcomings. Results show that suitable heuristics can boost the performance of MCTS significantly in this domain. However, visualisations of the search indicate that MCTS currently seeks solutions in a rather greedy manner, and coercing it to balance short-term and long-term constraints for the PTSP remains an open problem.
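
    For context, the tree-descent step of most MCTS agents, including typical PTSP players, is UCB1/UCT selection; below is a generic sketch, not the paper's agent. `visits`, `total_reward`, and `children` are assumed node attributes.

```python
import math

def ucb1_select(node, c=math.sqrt(2)):
    """Pick the child maximizing the UCB1 score: mean reward + exploration."""
    def ucb(child):
        if child.visits == 0:
            return float("inf")  # try every child at least once
        exploit = child.total_reward / child.visits
        explore = c * math.sqrt(math.log(node.visits) / child.visits)
        return exploit + explore
    return max(node.children, key=ucb)
```

    One reading of the greediness the authors observe is that the exploitation term dominates under tight real-time budgets; domain heuristics enter through the reward that updates total_reward.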

    Learning a Move-Generator for Upper Confidence Trees

    We experiment with introducing machine learning tools to improve Monte-Carlo Tree Search. More precisely, we propose using Direct Policy Search, a classical reinforcement learning paradigm, to learn the Monte-Carlo move generator. We evaluate our algorithm on different forms of the unit commitment problem, including experiments on a problem with both macro-level and micro-level decisions.
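
    As a rough sketch of the idea, the move generator can be a softmax over linear move features, with Direct Policy Search optimizing the feature weights against simulated reward. Everything below is illustrative: `evaluate(weights)` is a hypothetical function running Monte-Carlo playouts with the parameterized generator, and the (1+1)-style local search merely stands in for whichever optimizer the paper actually uses.

```python
import math
import random

def softmax_sample(weights, features_per_move):
    """Sample a move index from a softmax over linear feature scores."""
    scores = [sum(w * f for w, f in zip(weights, feats))
              for feats in features_per_move]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]  # shift for numerical stability
    return random.choices(range(len(exps)), weights=exps)[0]

def direct_policy_search(evaluate, dim, iters=200, sigma=0.1):
    """(1+1)-style search directly in policy-parameter space."""
    best = [0.0] * dim
    best_val = evaluate(best)  # hypothetical: mean reward over playouts
    for _ in range(iters):
        cand = [w + random.gauss(0.0, sigma) for w in best]
        val = evaluate(cand)
        if val >= best_val:
            best, best_val = cand, val
    return best, best_val
```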

    Challenging Established Move Ordering Strategies with Adaptive Data Structures

    The field of game playing is a particularly well-studied area within the context of AI, leading to the development of powerful techniques, such as alpha-beta search, capable of achieving competitive game play against an intelligent opponent. It is well known that tree pruning strategies, such as alpha-beta, benefit strongly from proper move ordering, that is, searching the best element first. Inspired by the formerly unrelated field of Adaptive Data Structures (ADSs), we previously introduced the History-ADS technique, which employs an adaptive list to achieve effective and dynamic move ordering in a domain-independent fashion, and found that it performs well in a wide range of cases. However, previous work did not compare the performance of the History-ADS heuristic to any established move ordering strategy. To address this, we present here a comparison to two well-known, acclaimed strategies that operate on a philosophy similar to the History-ADS: the History Heuristic and the Killer Moves technique. We find that, in a wide range of two-player and multi-player games, at various points in the game's progression, the History-ADS performs at least as well as these strategies and, in fact, outperforms them in the majority of cases.
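
    A minimal sketch of the adaptive-list idea behind History-ADS: moves that cause alpha-beta cutoffs are promoted (here with the classic move-to-front update) and searched first on later visits. The paper's exact update rule and data structure may differ.

```python
class AdaptiveMoveList:
    """Domain-independent move ordering via an adaptive list (sketch)."""

    def __init__(self):
        self.order = []  # most recently promoted moves first

    def rank(self, moves):
        """Order `moves`: previously promoted moves first, rest unchanged."""
        known = [m for m in self.order if m in moves]
        rest = [m for m in moves if m not in self.order]
        return known + rest

    def promote(self, move):
        """Move-to-front on an alpha-beta cutoff."""
        if move in self.order:
            self.order.remove(move)
        self.order.insert(0, move)
```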

    On Semeai Detection in Monte-Carlo Go

    Minimizing Simple and Cumulative Regret in Monte-Carlo Tree Search

    Regret minimization is important in both the Multi-Armed Bandit problem and Monte-Carlo Tree Search (MCTS). Recently, simple regret, i.e., the regret of not recommending the best action, has been proposed as an alternative to cumulative regret in MCTS, i.e., regret accumulated over time. Each type of regret is appropriate in different contexts. Although the majority of MCTS research applies the UCT selection policy for minimizing cumulative regret in the tree, this paper introduces a new MCTS variant, Hybrid MCTS (H-MCTS), which minimizes both types of regret in different parts of the tree. H-MCTS uses SHOT, a recursive version of Sequential Halving, to minimize simple regret near the root, and UCT to minimize cumulative regret when descending further down the tree. We discuss the motivation for this new search technique, and show the performance of H-MCTS in six distinct…
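
    A minimal sketch of Sequential Halving at a single node, the pure-exploration routine H-MCTS builds on near the root (SHOT applies it recursively down the tree). `sample(arm)` is an assumed playout call returning a reward; the budget split follows the standard halving schedule.

```python
import math

def sequential_halving(arms, sample, budget):
    """Return the empirically best arm after budget-limited halving rounds."""
    stats = {a: [0.0, 0] for a in arms}  # arm -> [total reward, pulls]
    survivors = list(arms)
    rounds = max(1, math.ceil(math.log2(len(arms))))
    while len(survivors) > 1:
        per_arm = max(1, budget // (len(survivors) * rounds))
        for a in survivors:
            for _ in range(per_arm):
                stats[a][0] += sample(a)  # assumed Monte-Carlo playout
                stats[a][1] += 1
        # keep the better-scoring half of the surviving arms
        survivors.sort(key=lambda a: stats[a][0] / stats[a][1], reverse=True)
        survivors = survivors[: max(1, len(survivors) // 2)]
    return survivors[0]
```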