83 research outputs found

Preference-Based Monte Carlo Tree Search

Author: A Rimmel
CB Browne
CS Lee
D Silver
J Fürnkranz
JD Knowles
L Kocsis
LL Thurstone
ML Puterman
P Auer
R Busa-Fekete
RS Sutton
T Pepels
Y Yue
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/07/2018
Field of study

Monte Carlo tree search (MCTS) is a popular choice for solving sequential anytime problems. However, it depends on a numeric feedback signal, which can be difficult to define. Real-time MCTS is a variant which may only rarely encounter states with an explicit, extrinsic reward. To deal with such cases, the experimenter has to supply an additional numeric feedback signal in the form of a heuristic, which intrinsically guides the agent. Recent work has shown evidence that in different areas the underlying structure is ordinal and not numerical. Hence erroneous and biased heuristics are inevitable, especially in such domains. In this paper, we propose an MCTS variant which only depends on qualitative feedback, and therefore opens up new applications for MCTS. We also find indications that translating absolute into ordinal feedback may be beneficial. Using a puzzle domain, we show that our preference-based MCTS variant, which only receives qualitative feedback, is able to reach a performance level comparable to a regular MCTS baseline, which obtains quantitative feedback.

arXiv.org e-Print Archive

Crossref

Block Copolymers of “PE-Like” Poly(pentadecalactone) and Poly( l

Author: Anne B. Spoelstra
Cor E. Koning
Fox T. G.
Han Goossens
Mark P. F. Pepels
Rob Duchateau
Rob Kleijnen
Wilma P. Hofman
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

Survival and axillary recurrence following sentinel node-positive breast cancer without completion axillary lymph node dissection: the randomized controlled SENOMAC trial

Author: A Gondos
A Lucci
AC Degnim
AE Giuliano
AS Caudle
B Fisher
BV Offersen
CL Carter
Dan Lundstedt
DN Krag
F Celebioglu
H Sackey
Hemming Johansson
I Soerjomataram
IM Ploeg van der
J Park
Jan Frisell
Jana de Boniface
Johan Ahlgren
K Bjordal
Leif Bergkvist
Lisa Rydén
M Donker
M Herdman
M Schmidt-Hansen
MA Sprangers
Malin Sund
MJ Pepels
N Devoogdt
NK Aaronson
OE Nieweg
Roger Olofsson Bagge
S Latosinsky
SM Gainer
SR Land
T Kuehn
TC Kenny
U Veronesi
V Galimberti
Y Andersson
Yvette Andersson
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Strokes services gespiegeld

Author: Huijsman Robbert
Kool T
Nieboer Anna Petra
Pepels R
van Have L
Publication venue: Prismant
Publication date: 01/01/2005
Field of study

EUR Research Repository

In situ compatibilisation of alkenyl-terminated polymer blends using cross metathesis

Author: Descour C.D.
Duchateau R.
Macko T.
Pepels M.P.F.
Schreur - Piet I.
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/01/2015
Field of study

Several compatibilised polyolefin-based blends have been obtained via rather simple and robust chemistry: olefin cross metathesis using Grubbs' second-generation catalyst (G2) of alkenyl-terminated macromolecules of different nature. The viability of the concept was first demonstrated for low molecular weight polyolefin macromolecules before being extended to higher molecular weight polymers, including polar ones such as poly(ε-caprolactone) (PCL), poly(pentadecalactone) (PPDL) and poly(methylmethacrylate) (PMMA). When taking all the possible cross metathesis reactions into account, a statistical distribution of homopolymers and diblock copolymers is likely to be formed. While clear macrophase separation is visible in the uncompatibilised blends of macromolecules, it is absent for the in situ compatibilised products, as was confirmed by optical microscopy. It was demonstrated that even small amounts of diblock copolymers can effectively compatibilise the two phases. All materials were analysed by HT SEC, DSC, HT HPLC and optical microscopy. Such a proof of principle indicates that using cross metathesis on a large library of macromolecules might be a versatile "synthetic handle" to reach a variety of in situ compatibilised blends.

Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search

Author: CB Browne
E Kaufmann
L Kocsis
P Auer
RS Sutton
T Cazenave
T Pepels
TL Lai
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Markenparks

Author: B Behrens
C Bäuchee
E Jacob
G Schulze
H Meffert
HJ Kagelmann
HJ Kiel
HR Scherrieb
HW Opaschowski
J Hofer
K Ahrens
KE Goehrmann
O Nickel
O Zils
R Linxweiler
T Bieger
W Pepels
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Crossref

Minimizing Simple and Cumulative Regret in Monte-Carlo Tree Search

Author: A. Rimmel
B. Arneson
C. Browne
L. Kocsis
M.H.M. Winands
P. Auer
R. Coulom
S. Bubeck
T. Cazenave
T. Pepels
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Regret minimization is important in both the Multi-Armed Bandit problem and Monte-Carlo Tree Search (MCTS). Recently, simple regret, i.e., the regret of not recommending the best action, has been proposed as an alternative to cumulative regret in MCTS, i.e., regret accumulated over time. Each type of regret is appropriate in different contexts. Although the majority of MCTS research applies the UCT selection policy for minimizing cumulative regret in the tree, this paper introduces a new MCTS variant, Hybrid MCTS (H-MCTS), which minimizes both types of regret in different parts of the tree. H-MCTS uses SHOT, a recursive version of Sequential Halving, to minimize simple regret near the root, and UCT to minimize cumulative regret when descending further down the tree. We discuss the motivation for this new search technique, and show the performance of H-MCTS in six distinc

Maastricht University Research Portal

CiteSeerX

Crossref

Kommunikationspolitik im Spannungsfeld von Unternehmen, Medien und Politik

Author: C Moeskes
E Gärtner
H Schuh
L Rolke
M Allmaier
P Gross
T Leif
U Manz
W Pepels
W Preusker
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003
Field of study

Crossref