39 research outputs found

    Pilot, Rollout and Monte Carlo Tree Search Methods for Job Shop Scheduling

    Greedy heuristics may be improved by looking ahead at each possible choice, in an approach called the rollout or Pilot method. These methods may be seen as meta-heuristics that can enhance (any) heuristic solution by repeatedly modifying a master solution: as in game tree search, better choices are identified using lookahead, based on solutions obtained by repeatedly applying a greedy heuristic. This paper first illustrates how the Pilot method improves upon some simple, well-known dispatch heuristics for the job-shop scheduling problem. The Pilot method is then shown to be a special case of the more recent Monte Carlo Tree Search (MCTS) methods: unlike the Pilot method, MCTS methods use random completion of partial solutions to identify promising branches of the tree. The Pilot method and a simple version of MCTS, using the ε-greedy exploration paradigm, are then compared within the same framework, consisting of 300 scheduling problems of varying sizes with a fixed budget of rollouts. Results demonstrate that MCTS matches or improves on the Pilot method in this context. (Comment: Learning and Intelligent OptimizatioN (LION 6), LNCS 7219, 2012.)
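
    To make the lookahead idea concrete, here is a minimal Python sketch of the Pilot (rollout) step described in the abstract; the problem interface (candidates, apply, greedy_complete, makespan, done) is a hypothetical abstraction introduced for this sketch, not code from the paper.

        # Minimal sketch of the Pilot (rollout) method. Every helper is
        # supplied by the caller; names are illustrative assumptions.
        def pilot_step(partial, candidates, apply, greedy_complete, makespan):
            """Pick the next choice by one-step lookahead with greedy rollouts."""
            best_choice, best_cost = None, float("inf")
            for c in candidates(partial):
                # Tentatively commit to choice c, then finish the schedule
                # with the base greedy dispatch heuristic.
                completed = greedy_complete(apply(partial, c))
                cost = makespan(completed)
                if cost < best_cost:
                    best_choice, best_cost = c, cost
            return best_choice

        def pilot_method(empty, candidates, apply, greedy_complete, makespan, done):
            """Build a full solution by repeated lookahead on a master solution."""
            partial = empty
            while not done(partial):
                choice = pilot_step(partial, candidates, apply,
                                    greedy_complete, makespan)
                partial = apply(partial, choice)
            return partial

    Replacing greedy_complete with a randomized completion policy, and averaging several such rollouts per candidate, yields the MCTS-style variant the paper compares against.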

    Power ultrasound irradiation during the alkaline etching process of the 2024 aluminium alloy


    Continuous Upper Confidence Trees

    Upper Confidence Trees are a very efficient tool for solving Markov Decision Processes; originating in difficult games such as Go, they are surprisingly efficient even in high-dimensional problems. It is known that they can be adapted to continuous domains in some cases (in particular continuous action spaces). We present an extension of Upper Confidence Trees to continuous stochastic problems. We (i) exhibit a deceptive problem on which the classical Upper Confidence Tree approach fails, even with arbitrarily large computational power and with progressive widening; (ii) propose an improvement, termed double progressive widening, which handles the compromise between variance (we want infinitely many simulations for each action/state) and bias (we want sufficiently many nodes to avoid a bias from the first nodes) and which extends classical progressive widening; and (iii) discuss its consistency and show experimentally that it performs well on the deceptive problem and on experimental benchmarks. We conjecture that the double progressive widening trick can be used in other algorithms as well, as a general tool for ensuring a good bias/variance compromise in search algorithms.
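
    The core mechanism, double progressive widening, can be sketched in a few lines of Python. In this sketch the constants C and ALPHA and the helpers sample_action and simulate are illustrative assumptions (not the paper's code), next states are assumed hashable, and a plain random choice stands in for the UCB selection rule.

        import random

        # Illustrative widening constants: at most C * n**ALPHA branches
        # are allowed after n visits.
        C, ALPHA = 1.0, 0.5

        class Node:
            def __init__(self):
                self.visits = 0
                self.children = {}          # action -> {next_state: Node}

        def dpw_step(node, state, sample_action, simulate):
            """One selection step with double progressive widening."""
            node.visits += 1
            # First widening: grow the action set only while
            # |actions| <= C * visits**ALPHA.
            if len(node.children) <= C * node.visits ** ALPHA:
                node.children.setdefault(sample_action(state), {})
            action = random.choice(list(node.children))   # stand-in for UCB
            outcomes = node.children[action]
            # Second widening (the "double" part): grow the set of sampled
            # next states for this action under the same kind of bound,
            # which keeps the tree from drowning in distinct outcomes.
            n_action = 1 + sum(child.visits for child in outcomes.values())
            if len(outcomes) <= C * n_action ** ALPHA:
                outcomes.setdefault(simulate(state, action), Node())
            next_state = random.choice(list(outcomes))
            return action, next_state, outcomes[next_state]

    Capping both the number of actions and the number of sampled outcomes per action is what balances the bias/variance trade-off the abstract describes: revisiting existing outcome nodes drives variance down, while the slowly growing bounds still let the tree deepen.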