Learning a Move-Generator for Upper Confidence Trees

Couetoux, Adrien; Doghmen, Hassen; Teytaud, Olivier

Authors: Adrien Couetoux, Hassen Doghmen, Olivier Teytaud
Publication date: 12 December 2012
Publisher: HAL CCSD

Abstract

We experiment with introducing machine learning tools to improve Monte-Carlo Tree Search. More precisely, we propose using Direct Policy Search, a classical reinforcement learning paradigm, to learn the Monte-Carlo move generator. We test our algorithm on different forms of unit commitment problems, including a problem with both macro-level and micro-level decisions.

Available Versions

HAL-CentraleSupelec

oai:HAL:hal-00759822v1

INRIA a CCSD electronic archive server