Hypervolume-based multi-objective reinforcement learning

Drugan, MM Madalina; Nowé, A Ann; Van Moffaert, K Kristof

Hypervolume-based multi-objective reinforcement learning

Authors: MM Madalina Drugan
A Ann Nowé
K Kristof Van Moffaert
Publication date: 1 January 2013
Publisher: 'Springer Fachmedien Wiesbaden GmbH'

Abstract

Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-objective optimization (MOO) problems. In reinforcement learning (RL), introducing a quality indicator in an algorithm’s decision logic was not attempted before. In this paper, we propose a novel on-line multi-objective reinforcement learning (MORL) algorithm that uses the hypervolume indicator as an action selection strategy. We call this algorithm the hypervolume-based MORL algorithm or HB-MORL and conduct an empirical study of the performance of the algorithm using multiple quality assessment metrics from multi-objective optimization. We compare the hypervolume-based learning algorithm on different environments to two multi-objective algorithms that rely on scalarization techniques, such as the linear scalarization and the weighted Chebyshev function. We conclude that HB-MORL significantly outperforms the linear scalarization method and performs similarly to the Chebyshev algorithm without requiring any user-specified emphasis on particular objectives

Similar works

Full text

Available Versions

Repository TU/e

oai:library.tue.nl:889109

Last time updated on 06/12/2019