General self-motivation and strategy identification : Case studies based on Sokoban and Pac-Man

Anthony, Tom; Nehaniv, C.L.; Polani, D.

research

General self-motivation and strategy identification : Case studies based on Sokoban and Pac-Man

Authors: Tom Anthony
C.L. Nehaniv
D. Polani
Publication date: 18 December 2013
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

(c) 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.In this paper, we use empowerment, a recently introduced biologically inspired measure, to allow an AI player to assign utility values to potential future states within a previously unencountered game without requiring explicit specification of goal states. We further introduce strategic affinity, a method of grouping action sequences together to form "strategies," by examining the overlap in the sets of potential future states following each such action sequence. We also demonstrate an information-theoretic method of predicting future utility. Combining these methods, we extend empowerment to soft-horizon empowerment which enables the player to select a repertoire of action sequences that aim to maintain anticipated utility. We show how this method provides a proto-heuristic for nonterminal states prior to specifying concrete game goals, and propose it as a principled candidate model for "intuitive" strategy selection, in line with other recent work on "self-motivated agent behavior." We demonstrate that the technique, despite being generically defined independently of scenario, performs quite well in relatively disparate scenarios, such as a Sokoban-inspired box-pushing scenario and in a Pac-Man-inspired predator game, suggesting novel and principle-based candidate routes toward more general game-playing algorithms.Peer reviewedFinal Accepted Versio

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

University of Hertfordshire Research Archive

oai:uhra.herts.ac.uk:2299/1537...

Last time updated on 01/10/2015

Crossref

info:doi/10.1109%2Ftciaig.2013...

Last time updated on 13/11/2020