
29 research outputs found

Increasing the Action Gap: New Operators for Reinforcement Learning

Author: Bellemare Marc G.
Guez Arthur
Munos Rémi
Ostrovski Georg
Thomas Philip S.
Publication date: 15/12/2015

This paper introduces new optimality-preserving operators on Q-functions. We first describe an operator for tabular representations, the consistent Bellman operator, which incorporates a notion of local policy consistency. We show that this local consistency leads to an increase in the action gap at each state; increasing this gap, we argue, mitigates the undesirable effects of approximation and estimation errors on the induced greedy policies. This operator can also be applied to discretized continuous space and time problems, and we provide empirical results evidencing superior performance in this context. Extending the idea of a locally consistent operator, we then derive sufficient conditions for an operator to preserve optimality, leading to a family of operators which includes our consistent Bellman operator. As corollaries we provide a proof of optimality for Baird's advantage learning algorithm and derive other gap-increasing operators with interesting properties. We conclude with an empirical study on 60 Atari 2600 games illustrating the strong potential of these new operators.
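
To make the idea of a gap-increasing operator concrete, the following is a minimal sketch on a hypothetical two-state, two-action MDP. It is not code from the paper. It assumes the advantage learning operator in its commonly stated form, T_AL Q(x, a) = T Q(x, a) - alpha * (V(x) - Q(x, a)) with V(x) = max_b Q(x, b) and alpha in [0, 1). The transition table `P`, the rewards `R`, and the values of `GAMMA` and `ALPHA` are illustrative assumptions.

```python
import numpy as np

GAMMA = 0.9   # discount factor (assumed for illustration)
ALPHA = 0.5   # advantage-learning coefficient in [0, 1) (assumed)

# Hypothetical 2-state, 2-action MDP: P[s, a, s'] and R[s, a].
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.7, 0.3], [0.1, 0.9]]])
R = np.array([[0.0, 1.0],
              [0.5, 2.0]])

def bellman(Q):
    """Standard Bellman optimality operator on a tabular Q-function."""
    V = Q.max(axis=1)
    return R + GAMMA * (P @ V)

def advantage_learning(Q):
    """Bellman backup minus alpha times the current action gap."""
    V = Q.max(axis=1)
    return bellman(Q) - ALPHA * (V[:, None] - Q)

def iterate(operator, steps=1000):
    """Apply the operator repeatedly, starting from Q = 0."""
    Q = np.zeros_like(R)
    for _ in range(steps):
        Q = operator(Q)
    return Q

def action_gap(Q):
    """Difference between the best and second-best action value per state."""
    top_two = np.sort(Q, axis=1)[:, -2:]
    return top_two[:, 1] - top_two[:, 0]

Q_bellman = iterate(bellman)
Q_al = iterate(advantage_learning)

print("greedy policy (Bellman):", Q_bellman.argmax(axis=1))
print("greedy policy (AL):     ", Q_al.argmax(axis=1))
print("action gap (Bellman):   ", action_gap(Q_bellman))
print("action gap (AL):        ", action_gap(Q_al))
```

On this toy problem both operators should induce the same greedy policy and the same state values, while the advantage learning iterate separates the best action from the runner-up by a wider margin. The consistent Bellman operator described in the abstract is a different member of the same family and is not implemented here.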

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Online evolution for multi-action adversarial games

Author: Justesen Niels Orsleff
Mahlmann Tobias
Togelius Julian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

We present Online Evolution, a novel method for playing turn-based multi-action adversarial games. Such games, which include most strategy games, have extremely high branching factors due to each turn having multiple actions. In Online Evolution, an evolutionary algorithm is used to evolve the combination of atomic actions that make up a single move, with a state evaluation function used for fitness. We implement Online Evolution for the turn-based multi-action game Hero Academy and compare it with a standard Monte Carlo Tree Search implementation as well as two types of greedy algorithms. Online Evolution is shown to outperform these methods by a large margin. This shows that evolutionary planning on the level of a single move can be very effective for this sort of problem.
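
The core loop described in this abstract can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the "game" is a single integer state, the four atomic actions in `ACTIONS` are hypothetical, `evaluate` stands in for a real state evaluation function, and the population size, generation count, and mutation rate are arbitrary choices.

```python
import random

# Hypothetical atomic actions for a toy game whose state is a single integer.
ACTIONS = {
    "+1": lambda s: s + 1,
    "+3": lambda s: s + 3,
    "*2": lambda s: s * 2,
    "-2": lambda s: s - 2,
}
ACTION_NAMES = list(ACTIONS)
MOVE_LENGTH = 5  # atomic actions per turn (assumed)

def apply_move(state, move):
    """Apply a sequence of atomic actions to the state, in order."""
    for name in move:
        state = ACTIONS[name](state)
    return state

def evaluate(state):
    """Toy state evaluation function: larger states are better."""
    return state

def fitness(move, state):
    """Fitness of a move is the evaluation of the state it leads to."""
    return evaluate(apply_move(state, move))

def online_evolution(state, rng, pop_size=20, generations=30, mutation_rate=0.2):
    """Evolve one turn's action sequence and return the fittest one found."""
    population = [[rng.choice(ACTION_NAMES) for _ in range(MOVE_LENGTH)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the better half unchanged (elitism), discard the rest.
        population.sort(key=lambda m: fitness(m, state), reverse=True)
        survivors = population[: pop_size // 2]
        children = []
        while len(survivors) + len(children) < pop_size:
            parent_a, parent_b = rng.sample(survivors, 2)
            # Uniform crossover: take each position from either parent.
            child = [rng.choice(pair) for pair in zip(parent_a, parent_b)]
            if rng.random() < mutation_rate:
                child[rng.randrange(MOVE_LENGTH)] = rng.choice(ACTION_NAMES)
            children.append(child)
        population = survivors + children
    return max(population, key=lambda m: fitness(m, state))

best = online_evolution(1, random.Random(0))
print(best, fitness(best, 1))
```

The search runs over whole turns (sequences of atomic actions) rather than over a game tree, which is how the approach sidesteps the branching factor that each extra action per turn would add to a tree search.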

Lund University Publications

Crossref

The IT University of Copenhagen's Repository

Balance trucks: Using crowd-sourced data to procedurally-generate gameplay within mobile games

Author: Arnab Sylvester
Bogliolo Alessandro
Clarke Samantha
Delpriori Saverio
Klopfenstein Lorenz
Lewis Mark Russell
Masters Alex
Morini Luca
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018

Within the field of procedural content generation (PCG) research, crowd-sensing data has, until now, primarily been used as a means of collecting information and generating feedback relating to player experience within games, and game aesthetics. However, crowd-sensing data can offer much more, supplying a seemingly untapped font of information which may be used within the creation of unique PCG game spaces or content, whilst providing a visible outlet for the dissemination of crowd-sensed material to users. This paper examines one such use of crowd-sensed data, the creation of a game which will reside within the CROWD4ROADS (C4RS) application, SmartRoadSense (SRS). The authors will open with a brief discussion of PCG. Following this, an explanation of the features and aims of the SRS application will be provided. Finally, the paper will introduce ‘Balance Trucks’, the SRS game, discussing the concepts behind using crowd-sensed data within its design, its development and use of PCG.

Archivio istituzionale della ricerca - Università di Urbino

Crossref

Coventry University Pure Portal

Evolving levels for Super Mario Bros using grammatical evolution

Author: Nicolau Miguel
O'Neill Michael
Shaker Noor
Togelius Julian
Yannakakis Georgios N.
Publication venue: 2012 IEEE Conference on Computational Intelligence and Games, 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012

This paper presents the use of design grammars to evolve playable 2D platform levels through grammatical evolution (GE). Representing levels using design grammars allows simple encoding of important level design constraints, and allows remarkably compact descriptions of large spaces of levels. The expressive range of the GE-based level generator is analyzed and quantitatively compared to other feature-based and the original level generators by means of aesthetic and similarity based measures. The analysis reveals strengths and shortcomings of each generator and provides a general framework for comparing content generated by different generators. The approach presented can be used as an assistive tool by game designers to compare and analyze generators' capabilities within the same game genre.

OAR@UM

Evolving levels for Super Mario Bros using grammatical evolution

Author: Nicolau Miguel
O’Neill Michael
Shaker Noor
Togelius Julian
Yannakakis Georgios N.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012

Crossref

The IT University of Copenhagen's Repository

Considerations for comparing video-game AI agents with humans

Author: Madan Christopher R
Publication venue: 'MDPI AG'
Publication date: 20/08/2020

Video games are sometimes used as environments to evaluate AI agents’ ability to develop and execute complex action sequences to maximize a defined reward. However, humans cannot match the fine precision of the timed actions of AI agents; in games such as StarCraft, build orders take the place of chess opening gambits. However, unlike strategy games, such as chess and Go, video games also rely heavily on sensorimotor precision. If the “finding” was merely that AI agents have superhuman reaction times and precision, none would be surprised. The goal is rather to look at adaptive reasoning and strategies produced by AI agents that may replicate human approaches or even result in strategies not previously produced by humans. Here, I will provide: (1) an overview of observations where AI agents are perhaps not being fairly evaluated relative to humans, (2) a potential approach for making this comparison more appropriate, and (3) highlights of some important recent advances in video game play provided by AI agents.

Multidisciplinary Digital Publishing Institute

Repository@Nottingham

From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI

Author: Preuss Mike
Risi Sebastian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020

This paper reviews the field of Game AI, which not only deals with creating agents that can play a certain game, but also with areas as diverse as creating game content automatically, game analytics, or player modelling. While Game AI was for a long time not very well recognized by the larger scientific community, it has established itself as a research area for developing and testing the most advanced forms of AI algorithms, and articles covering advances in mastering video games such as StarCraft 2 and Quake III appear in the most prestigious journals. Because of the growth of the field, a single review cannot cover it completely. Therefore, we put a focus on important recent developments, including that advances in Game AI are starting to be extended to areas outside of games, such as robotics or the synthesis of chemicals. In this article, we review the algorithms and methods that have paved the way for these breakthroughs, report on the other important areas of Game AI research, and also point out exciting directions for the future of Game AI.

arXiv.org e-Print Archive

The IT University of Copenhagen's Repository