The method of successive approximations for the discounted Markov game

Abstract

This paper presents a number of successive approximation algorithms for the repeated two-person zero-sum game known as the Markov game, using the criterion of total expected discounted rewards. As Wessels [12] did for Markov decision processes, stopping times are introduced in order to simplify the proofs. It is shown that each algorithm provides upper and lower bounds for the value of the game, as well as nearly optimal stationary strategies for both players.
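To illustrate the kind of scheme the abstract refers to, the sketch below shows plain successive approximation for a discounted two-person zero-sum Markov game: at each iteration, an auxiliary matrix game is solved in every state and its value becomes the new iterate, and MacQueen-style bounds sandwich the value of the game. This is only an assumed, generic instance of the idea, not the paper's construction (the paper's algorithms and its use of stopping times in the sense of Wessels [12] are not reproduced here); the names `r`, `p`, `beta` and the particular bound formulas are illustrative assumptions.

```python
# Hypothetical sketch, not the paper's algorithm: value iteration for a
# discounted zero-sum Markov game, solving one matrix game per state via LP
# and reporting standard MacQueen-style upper/lower bounds on the game value.
import numpy as np
from scipy.optimize import linprog

def matrix_game_value(M):
    """Value of the zero-sum matrix game M (row player maximizes)."""
    m, n = M.shape
    # Variables: row mixed strategy x (length m) and game value v.
    # Maximize v  <=>  minimize -v, subject to (M^T x)_j >= v and sum(x) = 1.
    c = np.zeros(m + 1); c[-1] = -1.0
    A_ub = np.hstack([-M.T, np.ones((n, 1))])        # v - (M^T x)_j <= 0
    b_ub = np.zeros(n)
    A_eq = np.hstack([np.ones((1, m)), np.zeros((1, 1))])
    b_eq = np.array([1.0])
    bounds = [(0, None)] * m + [(None, None)]        # x >= 0, v free
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
    return res.x[-1]

def successive_approximation(r, p, beta, eps=1e-6, max_iter=1000):
    """r[s,a,b]: immediate reward; p[s,a,b,s']: transition law; 0 < beta < 1."""
    S = r.shape[0]
    v = np.zeros(S)
    for _ in range(max_iter):
        v_new = np.empty(S)
        for s in range(S):
            # Auxiliary matrix game in state s with payoffs r + beta * P v.
            M = r[s] + beta * (p[s] @ v)
            v_new[s] = matrix_game_value(M)
        diff = v_new - v
        # Bounds of MacQueen type (assumed here, not taken from the paper):
        lower = v_new + beta / (1.0 - beta) * diff.min()
        upper = v_new + beta / (1.0 - beta) * diff.max()
        if (upper - lower).max() < eps:
            return v_new, lower, upper
        v = v_new
    return v, lower, upper
```

Nearly optimal stationary strategies could then be read off from the optimal mixed strategies of the final auxiliary matrix games, one per state; that step is omitted from the sketch.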
