Markov Decision Processes

Abstract

The theory of Markov Decision Processes is the theory of controlled Markov chains. Its origins can be traced back to R. Bellman and L. Shapley in the 1950s. During the following decades this theory has grown dramatically and has found applications in areas such as computer science, engineering, operations research, biology and economics. In this article we give a short introduction to parts of this theory. We treat Markov Decision Processes with finite and infinite time horizon, restricting the presentation to the so-called (generalized) negative case. Solution algorithms such as Howard's policy improvement and linear programming are also explained. Various examples show the application of the theory: we treat stochastic linear-quadratic control problems, bandit problems and dividend pay-out problems.
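
Howard's policy improvement, mentioned above, can be illustrated by a minimal policy-iteration sketch for a finite, discounted MDP. The transition array P, reward array r and discount factor beta below are hypothetical toy values chosen for illustration, not data or notation taken from the article.

    # Minimal sketch of Howard's policy improvement (policy iteration) for a
    # finite, discounted MDP.  P, r and beta are illustrative assumptions.
    import numpy as np

    def policy_iteration(P, r, beta=0.9, max_iter=1000):
        """P[a] is the |S|x|S| transition matrix under action a,
        r[s, a] is the one-stage reward; returns a policy and its value."""
        n_states, n_actions = r.shape
        policy = np.zeros(n_states, dtype=int)      # start with an arbitrary policy
        for _ in range(max_iter):
            # Policy evaluation: solve (I - beta * P_pi) v = r_pi
            P_pi = np.array([P[policy[s]][s] for s in range(n_states)])
            r_pi = r[np.arange(n_states), policy]
            v = np.linalg.solve(np.eye(n_states) - beta * P_pi, r_pi)
            # Policy improvement: act greedily with respect to v
            q = r + beta * np.stack([P[a] @ v for a in range(n_actions)], axis=1)
            new_policy = q.argmax(axis=1)
            if np.array_equal(new_policy, policy):  # no improvement -> stop
                return policy, v
            policy = new_policy
        return policy, v

    # Toy two-state, two-action example (hypothetical numbers)
    P = np.array([[[0.8, 0.2], [0.2, 0.8]],
                  [[0.5, 0.5], [0.9, 0.1]]])
    r = np.array([[1.0, 0.5],
                  [0.0, 2.0]])
    print(policy_iteration(P, r))

The sketch alternates exact policy evaluation (solving a linear system) with greedy improvement and terminates when the policy no longer changes, which is the basic structure of Howard's algorithm for finite state and action spaces.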
