Search CORE

65 research outputs found

Polynomial Time Algorithms for Branching Markov Decision Processes and Probabilistic Min(Max) Polynomial Bellman Equations

Author: C. Courcoubetis
E. Allender
E. Denardo
J. Esparza
J. Esparza
J. Esparza
K. Etessami
K. Etessami
S. Pliska
T. Brázdil
U. Rothblum
Publication venue
Publication date: 01/01/2012
Field of study

We show that one can approximate the least fixed point solution for a multivariate system of monotone probabilistic max(min) polynomial equations, referred to as maxPPSs (and minPPSs, respectively), in time polynomial in both the encoding size of the system of equations and in log(1/epsilon), where epsilon > 0 is the desired additive error bound of the solution. (The model of computation is the standard Turing machine model.) We establish this result using a generalization of Newton's method which applies to maxPPSs and minPPSs, even though the underlying functions are only piecewise-differentiable. This generalizes our recent work which provided a P-time algorithm for purely probabilistic PPSs. These equations form the Bellman optimality equations for several important classes of infinite-state Markov Decision Processes (MDPs). Thus, as a corollary, we obtain the first polynomial time algorithms for computing to within arbitrary desired precision the optimal value vector for several classes of infinite-state MDPs which arise as extensions of classic, and heavily studied, purely stochastic processes. These include both the problem of maximizing and mininizing the termination (extinction) probability of multi-type branching MDPs, stochastic context-free MDPs, and 1-exit Recursive MDPs. Furthermore, we also show that we can compute in P-time an epsilon-optimal policy for both maximizing and minimizing branching, context-free, and 1-exit-Recursive MDPs, for any given desired epsilon > 0. This is despite the fact that actually computing optimal strategies is Sqrt-Sum-hard and PosSLP-hard in this setting. We also derive, as an easy consequence of these results, an FNP upper bound on the complexity of computing the value (within arbitrary desired precision) of branching simple stochastic games (BSSGs)

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Stability and convergence in discrete convex monotone dynamical systems

Author: B. Lemmens
B. Lemmens
Bas Lemmens
G. Cooman de
H. Schneider
H.L. Smith
M. Akian
M. Akian
M.G. Crandall
M.G. Kreĭn
M.W. Hirsch
M.W. Hirsch
Marianne Akian
N. Dancer
P. Alexandroff
P. Hess
P. Poláčik
P. Takáč
R.D. Nussbaum
R.D. Nussbaum
R.D. Nussbaum
S. Boyd
Stéphane Gaubert
U. Krause
U.G. Rothblum
U.G. Rothblum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/03/2010
Field of study

We study the stable behaviour of discrete dynamical systems where the map is convex and monotone with respect to the standard positive cone. The notion of tangential stability for fixed points and periodic points is introduced, which is weaker than Lyapunov stability. Among others we show that the set of tangentially stable fixed points is isomorphic to a convex inf-semilattice, and a criterion is given for the existence of a unique tangentially stable fixed point. We also show that periods of tangentially stable periodic points are orders of permutations on

n

letters, where

n

is the dimension of the underlying space, and a sufficient condition for global convergence to periodic orbits is presented.Comment: 36 pages, 1 fugur

arXiv.org e-Print Archive

Crossref

Kent Academic Repository

Eating Christmas Cookies, Whole-wheat Bread and Frozen Chicken in the Kindergarten: Doing Pedagogy by Other Means

Author: A. C. Saguy
A. Mol
C. Tomrley
C. Wulf
E. Rawlins
E. Rawlins
E. Rich
E. Rothblum
J. Evans
J. Evans
J. Guthman
J. Niewöhner
K. Audehm
M. Kontopodis
M. Kontopodis
Michalis Kontopodis
P. Rabinow
U. Spiekermann
Publication venue
Publication date: 01/01/2013
Field of study

The study presented here explores eating as a pedagogical practice by paying attention to arrangements of things such as Christmas cookies, whole-wheat and white bread, frozen chicken, plates, chairs, tables, and freezers. Through a series of ethnographic research examples from German and Brazilian preschools, it investigates how eating in the kindergarten can be a sensual pleasure, a health risk, an ethnic custom, or a civil right within different local histories. Through specific arrangements of foods and other things, young children are educated to eat with moderation, to change their ethnic dietary habits, or to be "modern citizens". Pedagogy can thus consist of doing public health, doing ethnic identity, or doing citizenship. Eating is an important way of doing pedagogy in early childhood education and care settings. © 2013 Springer Fachmedien Wiesbaden

Crossref

VU Research Portal

Greatest Fixed Points of Probabilistic Min/Max Polynomial Equations, and Reachability for Branching Markov Decision Processes?

Author: I Bozic
J Esparza
J Esparza
JG Reiter
K Etessami
K Etessami
R Bonnet
T Brázdil
T Brázdil
T Chen
U Rothblum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

We give polynomial time algorithms for quantitative (and qualitative) reachability analysis for Branching Markov Decision Processes (BMDPs). Specifically, given a BMDP, and given an initial population, where the objective of the controller is to maximize (or minimize) the probability of eventually reaching a population that contains an object of a desired (or undesired) type, we give algorithms for approximating the supremum (infimum) reachability probability, within desired precision epsilon > 0, in time polynomial in the encoding size of the BMDP and in log(1/epsilon). We furthermore give P-time algorithms for computing epsilon-optimal strategies for both maximization and minimization of reachability probabilities. We also give P-time algorithms for all associated qualitative analysis problems, namely: deciding whether the optimal (supremum or infimum) reachability probabilities are 0 or 1. Prior to this paper, approximation of optimal reachability probabilities for BMDPs was not even known to be decidable. Our algorithms exploit the following basic fact: we show that for any BMDP, its maximum (minimum) non-reachability probabilities are given by the greatest fixed point (GFP) solution g* in [0,1]^n of a corresponding monotone max (min) Probabilistic Polynomial System of equations (max/min-PPS), x=P(x), which are the Bellman optimality equations for a BMDP with non-reachability objectives. We show how to compute the GFP of max/min PPSs to desired precision in P-time. We also study more general Branching Simple Stochastic Games (BSSGs) with (non-)reachability objectives. We show that: (1) the value of these games is captured by the GFP of a corresponding max-minPPS; (2) the quantitative problem of approximating the value is in TFNP; and (3) the qualitative problems associated with the value are all solvable in P-time

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Q-learning and policy iteration algorithms for stochastic shortest path problems

Author: A. F. Veinott Jr.
C. Derman
C. Thiery
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. S. Choi
Dimitri P. Bertsekas
E. A. Feinberg
G. J. Gordon
G. M. Baudet
H. Yu
Huizhen Yu
J. H. Eaton
J. N. Tsitsiklis
J. N. Tsitsiklis
J. N. Tsitsiklis
M. L. Puterman
P. G. Canbolat
P. Whittle
R. S. Sutton
T. S. Jaakkola
T. S. Jaakkola
U. G. Rothblum
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Block diagonalization

Author: /. Marek
A. S. Householder
B. Noble
Ivan Straškraba
J. J. Koliha
S. L. Campbell
S. L. Campbell
T. Liu
U. G. Rothblum
U. G. Rothblum
U. G. Rothblum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2001
Field of study

summary:We study block diagonalization of matrices induced by resolutions of the unit matrix into the sum of idempotent matrices. We show that the block diagonal matrices have disjoint spectra if and only if each idempotent matrix in the inducing resolution double commutes with the given matrix. Applications include a new characterization of an eigenprojection and of the Drazin inverse of a given matrix

Crossref

Institute of Mathematics AS CR, v. v. i.

Consecutive optimizers for a partitioning problem with applications to optimal inventory groupings for joint replenishment

Author: A. K. Chakravarty
J. B. Orlin
U. G. Rothblum
Publication venue
Publication date: 01/01/1985
Field of study

We consider several subclasses of the problem of grouping n items (indexed 1, 2,.., n) into m subsets so as to minimize the function g(S 1,.., S,). In general, these problems are very difficult to solve to optimality, even for the case m = 2. We provide several sufficient conditions on g(') that guarantee that there is an optimum partition in which each subset consists of consecutive integers (or else the partition S,,-, S,, satisfies a more general condition called semiconsecutiveness"). Moreover, by restricting attention to 'consecutive" (or serniconsecutive " ) partitions, we can solve the partition problem in polynomial time for small values of m. If, in addition, g is symmetric, then the partition problem is solvable in purely polynomial time. We apply these results to generalizations of a problem in inventory groupings considered by the authors in a previous paper. We also relate the results to the Neyman-Pearson lemma in statistical hypothesis testing and to a graph partitioning problem of Barnes and Hoffman. C · lg · · ·II CIL ·I D···1C- ·------- · 111-ET a, , a and b,-, b be real numbers ordered so that for some integer 0 r n, b, *.., b, are negative, b,+,.., b are nonnegative and al ar-- c.-- and tbi I b ar+l an br+i- bn For b, = 0, we consider adb, to be +cc or- according to a> 0 or a, < 0. If ai = bi = 0, al/b1 is defined arbitrarily so that inequality (1) holds. As usual, we let a and b denote the vectors whose coordinates are a, and bi, respectively. Subject clasification: 334 partitioning items into subgroups, 625 optimal inventory groupings

CiteSeerX

Consecutive Optimizers for a Partitioning Problem with Applications to Optimal Inventory Groupings for Joint Replenishment

Author: A. K. Chakravarty
J. B. Orlin
U. G. Rothblum
Publication venue: 'Institute for Operations Research and the Management Sciences (INFORMS)'
Publication date
Field of study

Crossref