3 research outputs found

    Extreme occupation measures in Markov decision processes with a cemetery

    Full text link
    In this paper, we consider a Markov decision process (MDP) with a Borel state space X∪{Δ}\textbf{X}\cup\{\Delta\}, where Δ\Delta is an absorbing state (cemetery), and a Borel action space A\textbf{A}. We consider the space of finite occupation measures restricted on X×A\textbf{X}\times \textbf{A}, and the extreme points in it. It is possible that some strategies have infinite occupation measures. Nevertheless, we prove that every finite extreme occupation measure is generated by a deterministic stationary strategy. Then, for this MDP, we consider a constrained problem with total undiscounted criteria and JJ constraints, where the cost functions are nonnegative. By assumption, the strategies inducing infinite occupation measures are not optimal. Then, our second main result is that, under mild conditions, the solution to this constrained MDP is given by a mixture of no more than J+1J+1 occupation measures generated by deterministic stationary strategies

    Proper Policies in Infinite-State Stochastic Shortest Path Problems

    No full text
    corecore