The exponential cost optimality for finite horizon semi-Markov decision processes

Abstract

summary:This paper considers an exponential cost optimality problem for finite horizon semi-Markov decision processes (SMDPs). The objective is to calculate an optimal policy with minimal exponential costs over the full set of policies in a finite horizon. First, under the standard regular and compact-continuity conditions, we establish the optimality equation, prove that the value function is the unique solution of the optimality equation and the existence of an optimal policy by using the minimum nonnegative solution approach. Second, we establish a new value iteration algorithm to calculate both the value function and the ϵ\epsilon-optimal policy. Finally, we give a computable machine maintenance system to illustrate the convergence of the algorithm

    Similar works