Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion

Abstract

summary:The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate, and possibly unbounded costs. The state process {xt}\left\{ x_{t}\right\} and the discount process {αt}\left\{ \alpha _{t}\right\} evolve according to the coupled difference equations xt+1=F(xt,αt,at,ξt),x_{t+1}=F(x_{t},\alpha _{t},a_{t},\xi _{t}), αt+1=G(αt,ηt) \alpha _{t+1}=G(\alpha _{t},\eta _{t}) where the state and discount disturbance processes {ξt}\{\xi _{t}\} and {ηt}\{\eta _{t}\} are sequences of i.i.d. random variables with densities ρξ\rho ^{\xi } and ρη\rho ^{\eta } respectively. The main objective is to introduce approximation algorithms of the optimal cost function that lead up to construction of optimal or nearly optimal policies in the cases when the densities ρξ\rho ^{\xi } and ρη\rho ^{\eta } are either known or unknown. In the latter case, we combine suitable estimation methods with control procedures to construct an asymptotically discounted optimal policy

    Similar works