Search CORE

18,206 research outputs found

Continuous-Time Markov Decision Processes with Exponential Utility

Author: Zhang Y
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 25/11/2016
Field of study

In this paper, we consider a continuous-time Markov decision process (CTMDP) in Borel spaces, where the certainty equivalent with respect to the exponential utility of the total undiscounted cost is to be minimized. The cost rate is nonnegative. We establish the optimality equation. Under the compactness-continuity condition, we show the existence of a deterministic stationary optimal policy. We reduce the risk-sensitive CTMDP problem to an equivalent risk-sensitive discrete-time Markov decision process, which is with the same state and action spaces as the original CTMDP. In particular, the value iteration algorithm for the CTMDP problem follows from this reduction. We essentially do not need to impose a condition on the growth of the transition and cost rate in the state, and the controlled process could be explosive

arXiv.org e-Print Archive

University of Liverpool Repository

University of Birmingham Research Portal

On gradual-impulse control of continuous-time Markov decision processes with multiplicative cost

Author: Guo Xin
Kurushima Aiko
Piunovskiy Alexey
Zhang Yi
Publication venue
Publication date: 28/11/2018
Field of study

In this paper, we consider the gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We prove, under very general conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies. Policies that we consider allow multiple simultaneous impulses, randomized selection of impulses with random effects, relaxed gradual controls, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the continuous-time gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions

arXiv.org e-Print Archive

ON GRADUAL-IMPULSE CONTROL OF CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH EXPONENTIAL UTILITY

Author: Guo Xin
Kurushima Aiko
Piunovskiy Alexey
Zhang Yi
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/06/2021
Field of study

University of Liverpool Repository

University of Birmingham Research Portal

Continuous-time Markov decision processes under the risk-sensitive average cost criterion

Author: Chen Xian
Wei Qingda
Publication venue
Publication date: 21/12/2015
Field of study

This paper studies continuous-time Markov decision processes under the risk-sensitive average cost criterion. The state space is a finite set, the action space is a Borel space, the cost and transition rates are bounded, and the risk-sensitivity coefficient can take arbitrary positive real numbers. Under the mild conditions, we develop a new approach to establish the existence of a solution to the risk-sensitive average cost optimality equation and obtain the existence of an optimal deterministic stationary policy.Comment: 14 page

arXiv.org e-Print Archive