Search CORE

6 research outputs found

Continuous-time Markov decision processes under the risk-sensitive average cost criterion

Author: Chen Xian
Wei Qingda
Publication venue
Publication date: 21/12/2015
Field of study

This paper studies continuous-time Markov decision processes under the risk-sensitive average cost criterion. The state space is a finite set, the action space is a Borel space, the cost and transition rates are bounded, and the risk-sensitivity coefficient can take arbitrary positive real numbers. Under the mild conditions, we develop a new approach to establish the existence of a solution to the risk-sensitive average cost optimality equation and obtain the existence of an optimal deterministic stationary policy.Comment: 14 page

arXiv.org e-Print Archive

Markov Decision Processes with Risk-Sensitive Criteria: An Overview

Author: Bäuerle Nicole
Jaśkiewicz Anna
Publication venue
Publication date: 12/11/2023
Field of study

The paper provides an overview of the theory and applications of risk-sensitive Markov decision processes. The term 'risk-sensitive' refers here to the use of the Optimized Certainty Equivalent as a means to measure expectation and risk. This comprises the well-known entropic risk measure and Conditional Value-at-Risk. We restrict our considerations to stationary problems with an infinite time horizon. Conditions are given under which optimal policies exist and solution procedures are explained. We present both the theory when the Optimized Certainty Equivalent is applied recursively as well as the case where it is applied to the cumulated reward. Discounted as well as non-discounted models are reviewe

arXiv.org e-Print Archive

Exit Time Risk-Sensitive Control for Systems of Cooperative Agents

Author: Dupuis Paul
Laschos Vaios
Ramanan Kavita
Publication venue
Publication date: 22/08/2018
Field of study

We study sequences, parametrized by the number of agents, of many agent exit time stochastic control problems with risk-sensitive cost structure. We identify a fully characterizing assumption, under which each of such control problem corresponds to a risk-neutral stochastic control problem with additive cost, and sequentially to a risk-neutral stochastic control problem on the simplex, where the specific information about the state of each agent can be discarded. We also prove that, under some additional assumptions, the sequence of value functions converges to the value function of a deterministic control problem, which can be used for the design of nearly optimal controls for the original problem, when the number of agents is sufficiently large

arXiv.org e-Print Archive

Publications Server of the Weierstrass Institute for Applied Analysis and Stochastics

Exit time risk-sensitive stochastic control problems related to systems of cooperative agents

Author: Dupuis Paul
Laschos Vaios
Ramanan Kavita
Publication venue
Publication date: 01/01/2017
Field of study

We study sequences, parametrized by the number of agents, of exit time stochastic control problems with risk-sensitive costs structures generate by unbounded costs. We identify a fully characterizing assumption, under which, each of them corresponds to a risk-neutral stochastic control problem with additive cost, and also to a risk-neutral stochastic control problem on the simplex, where the specific information about the state of each agent can be discarded. We finally prove that, under some additional assumptions, the sequence of value functions converges to the value function of a deterministic control problem

Publications Server of the Weierstrass Institute for Applied Analysis and Stochastics