12 research outputs found
Randomized and Relaxed Strategies in Continuous-Time Markov Decision Processes.
One of the goals of this article is to describe a wide class of control strategies that includes the traditional relaxed strategies as well as the so-called randomized strategies, which previously appeared only in the framework of semi-Markov decision processes. If the objective is the total expected cost up to the accumulation of jumps, then without loss of generality one can consider only Markov relaxed strategies. Under a simple condition, the Markov randomized strategies are also sufficient; an example shows that this condition is essential. Finally, without any conditions, the class of so-called Poisson-related strategies is also sufficient in the optimization problems. All the results apply to the discounted model and may also be useful for the case of long-run average cost. (https://epubs.siam.org/doi/10.1137/15M101401)
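For orientation, the discounted criterion mentioned at the end of the abstract typically takes the following form (generic notation, not taken from the paper):

```latex
V_\alpha(\pi, x_0) \;=\; \mathbb{E}^{\pi}_{x_0}\!\left[\int_0^\infty e^{-\alpha t}\, c(X_t, A_t)\, \mathrm{d}t\right], \qquad \alpha > 0,
```

where $X_t$ is the controlled jump process, $A_t$ the action at time $t$, $c$ the cost rate, and $\alpha$ the discount factor. A class of strategies is called sufficient when the infimum of $V_\alpha$ over that class equals the infimum over all admissible strategies.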
Realizable strategies in continuous-time Markov decision processes
For the Borel model of the continuous-time Markov decision process, we introduce a wide class of control strategies. In one particular case, such strategies reduce to the standard relaxed strategies, intensively studied over the last decade; if one restricts to another special subclass of the general strategies, the model transforms to a semi-Markov decision process. Further, we show that the relaxed strategies are not realizable. For the constrained optimal control problem with total expected costs, we describe a sufficient class of realizable strategies, the so-called Poisson-related strategies. Finally, we show that, for solving the formulated optimal control problems, one can use all the tools developed earlier for classical discrete-time Markov decision processes.
Constrained and Unconstrained Optimal Discounted Control of Piecewise Deterministic Markov Processes
The main goal of this paper is to study the infinite-horizon expected discounted continuous-time optimal control problem of piecewise deterministic Markov processes, with the control acting continuously on the jump intensity and on the transition measure of the process, but not on the deterministic flow. The contributions of the paper cover the unconstrained as well as the constrained case. The set of admissible control strategies is assumed to be formed by policies, possibly randomized and depending on the history of the process, taking values in a set-valued action space. For the unconstrained case, we provide sufficient conditions, based on the three local characteristics of the process (the flow, the jump intensity, and the transition measure) and the semicontinuity properties of the set-valued action space, to guarantee the existence and uniqueness of a solution to the integro-differential optimality equation (the so-called Bellman-Hamilton-Jacobi equation), as well as the existence of an optimal (and ε-optimal) deterministic stationary control strategy for the problem. For the constrained case, we show that the values of the constrained control problem and of an associated infinite-dimensional linear programming (LP) problem coincide; moreover, we provide sufficient conditions for the solvability of the LP problem as well as for the existence of an optimal feasible randomized stationary control strategy for the constrained problem.
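As a rough sketch (generic notation, not taken verbatim from the paper), the discounted optimality equation for a piecewise deterministic Markov process whose flow is uncontrolled has the form:

```latex
\alpha V(x) \;=\; \mathcal{X}V(x) \;+\; \inf_{a \in A(x)} \Big\{ c(x,a) \;+\; \lambda(x,a) \int_E \big[V(y) - V(x)\big]\, Q(\mathrm{d}y \mid x, a) \Big\},
```

where $\mathcal{X}V$ denotes the derivative of $V$ along the deterministic flow, $\lambda$ the controlled jump intensity, and $Q$ the controlled transition measure; the precise equation in the paper may additionally involve boundary conditions for forced jumps.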
Optimal impulse control of dynamical systems
Using the tools of Markov decision processes, we justify the dynamic programming approach to the optimal impulse control of deterministic dynamical systems. We prove the equivalence of the integral and differential forms of the optimality equation. The theory is illustrated by an example from mathematical epidemiology. The developed methods can also be useful for the study of piecewise deterministic Markov processes.
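In generic discounted notation (not taken from the paper), the differential form of such an impulse-control optimality equation is a quasi-variational inequality:

```latex
\min\Big\{ \alpha V(x) - \mathcal{X}V(x) - c(x),\;\; V(x) - \inf_{a}\big[\, l(x,a) + V\big(j(x,a)\big) \big] \Big\} \;=\; 0,
```

where $\mathcal{X}V$ is the derivative of $V$ along the flow, $c$ the running cost, $l$ the impulse cost, and $j(x,a)$ the post-impulse state. The integral form instead expresses $V(x)$ through integration of the running cost along the flow up to the next intervention; the equivalence of the two forms is the kind of statement proved in the paper.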
Compactness of the space of non-randomized policies in countable-state sequential decision processes
Optimal policies for constrained average-cost Markov decision processes
Keywords: Markov decision processes, constraints, stable measures; MSC: 90C40
Impulsive Control for Continuous-Time Markov Decision Processes: A Linear Programming Approach
Constrained Discounted Semi-Markov Decision Processes
This paper reduces problems of the existence and the finding of optimal policies for multiple-criteria discounted semi-Markov decision processes (SMDPs) to similar problems for MDPs. We prove this reduction and illustrate it by extending to SMDPs several results for constrained discounted MDPs.
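To illustrate the occupation-measure LP formulation that underlies such reductions of constrained discounted problems, here is a minimal sketch for a finite constrained discounted MDP. The instance (states, transition matrix, costs, bound) is entirely hypothetical, chosen only to make the program runnable; it is not taken from any of the papers above.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical toy instance: 2 states, 2 actions, discount gamma.
nS, nA = 2, 2
gamma = 0.9
beta = np.array([1.0, 0.0])             # initial distribution
# P[s, a, s'] : probability of moving from s to s' under action a
P = np.array([[[0.8, 0.2], [0.2, 0.8]],
              [[0.5, 0.5], [0.9, 0.1]]])
c = np.array([[1.0, 2.0], [0.0, 3.0]])  # cost to minimize
d = np.array([[0.0, 1.0], [2.0, 0.0]])  # constrained cost
D = 4.0                                 # constraint bound

# LP variables x[s, a] >= 0: the discounted occupation measure, flattened.
# Flow (balance) constraints, one per state s:
#   sum_a x[s, a] - gamma * sum_{s', a} P[s', a, s] * x[s', a] = beta[s]
idx = lambda s, a: s * nA + a
A_eq = np.zeros((nS, nS * nA))
for s in range(nS):
    for sp in range(nS):
        for a in range(nA):
            A_eq[s, idx(sp, a)] += (1.0 if sp == s else 0.0) - gamma * P[sp, a, s]

res = linprog(c=c.ravel(),               # minimize expected discounted c-cost
              A_ub=[d.ravel()], b_ub=[D],  # expected discounted d-cost <= D
              A_eq=A_eq, b_eq=beta,
              bounds=(0, None))
x = res.x.reshape(nS, nA)
# An optimal (possibly randomized) stationary policy is recovered by
# normalizing the occupation measure state by state.
policy = x / x.sum(axis=1, keepdims=True)
```

The solvability of this finite-dimensional LP and the recovery of a randomized stationary policy from its solution mirror, in the simplest setting, the infinite-dimensional LP arguments used for the constrained continuous-time and semi-Markov models.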