Search CORE

20 research outputs found

Partially observable Markov decision processes with partially observable random discount factors

Author: Martinez-Garcia E. Everardo
Minjárez-Sosa J. Adolfo
Vega-Amaya Oscar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

summary:This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where we obtain explicitly the corresponding optimality equation and the filtering process

Institute of Mathematics AS CR, v. v. i.

PROCESOS DE MARKOV CONTROLADOS EN ESPACIOS DE BOREL: CRITERIOS NO DESCONTADOS

Author: OSCAR VEGA AMAYA
Publication venue: 'Universidad Autonoma Metropolitana'
Publication date: 01/01/1998
Field of study

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Red Mexicana de Repositorios Institucionales

BINDANI

Sample path average optimality of Markov control processes with strictly unbounded cost

Author: Vega-Amaya Oscar
Publication venue: Polska Akademia Nauk. Instytut Matematyczny PAN
Publication date: 01/01/1999
Field of study

We study the existence of sample path average cost (SPAC-) optimal policies for Markov control processes on Borel spaces with strictly unbounded costs, i.e., costs that grow without bound on the complement of compact subsets. Assuming only that the cost function is lower semicontinuous and that the transition law is weakly continuous, we show the existence of a relaxed policy with 'minimal' expected average cost and that the optimal average cost is the limit of discounted programs. Moreover, we show that if such a policy induces a positive Harris recurrent Markov chain, then it is also sample path average (SPAC-) optimal. We apply our results to inventory systems and, in a particular case, we compute explicitly a deterministic stationary SPAC-optimal policy

Biblioteka Nauki - repozytorium artykuÅÃ³w

Infinite-horizon Markov control processes with undiscounted cost criteria: from average to overtaking optimality

Author: Hernández-Lerma Onésimo
Vega-Amaya Oscar
Publication venue: Polska Akademia Nauk. Instytut Matematyczny PAN
Publication date: 01/01/1998
Field of study

We consider discrete-time Markov control processes on Borel spaces and infinite-horizon undiscounted cost criteria which are sensitive to the growth rate of finite-horizon costs. These criteria include, at one extreme, the grossly underselective average cos

Biblioteka Nauki - repozytorium artykuÅÃ³w

Sample-path average cost optimality for semi-Markov control processes on Borel spaces: unbounded costs and mean holding times

Author: Luque-Vásquez Fernando
Vega-Amaya Oscar
Publication venue: Polska Akademia Nauk. Instytut Matematyczny PAN
Publication date: 01/01/2000
Field of study

We deal with semi-Markov control processes (SMCPs) on Borel spaces with unbounded cost and mean holding time. Under suitable growth conditions on the cost function and the mean holding time, together with stability properties of the embedded Markov chains, we show the equivalence of several average cost criteria as well as the existence of stationary optimal policies with respect to each of these criteria

Biblioteka Nauki - repozytorium artykuÅÃ³w