Search CORE

606 research outputs found

Sensitive optimality in stationary Markovian decision problems on a general state space

Author: Wijngaard J.
Publication venue: Technische Hogeschool Eindhoven
Publication date: 01/01/1976
Field of study

Repository TU/e

Pure OAI Repository

Stationary Markovian decision problems : discrete time, general state space

Author: Wijngaard J.
Publication venue: Technische Hogeschool Eindhoven
Publication date: 01/01/1975
Field of study

Repository TU/e

Pure OAI Repository

Risk-Sensitive Reinforcement Learning: A Constrained Optimization Viewpoint

Author: A. Prashanth L.
Fu Michael
Publication venue
Publication date: 22/10/2018
Field of study

The classic objective in a reinforcement learning (RL) problem is to find a policy that minimizes, in expectation, a long-run objective such as the infinite-horizon discounted or long-run average cost. In many practical applications, optimizing the expected value alone is not sufficient, and it may be necessary to include a risk measure in the optimization process, either as the objective or as a constraint. Various risk measures have been proposed in the literature, e.g., mean-variance tradeoff, exponential utility, the percentile performance, value at risk, conditional value at risk, prospect theory and its later enhancement, cumulative prospect theory. In this article, we focus on the combination of risk criteria and reinforcement learning in a constrained optimization framework, i.e., a setting where the goal to find a policy that optimizes the usual objective of infinite-horizon discounted/average cost, while ensuring that an explicit risk constraint is satisfied. We introduce the risk-constrained RL framework, cover popular risk measures based on variance, conditional value-at-risk and cumulative prospect theory, and present a template for a risk-sensitive RL algorithm. We survey some of our recent work on this topic, covering problems encompassing discounted cost, average cost, and stochastic shortest path settings, together with the aforementioned risk measures in a constrained framework. This non-exhaustive survey is aimed at giving a flavor of the challenges involved in solving a risk-sensitive RL problem, and outlining some potential future research directions

arXiv.org e-Print Archive

Denumerable state semi-Markov decision processes with unbounded costs average cost criterion

Author: Federgruen A.
Hordijk A. (Arie)
Tijms H.C. (Henk)
Publication venue: North-Holland Publishing Company
Publication date: 01/01/1978
Field of study

AbstractThis paper establishes a rather complete optimality theory for the average cost semi-Markov decision model with a denumerable state space, compact metric action sets and unbounded one-step costs for the case where the underlying Markov chains have a single ergotic set. Under a condition which, roughly speaking, requires the existence of a finite set such that the supremum over all stationary policies of the expected time and the total expected absolute cost incurred until the first return to this set are finite for any starting state, we shall verify the existence of a finite solution to the average costs optimality equation and the existence of an average cost optimal stationary policy

Elsevier - Publisher Connector

CWI's Institutional Repository

The effective temperature

Author: Hydon PE
Rasin OG
Sahadevan R
Publication venue
Publication date: 01/07/2007
Field of study

This review presents the effective temperature notion as defined from the deviations from the equilibrium fluctuation-dissipation theorem in out of equilibrium systems with slow dynamics. The thermodynamic meaning of this quantity is discussed in detail. Analytic, numeric and experimental measurements are surveyed. Open issues are mentioned.Comment: 58 page

arXiv.org e-Print Archive

Crossref

Surrey Research Insight

Fourteenth Conference on Stochastic Processes and their Applications Gothenberg, Sweden, 12–16 June 1984

Author
Publication venue: Published by Elsevier B.V.
Publication date
Field of study

Elsevier - Publisher Connector

Reaction Networks and Population Dynamics

Author
Publication venue: Zürich : EMS Publ. House
Publication date: 01/01/2017
Field of study

Reaction systems and population dynamics constitute two highly developed areas of research that build on well-defined model classes, both in terms of dynamical systems and stochastic processes. Despite a significant core of common structures, the two fields have largely led separate lives. The workshop brought the communities together and emphasised concepts, methods and results that have, so far, appeared in one area but are potentially useful in the other as well

Repositorium für Naturwissenschaften und Technik