Search CORE

3 research outputs found

Exponential Convergence and stability of Howards's Policy Improvement Algorithm for Controlled Diffusions

Author: Kerimkulov Bekzhan
Siska David
Szpruch Lukas
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 22/05/2020
Field of study

Optimal control problems are inherently hard to solve as the optimization must be performed simultaneously with updating the underlying system. Starting from an initial guess, Howard's policy improvement algorithm separates the step of updating the trajectory of the dynamical system from the optimization and iterations of this should converge to the optimal control. In the discrete space-time setting this is often the case and even rates of convergence are known. In the continuous space-time setting of controlled diffusion the algorithm consists of solving a linear PDE followed by maximization problem. This has been shown to converge, in some situations, however no global rate of is known. The first main contribution of this paper is to establish global rate of convergence for the policy improvement algorithm and a variant, called here the gradient iteration algorithm. The second main contribution is the proof of stability of the algorithms under perturbations to both the accuracy of the linear PDE solution and the accuracy of the maximization step. The proof technique is new in this context as it uses the theory of backward stochastic differential equations.Comment: Identical to the published version except minor typographical detail

arXiv.org e-Print Archive

Edinburgh Research Explorer

Time discretization of FBSDE with polynomial growth drivers and reaction-diffusion PDEs

Author: Dos Reis Goncalo
Lionnet Arnaud
Szpruch Lukas
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2015
Field of study

In this paper, we undertake the error analysis of the time discretization of systems of Forward-Backward Stochastic Differential Equations (FBSDEs) with drivers having polynomial growth and that are also monotone in the state variable. We show with a counter-example that the natural explicit Euler scheme may diverge, unlike in the canonical Lipschitz driver case. This is due to the lack of a certain stability property of the Euler scheme which is essential to obtain convergence. However, a thorough analysis of the family of

\theta

-schemes reveals that this required stability property can be recovered if the scheme is sufficiently implicit. As a by-product of our analysis, we shed some light on higher order approximation schemes for FBSDEs under non-Lipschitz condition. We then return to fully explicit schemes and show that an appropriately tamed version of the explicit Euler scheme enjoys the required stability property and as a consequence converges. In order to establish convergence of the several discretizations, we extend the canonical path- and first-order variational regularity results to FBSDEs with polynomial growth drivers which are also monotone. These results are of independent interest for the theory of FBSDEs.Comment: Published at http://dx.doi.org/10.1214/14-AAP1056 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

University of Birmingham Research Portal

Edinburgh Research Explorer

Almost sure exponential stability of numerical solutions for stochastic delay differential equations

Author: A. Rodkina
A. Rodkina
A.N. Shiryaev
C.T.H. Baker
C.T.H. Baker
D.J. Higham
D.J. Higham
E. Hairer
Fuke Wu
K. Burrage
K. Burrage
Lukas Szpruch
P.E. Kloeden
R.Sh. Liptser
S. Pang
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
X. Mao
Xuerong Mao
Y. Saito
Y. Saito
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2010
Field of study

Using techniques based on the continuous and discrete semimartingale convergence theorems, this paper investigates if numerical methods may reproduce the almost sure exponential stability of the exact solutions to stochastic delay differential equations (SDDEs). The important feature of this technique is that it enables us to study the almost sure exponential stability of numerical solutions of SDDEs directly. This is significantly different from most traditional methods by which the almost sure exponential stability is derived from the moment stability by the Chebyshev inequality and the Borel–Cantelli lemma

Crossref

University of Strathclyde Institutional Repository