10,722 research outputs found

    Switching games of stochastic differential systems

    Get PDF
    Version of RecordPublishe

    Switching Games of Stochastic Differential Systems

    Full text link

    Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems

    Full text link
    In this paper, we establish that for a wide class of controlled stochastic differential equations (SDEs) with stiff coefficients, the value functions of corresponding zero-sum games can be represented by a deep artificial neural network (DNN), whose complexity grows at most polynomially in both the dimension of the state equation and the reciprocal of the required accuracy. Such nonlinear stiff systems may arise, for example, from Galerkin approximations of controlled stochastic partial differential equations (SPDEs), or controlled PDEs with uncertain initial conditions and source terms. This implies that DNNs can break the curse of dimensionality in numerical approximations and optimal control of PDEs and SPDEs. The main ingredient of our proof is to construct a suitable discrete-time system to effectively approximate the evolution of the underlying stochastic dynamics. Similar ideas can also be applied to obtain expression rates of DNNs for value functions induced by stiff systems with regime switching coefficients and driven by general L\'{e}vy noise.Comment: This revised version has been accepted for publication in Analysis and Application

    Multidimensional indefinite stochastic Riccati equations and zero-sum linear-quadratic stochastic differential games with non-markovian regime switching

    Full text link
    This paper is concerned with two-player zero-sum linear-quadratic stochastic differential games in a regime switching model. The controlled inhomogeneous system coefficients depend on the underlying noises, so it is a non-Markovian regime switching model. Based on a new kind of multidimensional indefinite stochastic Riccati equation (SRE) and a multidimensional linear backward stochastic differential equation (BSDE) with unbounded coefficients, we can provide optimal feedback control-strategy pairs for the two players in a closed-loop form. The main contribution of this paper, which is important in its own right from the BSDE theory point of view, is to prove the existence and uniqueness of the new kind of multidimensional indefinite SRE. Interestingly, the components of the solution can take positive, zero and negative values simultaneously. We also obtain the corresponding optimal feedback control-strategy pairs for homogeneous systems under closed convex cone control constraints. Finally, these results are applied to portfolio selection problems with different short-selling prohibition constraints in a regime switching market with random coefficients

    Switching Diffusions: Applications To Ecological Models, And Numerical Methods For Games In Insurance

    Get PDF
    Recently, a class of dynamic systems called ``hybrid systems containing both continuous dynamics and discrete events has been adapted to treat a wide variety of situations arising in many real-world situations. Motivated by such development, this dissertation is devoted to the study of dynamical systems involving a Markov chain as the randomly switching process. The systems studied include hybrid competitive Lotka-Volterra ecosystems and non-zero-sum stochastic differential games between two insurance companies with regime-switching. The first part is concerned with competitive Lotka-Volterra model with Markov switching. A novelty of the contribution is that the Markov chain has a countable state space. Our main objective is to reduce the computational complexity by using the two-time-scale formulation. Because the existence and uniqueness as well as continuity of solutions for Lotka-Volterra ecosystems with Markovian switching in which the switching takes place in a countable set are not available, such properties are studied first. The two-time scale feature is highlighted by introducing a small parameter into the generator of the Markov chain. When the small parameter goes to 0, there is a limit system or reduced system. It is established in this work that if the reduced system possesses certain properties such as permanence and extinction, etc., then the complex original system also has the same properties when the parameter is sufficiently small. These results are obtained by using the perturbed Lyapunov function methods. The second part develops an approximation procedure for a class of non-zero-sum stochastic differential games for investment and reinsurance between two insurance companies. Both proportional reinsurance and excess-of-loss reinsurance policies are considered. We develop numerical algorithms to obtain the approximation to the Nash equilibrium by adopting the Markov chain approximation methodology. We establish the convergence of the approximation sequences and the approximation to the value functions. Numerical examples are presented to illustrate the applicability of the algorithms

    Differential Games Controllers That Confine a System to a Safe Region in the State Space, With Applications to Surge Tank Control

    Get PDF
    Surge tanks are units employed in chemical processing to regulate the flow of fluids between reactors. A notable feature of surge tank control is the need to constrain the magnitude of the Maximum Rate of Change (MROC) of the surge tank outflow, since excessive fluctuations in the rate of change of outflow can adversely affect down-stream processing (through disturbance of sediments, initiation of turbulence, etc.). Proportional + Integral controllers, traditionally employed in surge tank control, do not take direct account of the MROC. It is therefore of interest to explore alternative approaches. We show that the surge tank controller design problem naturally fits a differential games framework, proposed by Dupuis and McEneaney, for controlling a system to confine the state to a safe region of the state space. We show furthermore that the differential game arising in this way can be solved by decomposing it into a collection of (one player) optimal control problems. We discuss the implications of this decomposition technique, for the solution of other controller design problems possessing some features of the surge tank controller design problem
    corecore