3,318 research outputs found

    Maximum principle for a stochastic delayed system involving terminal state constraints

    Full text link
    We investigate a stochastic optimal control problem where the controlled system is depicted as a stochastic differential delayed equation; however, at the terminal time, the state is constrained in a convex set. We firstly introduce an equivalent backward delayed system depicted as a time-delayed backward stochastic differential equation. Then a stochastic maximum principle is obtained by virtue of Ekeland's variational principle. Finally, applications to a state constrained stochastic delayed linear-quadratic control model and a production-consumption choice problem are studied to illustrate the main obtained result.Comment: 16 page

    Path integrals and symmetry breaking for optimal control theory

    Get PDF
    This paper considers linear-quadratic control of a non-linear dynamical system subject to arbitrary cost. I show that for this class of stochastic control problems the non-linear Hamilton-Jacobi-Bellman equation can be transformed into a linear equation. The transformation is similar to the transformation used to relate the classical Hamilton-Jacobi equation to the Schr\"odinger equation. As a result of the linearity, the usual backward computation can be replaced by a forward diffusion process, that can be computed by stochastic integration or by the evaluation of a path integral. It is shown, how in the deterministic limit the PMP formalism is recovered. The significance of the path integral approach is that it forms the basis for a number of efficient computational methods, such as MC sampling, the Laplace approximation and the variational approximation. We show the effectiveness of the first two methods in number of examples. Examples are given that show the qualitative difference between stochastic and deterministic control and the occurrence of symmetry breaking as a function of the noise.Comment: 21 pages, 6 figures, submitted to JSTA

    Team decision theory for linear continuous-time systems

    Get PDF
    This paper develops a team decision theory for linear-quadratic (LQ) continuous-time systems. First, a counterpart of the well-known result of Radner on quadratic static teams is obtained for two-member continuous-time LQ static team problems when the statistics of the random variables involved are not necessarily Gaussian. An iterative convergent scheme is developed, which in the limit yields the optimal team strategies. For the special case of Gaussian distributions, the team-optimal solution is affine in the information available to each DM, and for the further special case when the team cost function does not penalize the intermediate values of state, the optimal strategies can be obtained by solving a Liapunov type time-invariant matrix equation. This static theory is then extended to LQG continuous-time dynamic teams with sampled observations under the one-step-delay observation sharing pattern. The unique solution is again affine in the information available to each DM, and further, it features a certainty-equivalence property

    Stochastic Control of Memory Mean-Field Processes

    Full text link
    By a memory mean-field process we mean the solution X(⋅)X(\cdot) of a stochastic mean-field equation involving not just the current state X(t)X(t) and its law L(X(t))\mathcal{L}(X(t)) at time tt, but also the state values X(s)X(s) and its law L(X(s))\mathcal{L}(X(s)) at some previous times s<ts<t. Our purpose is to study stochastic control problems of memory mean-field processes. - We consider the space M\mathcal{M} of measures on R\mathbb{R} with the norm ∣∣⋅∣∣M|| \cdot||_{\mathcal{M}} introduced by Agram and {\O}ksendal in \cite{AO1}, and prove the existence and uniqueness of solutions of memory mean-field stochastic functional differential equations. - We prove two stochastic maximum principles, one sufficient (a verification theorem) and one necessary, both under partial information. The corresponding equations for the adjoint variables are a pair of \emph{(time-) advanced backward stochastic differential equations}, one of them with values in the space of bounded linear functionals on path segment spaces. - As an application of our methods, we solve a memory mean-variance problem as well as a linear-quadratic problem of a memory process
    • …
    corecore