3,318 research outputs found
Maximum principle for a stochastic delayed system involving terminal state constraints
We investigate a stochastic optimal control problem where the controlled
system is depicted as a stochastic differential delayed equation; however, at
the terminal time, the state is constrained in a convex set. We firstly
introduce an equivalent backward delayed system depicted as a time-delayed
backward stochastic differential equation. Then a stochastic maximum principle
is obtained by virtue of Ekeland's variational principle. Finally, applications
to a state constrained stochastic delayed linear-quadratic control model and a
production-consumption choice problem are studied to illustrate the main
obtained result.Comment: 16 page
Path integrals and symmetry breaking for optimal control theory
This paper considers linear-quadratic control of a non-linear dynamical
system subject to arbitrary cost. I show that for this class of stochastic
control problems the non-linear Hamilton-Jacobi-Bellman equation can be
transformed into a linear equation. The transformation is similar to the
transformation used to relate the classical Hamilton-Jacobi equation to the
Schr\"odinger equation. As a result of the linearity, the usual backward
computation can be replaced by a forward diffusion process, that can be
computed by stochastic integration or by the evaluation of a path integral. It
is shown, how in the deterministic limit the PMP formalism is recovered. The
significance of the path integral approach is that it forms the basis for a
number of efficient computational methods, such as MC sampling, the Laplace
approximation and the variational approximation. We show the effectiveness of
the first two methods in number of examples. Examples are given that show the
qualitative difference between stochastic and deterministic control and the
occurrence of symmetry breaking as a function of the noise.Comment: 21 pages, 6 figures, submitted to JSTA
Team decision theory for linear continuous-time systems
This paper develops a team decision theory for linear-quadratic (LQ) continuous-time systems. First, a counterpart of the well-known result of Radner on quadratic static teams is obtained for two-member continuous-time LQ static team problems when the statistics of the random variables involved are not necessarily Gaussian. An iterative convergent scheme is developed, which in the limit yields the optimal team strategies. For the special case of Gaussian distributions, the team-optimal solution is affine in the information available to each DM, and for the further special case when the team cost function does not penalize the intermediate values of state, the optimal strategies can be obtained by solving a Liapunov type time-invariant matrix equation. This static theory is then extended to LQG continuous-time dynamic teams with sampled observations under the one-step-delay observation sharing pattern. The unique solution is again affine in the information available to each DM, and further, it features a certainty-equivalence property
Stochastic Control of Memory Mean-Field Processes
By a memory mean-field process we mean the solution of a
stochastic mean-field equation involving not just the current state and
its law at time , but also the state values and
its law at some previous times . Our purpose is to
study stochastic control problems of memory mean-field processes.
- We consider the space of measures on with the
norm introduced by Agram and {\O}ksendal in
\cite{AO1}, and prove the existence and uniqueness of solutions of memory
mean-field stochastic functional differential equations.
- We prove two stochastic maximum principles, one sufficient (a verification
theorem) and one necessary, both under partial information. The corresponding
equations for the adjoint variables are a pair of \emph{(time-) advanced
backward stochastic differential equations}, one of them with values in the
space of bounded linear functionals on path segment spaces.
- As an application of our methods, we solve a memory mean-variance problem
as well as a linear-quadratic problem of a memory process
- …