7,603 research outputs found
Policy Optimization of Finite-Horizon Kalman Filter with Unknown Noise Covariance
This paper is on learning the Kalman gain by policy optimization method.
Firstly, we reformulate the finite-horizon Kalman filter as a policy
optimization problem of the dual system. Secondly, we obtain the global linear
convergence of exact gradient descent method in the setting of known
parameters. Thirdly, the gradient estimation and stochastic gradient descent
method are proposed to solve the policy optimization problem, and further the
global linear convergence and sample complexity of stochastic gradient descent
are provided for the setting of unknown noise covariance matrices and known
model parameters
Continuous-time Mean-Variance Portfolio Selection with Stochastic Parameters
This paper studies a continuous-time market {under stochastic environment}
where an agent, having specified an investment horizon and a target terminal
mean return, seeks to minimize the variance of the return with multiple stocks
and a bond. In the considered model firstly proposed by [3], the mean returns
of individual assets are explicitly affected by underlying Gaussian economic
factors. Using past and present information of the asset prices, a
partial-information stochastic optimal control problem with random coefficients
is formulated. Here, the partial information is due to the fact that the
economic factors can not be directly observed. Via dynamic programming theory,
the optimal portfolio strategy can be constructed by solving a deterministic
forward Riccati-type ordinary differential equation and two linear
deterministic backward ordinary differential equations
Solving Coupled Nonlinear Forward-backward Stochastic Differential Equations: An Optimization Perspective with Backward Measurability Loss
This paper aims to extend the BML method proposed in Wang et al. [22] to make
it applicable to more general coupled nonlinear FBSDEs. We interpret BML from
the fixed-point iteration perspective and show that optimizing BML is
equivalent to minimizing the distance between two consecutive trial solutions
in a fixed-point iteration. Thus, this paper provides a theoretical foundation
for an optimization-based approach to solving FBSDEs. We also empirically
evaluate the method through four numerical experiments
On a generalization of R. Chapman's "evil determinant"
Let be an odd prime and be an indeterminate. Recently, Z.-W. Sun
proposed the following conjecture:
\det\left[x+\left(\frac{j-i}{p}\right)\right]_{0\le i,j\le
\frac{p-1}{2}}=\begin{cases} (\frac{2}{p})pb_px-a_p & \mbox{if}\ p\equiv
1\pmod4, 1 & \mbox{if}\ p\equiv 3\pmod4, \end{cases} where and
are rational numbers related to the fundamental unit and class number of the
real quadratic field . In this paper, we confirm the
above conjecture of Sun based on Vsemirnov's decomposition of Chapman's "evil
determinant"
- β¦