7,603 research outputs found

    Policy Optimization of Finite-Horizon Kalman Filter with Unknown Noise Covariance

    Full text link
    This paper is on learning the Kalman gain by policy optimization method. Firstly, we reformulate the finite-horizon Kalman filter as a policy optimization problem of the dual system. Secondly, we obtain the global linear convergence of exact gradient descent method in the setting of known parameters. Thirdly, the gradient estimation and stochastic gradient descent method are proposed to solve the policy optimization problem, and further the global linear convergence and sample complexity of stochastic gradient descent are provided for the setting of unknown noise covariance matrices and known model parameters

    Continuous-time Mean-Variance Portfolio Selection with Stochastic Parameters

    Full text link
    This paper studies a continuous-time market {under stochastic environment} where an agent, having specified an investment horizon and a target terminal mean return, seeks to minimize the variance of the return with multiple stocks and a bond. In the considered model firstly proposed by [3], the mean returns of individual assets are explicitly affected by underlying Gaussian economic factors. Using past and present information of the asset prices, a partial-information stochastic optimal control problem with random coefficients is formulated. Here, the partial information is due to the fact that the economic factors can not be directly observed. Via dynamic programming theory, the optimal portfolio strategy can be constructed by solving a deterministic forward Riccati-type ordinary differential equation and two linear deterministic backward ordinary differential equations

    Solving Coupled Nonlinear Forward-backward Stochastic Differential Equations: An Optimization Perspective with Backward Measurability Loss

    Full text link
    This paper aims to extend the BML method proposed in Wang et al. [22] to make it applicable to more general coupled nonlinear FBSDEs. We interpret BML from the fixed-point iteration perspective and show that optimizing BML is equivalent to minimizing the distance between two consecutive trial solutions in a fixed-point iteration. Thus, this paper provides a theoretical foundation for an optimization-based approach to solving FBSDEs. We also empirically evaluate the method through four numerical experiments

    On a generalization of R. Chapman's "evil determinant"

    Full text link
    Let pp be an odd prime and xx be an indeterminate. Recently, Z.-W. Sun proposed the following conjecture: \det\left[x+\left(\frac{j-i}{p}\right)\right]_{0\le i,j\le \frac{p-1}{2}}=\begin{cases} (\frac{2}{p})pb_px-a_p & \mbox{if}\ p\equiv 1\pmod4, 1 & \mbox{if}\ p\equiv 3\pmod4, \end{cases} where apa_p and bpb_p are rational numbers related to the fundamental unit and class number of the real quadratic field Q(p)\mathbb{Q}(\sqrt{p}). In this paper, we confirm the above conjecture of Sun based on Vsemirnov's decomposition of Chapman's "evil determinant"
    • …
    corecore