993 research outputs found

    Riemannian Optimization via Frank-Wolfe Methods

    Full text link
    We study projection-free methods for constrained Riemannian optimization. In particular, we propose the Riemannian Frank-Wolfe (RFW) method. We analyze non-asymptotic convergence rates of RFW to an optimum for (geodesically) convex problems, and to a critical point for nonconvex objectives. We also present a practical setting under which RFW can attain a linear convergence rate. As a concrete example, we specialize Rfw to the manifold of positive definite matrices and apply it to two tasks: (i) computing the matrix geometric mean (Riemannian centroid); and (ii) computing the Bures-Wasserstein barycenter. Both tasks involve geodesically convex interval constraints, for which we show that the Riemannian "linear oracle" required by RFW admits a closed-form solution; this result may be of independent interest. We further specialize RFW to the special orthogonal group and show that here too, the Riemannian "linear oracle" can be solved in closed form. Here, we describe an application to the synchronization of data matrices (Procrustes problem). We complement our theoretical results with an empirical comparison of Rfw against state-of-the-art Riemannian optimization methods and observe that RFW performs competitively on the task of computing Riemannian centroids.Comment: Under Review. Largely revised version, including an extended experimental section and an application to the special orthogonal group and the Procrustes proble

    S-Lemma with Equality and Its Applications

    Full text link
    Let f(x)=xTAx+2aTx+cf(x)=x^TAx+2a^Tx+c and h(x)=xTBx+2bTx+dh(x)=x^TBx+2b^Tx+d be two quadratic functions having symmetric matrices AA and BB. The S-lemma with equality asks when the unsolvability of the system f(x)<0,h(x)=0f(x)<0, h(x)=0 implies the existence of a real number μ\mu such that f(x)+μh(x)0, xRnf(x) + \mu h(x)\ge0, ~\forall x\in \mathbb{R}^n. The problem is much harder than the inequality version which asserts that, under Slater condition, f(x)<0,h(x)0f(x)<0, h(x)\le0 is unsolvable if and only if f(x)+μh(x)0, xRnf(x) + \mu h(x)\ge0, ~\forall x\in \mathbb{R}^n for some μ0\mu\ge0. In this paper, we show that the S-lemma with equality does not hold only when the matrix AA has exactly one negative eigenvalue and h(x)h(x) is a non-constant linear function (B=0,b0B=0, b\not=0). As an application, we can globally solve inf{f(x)h(x)=0}\inf\{f(x)\vert h(x)=0\} as well as the two-sided generalized trust region subproblem inf{f(x)lh(x)u}\inf\{f(x)\vert l\le h(x)\le u\} without any condition. Moreover, the convexity of the joint numerical range {(f(x),h1(x),,hp(x)): xRn}\{(f(x), h_1(x),\ldots, h_p(x)):~x\in\Bbb R^n\} where ff is a (possibly non-convex) quadratic function and h1(x),,hp(x)h_1(x),\ldots,h_p(x) are affine functions can be characterized using the newly developed S-lemma with equality.Comment: 34 page
    corecore