    Shape-Changing Trust-Region Methods Using Multipoint Symmetric Secant Matrices

    In this work, we consider methods for large-scale, nonconvex unconstrained optimization. We propose a new trust-region method whose subproblem is defined using a so-called "shape-changing" norm together with densely-initialized multipoint symmetric secant (MSS) matrices to approximate the Hessian. Shape-changing norms and dense initializations have been used successfully in the context of traditional quasi-Newton methods, but have yet to be explored in the case of MSS methods. Numerical results suggest that trust-region methods using densely-initialized MSS matrices together with shape-changing norms outperform other MSS-based trust-region methods.
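    The full shape-changing MSS machinery is involved, but the two ingredients the abstract names can be illustrated compactly. The sketch below (Python/NumPy; all names and tolerances are assumptions, not the authors' implementation) solves the trust-region subproblem under a shape-changing (P, infinity)-style norm, which makes the subproblem separable in the eigenbasis of the Hessian approximation, and uses an SR1 symmetric secant update as a simple stand-in for the densely-initialized MSS update.

        import numpy as np

        def sc_subproblem(g, B, delta):
            """Minimize g'p + 0.5 p'Bp subject to a shape-changing norm bound.

            The (P, inf)-style norm measures p coordinate-wise in the
            eigenbasis of B, so the subproblem decouples into 1-D problems.
            """
            lam, Q = np.linalg.eigh(B)
            gbar = Q.T @ g
            v = np.zeros_like(gbar)
            for i in range(len(gbar)):
                if lam[i] > 0:                        # convex direction
                    v[i] = np.clip(-gbar[i] / lam[i], -delta, delta)
                elif gbar[i] != 0.0:                  # nonconvex: go to the boundary
                    v[i] = -np.sign(gbar[i]) * delta
                elif lam[i] < 0:                      # negative curvature, zero slope
                    v[i] = delta
            return Q @ v

        def trust_region_mss(f, grad, x, delta=1.0, tol=1e-6, max_iter=500):
            B = np.eye(len(x))                        # Hessian approximation
            for _ in range(max_iter):
                g = grad(x)
                if np.linalg.norm(g) < tol:
                    break
                p = sc_subproblem(g, B, delta)
                pred = -(g @ p + 0.5 * p @ B @ p)     # predicted decrease
                rho = (f(x) - f(x + p)) / max(pred, 1e-16)
                if rho > 0.75:                        # model trusted: grow radius
                    delta *= 2.0
                elif rho < 0.25:                      # poor model: shrink radius
                    delta *= 0.25
                if rho > 1e-4:                        # accept the step
                    s = p
                    y = grad(x + s) - g
                    x = x + s
                    r = y - B @ s                     # SR1 symmetric secant update
                    if abs(r @ s) > 1e-8 * np.linalg.norm(r) * np.linalg.norm(s):
                        B = B + np.outer(r, r) / (r @ s)
            return x

    In a genuinely large-scale setting the eigendecomposition would be applied implicitly to a compact (limited-memory) representation of B rather than to a dense matrix, which is what makes the shape-changing approach practical.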

    Efficient SDP Inference for Fully-connected CRFs Based on Low-rank Decomposition

    Conditional Random Fields (CRFs) have been widely used in a variety of computer vision tasks. Conventional CRFs typically define edges between neighboring image pixels, resulting in a sparse graph on which efficient inference can be performed. However, these CRFs fail to model long-range contextual relationships. Fully-connected CRFs have thus been proposed. While efficient approximate inference methods exist for such CRFs, they are usually sensitive to initialization and make strong assumptions. In this work, we develop an efficient yet general algorithm for inference on fully-connected CRFs. The algorithm is based on a scalable SDP algorithm and a low-rank approximation of the similarity/kernel matrix. The core of the proposed algorithm is a tailored quasi-Newton method that takes advantage of the low-rank matrix approximation when solving the specialized SDP dual problem. Experiments demonstrate that our method can be applied to fully-connected CRFs that could not be solved previously, such as pixel-level image co-segmentation.
    Comment: 15 pages. A conference version of this work appears in Proc. IEEE Conference on Computer Vision and Pattern Recognition, 201
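    The low-rank approximation of the similarity/kernel matrix is what makes the SDP dual tractable at pixel scale. Here is a minimal sketch of one standard way to build such a factor (Nystrom sampling with a Gaussian kernel; the paper's kernel and factorization may differ, and the function name and parameters are assumptions):

        import numpy as np

        def nystrom_factor(X, m, gamma=1.0, seed=0):
            """Return L (n x m) with K ~= L @ L.T for a Gaussian kernel on rows of X."""
            rng = np.random.default_rng(seed)
            n = X.shape[0]
            idx = rng.choice(n, size=m, replace=False)             # landmark points
            d2 = ((X[:, None, :] - X[None, idx, :]) ** 2).sum(-1)  # squared distances
            C = np.exp(-gamma * d2)                                # n x m kernel columns
            W = C[idx]                                             # m x m landmark block
            lam, U = np.linalg.eigh(W)
            lam = np.clip(lam, 1e-12, None)                        # guard tiny eigenvalues
            return C @ (U / np.sqrt(lam))                          # K ~= L @ L.T

    Working with the n x m factor instead of the dense n x n kernel lets a quasi-Newton solver for the dual rely on O(nm) matrix-vector products, which is the scalability the abstract refers to.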

    Fast B-spline Curve Fitting by L-BFGS

    We propose a novel method for fitting planar B-spline curves to unorganized data points. In traditional methods, optimization of control points and foot points is performed in two time-consuming steps in each iteration: 1) the control points are updated by setting up and solving a linear system of equations; and 2) the foot points are computed by projecting each data point onto the B-spline curve. Our method uses the L-BFGS optimization method to optimize control points and foot points simultaneously, so it needs neither matrix computation nor foot-point projection in each iteration. As a result, our method is much faster than existing methods.
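    A minimal sketch of the joint formulation in Python/SciPy. The knot vector, the initialization, and the use of the bound-constrained L-BFGS-B variant (to keep foot-point parameters inside the knot range) are assumptions for illustration; the analytic gradient the method would use in practice is omitted, so SciPy falls back to finite differences here.

        import numpy as np
        from scipy.interpolate import BSpline
        from scipy.optimize import minimize

        def fit_bspline(data, n_ctrl=8, degree=3):
            """Jointly fit control points and foot points of a planar B-spline.

            data: (m, 2) array of points, assumed roughly ordered here so
            that uniform initial foot points are a reasonable starting guess.
            """
            data = np.asarray(data, dtype=float)
            m = len(data)
            # Clamped uniform knot vector on [0, 1]: n_ctrl + degree + 1 knots.
            knots = np.concatenate([np.zeros(degree),
                                    np.linspace(0.0, 1.0, n_ctrl - degree + 1),
                                    np.ones(degree)])
            c0 = np.linspace(data[0], data[-1], n_ctrl)   # initial control polygon
            t0 = np.linspace(0.0, 1.0, m)                 # initial foot points

            def objective(z):
                ctrl = z[:2 * n_ctrl].reshape(n_ctrl, 2)  # unpack control points
                t = z[2 * n_ctrl:]                        # unpack foot points
                residual = BSpline(knots, ctrl, degree)(t) - data
                return 0.5 * np.sum(residual ** 2)

            z0 = np.concatenate([c0.ravel(), t0])
            bounds = [(None, None)] * (2 * n_ctrl) + [(0.0, 1.0)] * m
            res = minimize(objective, z0, method="L-BFGS-B", bounds=bounds)
            return res.x[:2 * n_ctrl].reshape(n_ctrl, 2), res.x[2 * n_ctrl:]

    Because the control points and the foot-point parameters move together in each L-BFGS step, no linear solve and no closest-point projection is needed per iteration, which is where the claimed speedup comes from.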

    The Relationships Between CG, BFGS, and Two Limited-memory Algorithms

    For the solution of linear systems, the conjugate gradient (CG) and BFGS methods are among the most popular and successful algorithms, each with its own advantages. Limited-memory methods have been developed to combine the best of the two. We describe and examine CG, BFGS, and two limited-memory methods (L-BFGS and VSCG) in the context of linear systems. We focus on the relationships among the four algorithms and present numerical results to illustrate those relationships.
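    For reference, here is plain CG on a symmetric positive definite system, the common ground of the four methods discussed (a textbook sketch, not code from the paper):

        import numpy as np

        def conjugate_gradient(A, b, tol=1e-10, max_iter=None):
            """Solve Ax = b for symmetric positive definite A.

            On a quadratic, BFGS with exact line searches started from the
            identity generates these same iterates, which is the classical
            link between CG and BFGS that the limited-memory methods exploit.
            """
            n = len(b)
            x = np.zeros(n)
            r = b - A @ x                  # residual = negative gradient
            p = r.copy()                   # first search direction
            rs = r @ r
            for _ in range(max_iter or n):
                Ap = A @ p
                alpha = rs / (p @ Ap)      # exact line search along p
                x += alpha * p
                r -= alpha * Ap
                rs_new = r @ r
                if np.sqrt(rs_new) < tol:
                    break
                p = r + (rs_new / rs) * p  # keep directions A-conjugate
                rs = rs_new
            return x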

    Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

    We apply stochastic average gradient (SAG) algorithms to the training of conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the SAGA variant under non-uniform sampling. Our experimental results reveal that our method often significantly outperforms existing methods in terms of the training objective, and performs as well as or better than optimally-tuned stochastic gradient methods in terms of test error.
    Comment: AI/Stats 2015, 24 pages
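    The abstract's two practical ideas, a gradient memory and non-uniform sampling, fit in a short sketch. Below, SAG with Lipschitz-weighted sampling on l2-regularized logistic regression stands in for the CRF objective; the step size and sampling rule are one simple choice, not the paper's analyzed scheme, and the CRF-specific memory reduction is not reproduced (all names here are assumptions):

        import numpy as np

        def sag_nonuniform(X, y, epochs=20, seed=0):
            """SAG with non-uniform sampling on logistic loss, y in {-1, +1}."""
            n, d = X.shape
            reg = 1.0 / n
            L = 0.25 * np.sum(X ** 2, axis=1) + reg   # per-example Lipschitz bounds
            p = L / L.sum()                           # sample hard examples more often
            step = 1.0 / L.mean()
            w = np.zeros(d)
            mem = np.zeros((n, d))                    # stored per-example gradients
            g_avg = np.zeros(d)                       # running average of the table
            rng = np.random.default_rng(seed)
            for _ in range(epochs * n):
                i = rng.choice(n, p=p)
                g_new = -y[i] * X[i] / (1.0 + np.exp(y[i] * (X[i] @ w))) + reg * w
                g_avg += (g_new - mem[i]) / n         # refresh the average in O(d)
                mem[i] = g_new
                w -= step * g_avg                     # step along the averaged gradient
            return w

    For a linear model the memory table can be shrunk to one scalar per example (the loss derivative), which is the kind of structure in the gradient the abstract exploits to cut memory for CRFs.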