11,226 research outputs found

    R3MC: A Riemannian three-factor algorithm for low-rank matrix completion

    We exploit the versatile framework of Riemannian optimization on quotient manifolds to develop R3MC, a nonlinear conjugate-gradient method for low-rank matrix completion. The underlying search space of fixed-rank matrices is endowed with a novel Riemannian metric that is tailored to the least-squares cost. Numerical comparisons suggest that R3MC robustly outperforms state-of-the-art algorithms across different problem instances, especially those that combine scarcely sampled and ill-conditioned data.
    Comment: Accepted for publication in the proceedings of the 53rd IEEE Conference on Decision and Control, 2014
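
    To make the three-factor model concrete, here is a minimal sketch of matrix completion by plain Euclidean gradient descent on X = U S V^T, assuming NumPy; the function name complete and all hyperparameters are illustrative, and R3MC's tailored Riemannian metric and conjugate-gradient updates are deliberately not reproduced.

        # Hypothetical sketch: Euclidean gradient descent on the three-factor
        # model X = U S V^T for least-squares matrix completion. This is NOT
        # R3MC; it only illustrates the cost that the paper's Riemannian
        # metric is tailored to.
        import numpy as np

        def complete(M, mask, rank, steps=500, lr=1e-3):
            m, n = M.shape
            rng = np.random.default_rng(0)
            U = rng.standard_normal((m, rank))
            S = np.eye(rank)
            V = rng.standard_normal((n, rank))
            for _ in range(steps):
                R = mask * (U @ S @ V.T - M)   # residual on observed entries only
                U -= lr * R @ V @ S.T          # gradient w.r.t. U
                S -= lr * U.T @ R @ V          # gradient w.r.t. S
                V -= lr * R.T @ U @ S          # gradient w.r.t. V
            return U @ S @ V.T

    A Riemannian method such as R3MC replaces these raw gradient steps with conjugate-gradient updates along the fixed-rank manifold under its tailored metric.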

    Composing Scalable Nonlinear Algebraic Solvers

    Most efficient linear solvers use composable algorithmic components, with the most common model being the combination of a Krylov accelerator and one or more preconditioners. A similar set of concepts may be used for nonlinear algebraic systems, where nonlinear composition of different nonlinear solvers may significantly improve the time to solution. We describe the basic concepts of nonlinear composition and preconditioning and present a number of solvers applicable to nonlinear partial differential equations. We have developed a software framework to make it easy to explore the possible combinations of solvers. We show that the performance gains from composed solvers can be substantial compared with standard Newton-Krylov methods.
    Comment: 29 pages, 14 figures, 13 tables
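
    For context, a minimal sketch of the standard Newton-Krylov baseline that composed solvers are measured against, assuming NumPy and SciPy; F is the nonlinear residual and J_mv a user-supplied Jacobian-vector product, both hypothetical names (SciPy also ships a ready-made scipy.optimize.newton_krylov).

        # Plain Newton-Krylov: an inexact Newton method whose inner linear
        # solve is a matrix-free Krylov iteration (GMRES). The paper composes
        # and preconditions nonlinear solvers beyond this baseline.
        import numpy as np
        from scipy.sparse.linalg import LinearOperator, gmres

        def newton_krylov(F, J_mv, x, tol=1e-8, max_iter=50):
            for _ in range(max_iter):
                r = F(x)
                if np.linalg.norm(r) < tol:
                    break
                J = LinearOperator((x.size, x.size),
                                   matvec=lambda v: J_mv(x, v))
                dx, _ = gmres(J, -r)   # Krylov solve for the Newton step
                x = x + dx
            return x

    Nonlinear composition replaces or wraps the outer Newton loop itself, for example running one nonlinear solver as a preconditioner for another.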

    Conic Optimization Theory: Convexification Techniques and Numerical Algorithms

    Optimization is at the core of control theory and appears in several areas of this field, such as optimal control, distributed control, system identification, robust control, state estimation, model predictive control and dynamic programming. Recent advances in various topics of modern optimization have also been revamping the area of machine learning. Motivated by the crucial role of optimization theory in the design, analysis, control and operation of real-world systems, this tutorial paper offers a detailed overview of some major advances in this area, namely conic optimization and its emerging applications. First, we discuss the importance of conic optimization in different areas. Then, we explain seminal results on the design of hierarchies of convex relaxations for a wide range of nonconvex problems. Finally, we study different numerical algorithms for large-scale conic optimization problems.
    Comment: 18 pages
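
    As a small, hedged example of the convexification idea, the following sketch (assuming CVXPY; all names are illustrative) builds the classic Shor semidefinite relaxation of a nonconvex quadratic problem over {-1, +1}^n by replacing the rank-one matrix x x^T with a PSD variable X:

        # Shor-style SDP relaxation of min x^T Q x subject to x_i in {-1, +1}:
        # lift to X = x x^T, then relax to X PSD with diag(X) = 1. The optimal
        # value is a lower bound on the nonconvex problem.
        import cvxpy as cp
        import numpy as np

        n = 4
        rng = np.random.default_rng(0)
        A = rng.standard_normal((n, n))
        Q = A + A.T                          # symmetric, generally indefinite

        X = cp.Variable((n, n), symmetric=True)
        constraints = [X >> 0, cp.diag(X) == 1]
        prob = cp.Problem(cp.Minimize(cp.trace(Q @ X)), constraints)
        prob.solve()
        print("SDP lower bound:", prob.value)

    Hierarchies of the kind the paper surveys tighten this first-level relaxation by adding further moment constraints.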

    Do optimization methods in deep learning applications matter?

    With advances in deep learning, exponential data growth and increasing model complexity, developing efficient optimization methods is attracting much research attention. Several implementations favor Conjugate Gradient (CG) and Stochastic Gradient Descent (SGD) as practical and elegant routes to quick convergence; however, these optimizers also present many limitations across deep learning applications. Recent research is exploring higher-order optimization methods as better approaches, but these pose complex computational challenges for practical use. Comparing first- and higher-order optimization methods, our experiments in this paper reveal that Levenberg-Marquardt (LM) achieves significantly better convergence but suffers from very long processing times, increasing the training cost of both classification and reinforcement learning problems. Our experiments compare off-the-shelf optimization methods (CG, SGD, LM and L-BFGS) on standard CIFAR, MNIST, CartPole and FlappyBird benchmarks. The paper presents arguments on which optimization methods to use and, further, which would benefit from parallelization efforts to improve pretraining time and convergence.
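
    A minimal sketch of such a comparison, assuming PyTorch (LM is not in torch.optim, so L-BFGS stands in for the higher-order side; the model, data and learning rates are illustrative):

        # Compare a first-order optimizer (SGD) with a quasi-Newton one
        # (L-BFGS) on the same tiny regression model. L-BFGS requires a
        # closure that recomputes the loss; SGD simply ignores its return.
        import torch
        import torch.nn as nn

        x, y = torch.randn(256, 10), torch.randn(256, 1)
        loss_fn = nn.MSELoss()

        def fresh_model():
            torch.manual_seed(0)             # identical initialization per run
            return nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))

        def train(make_opt, steps=50):
            model = fresh_model()
            opt = make_opt(model.parameters())
            for _ in range(steps):
                def closure():
                    opt.zero_grad()
                    loss = loss_fn(model(x), y)
                    loss.backward()
                    return loss
                opt.step(closure)
            return loss_fn(model(x), y).item()

        print("SGD:   ", train(lambda p: torch.optim.SGD(p, lr=1e-2)))
        print("L-BFGS:", train(lambda p: torch.optim.LBFGS(p, lr=0.5)))

    Training each optimizer from the same initialization, as here, is what makes per-method convergence comparisons meaningful.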