A Generic Path Algorithm for Regularized Statistical Estimation
Regularization is widely used in statistics and machine learning to prevent
overfitting and guide solutions toward prior information. In general, a
regularized estimation problem minimizes the sum of a loss function and a
penalty term. The penalty term is usually weighted by a tuning parameter and
encourages certain constraints on the parameters to be estimated. Particular
choices of constraints lead to the popular lasso, fused-lasso, and other
generalized penalized regression methods. Although there has been a lot
of research in this area, developing efficient optimization methods for many
nonseparable penalties remains a challenge. In this article we propose an exact
path solver based on ordinary differential equations (EPSODE) that works for
any convex loss function and can deal with generalized penalties as well
as more complicated regularization such as inequality constraints encountered
in shape-restricted regressions and nonparametric density estimation. In the
path following process, the solution path hits, exits, and slides along the
various constraints and vividly illustrates the tradeoffs between goodness of
fit and model parsimony. In practice, the EPSODE can be coupled with AIC, BIC,
or cross-validation to select an optimal tuning parameter. Our
applications to regularized generalized linear models,
shape-restricted regressions, Gaussian graphical models, and nonparametric
density estimation showcase the potential of the EPSODE algorithm.
Comment: 28 pages, 5 figures
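The abstract couples a solution path with AIC/BIC for tuning-parameter selection. As a minimal stand-in sketch (not the authors' EPSODE solver, which follows the path by integrating ordinary differential equations), the snippet below traces a lasso path with scikit-learn and selects the penalty weight by BIC; the data and variable names are illustrative assumptions.

```python
# Stand-in sketch of the "solution path + information criterion" workflow.
# Assumes numpy and scikit-learn are installed; not the EPSODE ODE solver.
import numpy as np
from sklearn.linear_model import lasso_path, LassoLarsIC

rng = np.random.default_rng(0)
n, p = 100, 20
X = rng.standard_normal((n, p))
beta = np.zeros(p)
beta[:3] = [2.0, -1.5, 1.0]                 # sparse ground truth (illustrative)
y = X @ beta + 0.5 * rng.standard_normal(n)

# Trace the whole solution path over a grid of penalty weights.
alphas, coefs, _ = lasso_path(X, y)
print("path grid size:", alphas.size)       # one coefficient vector per alpha

# Select the tuning parameter by BIC along the (LARS) path.
model = LassoLarsIC(criterion="bic").fit(X, y)
print("BIC-selected alpha:", model.alpha_)
print("nonzero coefficients:", np.flatnonzero(model.coef_))
```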
A Path Algorithm for Constrained Estimation
Many least squares problems involve affine equality and inequality
constraints. Although there are a variety of methods for solving such problems,
most statisticians find constrained estimation challenging. The current paper
proposes a new path following algorithm for quadratic programming based on
exact penalization. Similar penalties arise in regularization in model
selection. Classical penalty methods solve a sequence of unconstrained problems
that put greater and greater stress on meeting the constraints. In the limit as
the penalty constant tends to infinity, one recovers the constrained solution.
In the exact penalty method, squared penalties are replaced by absolute value
penalties, and the solution is recovered for a finite value of the penalty
constant. The exact path following method starts at the unconstrained solution
and follows the solution path as the penalty constant increases. In the
process, the solution path hits, slides along, and exits from the various
constraints. Path following in lasso penalized regression, in contrast, starts
with a large value of the penalty constant and works its way downward. In both
settings, inspection of the entire solution path is revealing. Just as with the
lasso and generalized lasso, it is possible to plot the effective degrees of
freedom along the solution path. For a strictly convex quadratic program, the
exact penalty algorithm can be framed entirely in terms of the sweep operator
of regression analysis. A few well chosen examples illustrate the mechanics and
potential of path following.
Comment: 26 pages, 5 figures
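To make the mechanics concrete, here is a hedged sketch under an assumed nonnegativity constraint: the squared penalty is replaced by an absolute-value (hinge) penalty, the iteration starts at the unconstrained solution, and the penalty constant is increased until the constraint holds exactly at a finite value. It uses plain proximal gradient steps rather than the paper's sweep-operator updates.

```python
# Exact (hinge) penalization of b >= 0, solved by proximal gradient for an
# increasing sequence of penalty constants rho. Illustrative, not the
# paper's sweep-operator algorithm.
import numpy as np

def prox_hinge(v, t):
    """Prox of t * sum(max(0, -v_j)): a one-sided soft threshold."""
    out = v.copy()
    out[(v < 0) & (v > -t)] = 0.0
    out[v <= -t] += t
    return out

rng = np.random.default_rng(1)
n, p = 60, 5
X = rng.standard_normal((n, p))
y = X @ np.array([1.0, -0.5, 0.8, -0.2, 0.3]) + 0.1 * rng.standard_normal(n)

step = 1.0 / np.linalg.norm(X, 2) ** 2       # 1 / Lipschitz constant of the loss
b = np.linalg.lstsq(X, y, rcond=None)[0]     # start at the unconstrained solution
for rho in [0.1, 1.0, 10.0]:                 # increasing penalty constants
    for _ in range(500):                     # proximal gradient iterations
        grad = X.T @ (X @ b - y)
        b = prox_hinge(b - step * grad, step * rho)
    print(f"rho={rho:5.1f}  min coefficient = {b.min():+.4f}")
```

For large enough finite rho the printed minimum coefficient reaches zero rather than merely approaching it, which is the defining property of exact penalization that the abstract contrasts with classical squared penalties.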
Localized Lasso for High-Dimensional Regression
We introduce the localized Lasso, which is suited for learning models that
are both interpretable and have a high predictive power in problems with high
dimensionality and small sample size (n ≪ d). More specifically, we consider a
function defined by local sparse models, one at each data point. We introduce
sample-wise network regularization to borrow strength across the models, and
sample-wise exclusive group sparsity (a.k.a., the ℓ1,2-norm) to introduce
diversity into the choice of feature sets in the local models. The local models
are interpretable in terms of similarity of their sparsity patterns. The cost
function is convex, and thus has a globally optimal solution. Moreover, we
propose a simple yet efficient iterative least-squares based optimization
procedure for the localized Lasso, which does not need a tuning parameter, and
is guaranteed to converge to a globally optimal solution. The solution is
empirically shown to outperform alternatives on both simulated data and
genomic personalized medicine data.
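As a concrete rendering of the cost the abstract describes, the hedged sketch below evaluates a localized-lasso-style objective: a per-sample least-squares fit, a network term tying connected samples' models together, and an exclusive ℓ1,2 sparsity term. The function and variable names (W, R, lam_net, lam_exc) are illustrative, and the paper's iterative least-squares optimizer is not reproduced here.

```python
# Objective evaluation only: one weight vector per sample, network
# regularization, and exclusive group (l_{1,2}) sparsity. Illustrative names.
import numpy as np

def localized_lasso_objective(W, X, y, R, lam_net, lam_exc):
    """W: (n, d) per-sample weights; R: (n, n) nonnegative graph weights."""
    # Per-sample fit: each y_i is predicted by its own local model w_i.
    fit = 0.5 * np.sum((y - np.einsum("ij,ij->i", X, W)) ** 2)
    # Network regularization: pull connected samples' models together.
    diffs = W[:, None, :] - W[None, :, :]            # (n, n, d) pairwise w_i - w_j
    net = 0.5 * np.sum(R * np.linalg.norm(diffs, axis=2))
    # Exclusive group sparsity: squared l1 norm of each local model.
    exc = np.sum(np.linalg.norm(W, ord=1, axis=1) ** 2)
    return fit + lam_net * net + lam_exc * exc

# Tiny usage example with random data and a random symmetric graph.
rng = np.random.default_rng(2)
n, d = 8, 4
X, y = rng.standard_normal((n, d)), rng.standard_normal(n)
W = rng.standard_normal((n, d))
A = (rng.random((n, n)) < 0.3).astype(float)
R = np.triu(A, 1) + np.triu(A, 1).T                  # symmetric, zero diagonal
print(localized_lasso_objective(W, X, y, R, lam_net=0.5, lam_exc=0.1))
```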