Search CORE

540,019 research outputs found

Convex relaxation of mixture regression with efficient algorithms

Author: Caetano TS
Lim J
Quadrianto N
Schuurmans D
Publication venue: Curran Associates, Inc.
Publication date: 01/01/2009
Field of study

We develop a convex relaxation of maximum a posteriori estimation of a mixture of regression models. Although our relaxation involves a semidefinite matrix variable, we reformulate the problem to eliminate the need for general semidefinite programming. In particular, we provide two reformulations that admit fast algorithms. The first is a max-min spectral reformulation exploiting quasi-Newton descent. The second is a min-min reformulation consisting of fast alternating steps of closed-form updates. We evaluate the methods against Expectation-Maximization in a real problem of motion segmentation from video data

CiteSeerX

Sussex Research Online

CUED - Cambridge University Engineering Department

Sketch-based Randomized Algorithms for Dynamic Graph Regression

Author: Chehreghani Mostafa Haghir
Publication venue
Publication date: 04/06/2019
Field of study

A well-known problem in data science and machine learning is {\em linear regression}, which is recently extended to dynamic graphs. Existing exact algorithms for updating the solution of dynamic graph regression problem require at least a linear time (in terms of

n

: the size of the graph). However, this time complexity might be intractable in practice. In the current paper, we utilize {\em subsampled randomized Hadamard transform} and \textsf{CountSketch} to propose the first randomized algorithms. Suppose that we are given an

n\times m

matrix embedding

M

of the graph, where

m \ll n

. Let

r

be the number of samples required for a guaranteed approximation error, which is a sublinear function of

n

. Our first algorithm reduces time complexity of pre-processing to

O(n(m + 1) + 2n(m + 1) \log_2(r + 1) + rm^2)

. Then after an edge insertion or an edge deletion, it updates the approximate solution in

O(rm)

time. Our second algorithm reduces time complexity of pre-processing to

O \left( nnz(M) + m^3 \epsilon^{-2} \log^7(m/\epsilon) \right)

, where

nnz(M)

is the number of nonzero elements of

M

. Then after an edge insertion or an edge deletion or a node insertion or a node deletion, it updates the approximate solution in

O(qm)

time, with

q=O\left(\frac{m^2}{\epsilon^2} \log^6(m/\epsilon) \right)

. Finally, we show that under some assumptions, if

\ln n < \epsilon^{-1}

our first algorithm outperforms our second algorithm and if

\ln n \geq \epsilon^{-1}

our second algorithm outperforms our first algorithm

arXiv.org e-Print Archive

Regularization Paths for Generalized Linear Models via Coordinate Descent

Author: Jerome H. Friedman
Rob Tibshirani
Trevor Hastie
Publication venue
Publication date
Field of study

We develop fast algorithms for estimation of generalized linear models with convex penalties. The models include linear regression, two-class logistic regression, and multi- nomial regression problems while the penalties include Ã¢ÂÂ_1 (the lasso), Ã¢ÂÂ_2 (ridge regression) and mixtures of the two (the elastic net). The algorithms use cyclical coordinate descent, computed along a regularization path. The methods can handle large problems and can also deal efficiently with sparse features. In comparative timings we find that the new algorithms are considerably faster than competing methods.

Research Papers in Economics