2 research outputs found

    Weighted SGD for $\ell_p$ Regression with Randomized Preconditioning

    In recent years, stochastic gradient descent (SGD) methods and randomized linear algebra (RLA) algorithms have been applied to many large-scale problems in machine learning and data analysis. We aim to bridge the gap between these two methods in solving constrained overdetermined linear regression problems---e.g., $\ell_2$ and $\ell_1$ regression problems. We propose a hybrid algorithm named pwSGD that uses RLA techniques for preconditioning and constructing an importance sampling distribution, and then performs an SGD-like iterative process with weighted sampling on the preconditioned system. We prove that pwSGD inherits faster convergence rates that depend only on the lower dimension of the linear system, while maintaining low computational complexity. In particular, when solving $\ell_1$ regression of size $n$ by $d$, pwSGD returns an approximate solution with $\epsilon$ relative error in the objective value in $\mathcal{O}(\log n \cdot \text{nnz}(A) + \text{poly}(d)/\epsilon^2)$ time. This complexity is uniformly better than that of RLA methods in terms of both $\epsilon$ and $d$ when the problem is unconstrained. For $\ell_2$ regression, pwSGD returns an approximate solution with $\epsilon$ relative error in the objective value and in the solution vector measured in prediction norm in $\mathcal{O}(\log n \cdot \text{nnz}(A) + \text{poly}(d)\log(1/\epsilon)/\epsilon)$ time. We also provide lower bounds on the coreset complexity for more general regression problems, indicating that new ideas will still be needed to extend similar RLA preconditioning ideas to weighted SGD algorithms for more general regression problems. Finally, the effectiveness of such algorithms is illustrated numerically on both synthetic and real datasets. Comment: A conference version of this paper appears under the same title in Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, Arlington, VA, 201
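    To make the precondition-then-sample recipe concrete, below is a minimal Python sketch of the pwSGD idea for unconstrained $\ell_2$ regression. It is illustrative only, not the paper's exact algorithm: the function name pwsgd_l2 and all parameters are ours, a dense Gaussian sketch stands in for an nnz-time embedding, and the step size is a generic $1/\sqrt{t}$ schedule.

```python
import numpy as np

def pwsgd_l2(A, b, n_iters=5000, sketch_rows=None, step0=1.0, seed=0):
    # Illustrative sketch of the pwSGD recipe for unconstrained l2 regression.
    # NOT the paper's exact algorithm: a dense Gaussian sketch replaces the
    # nnz-time embedding, and the step-size schedule is a generic 1/sqrt(t).
    rng = np.random.default_rng(seed)
    n, d = A.shape
    s = sketch_rows or 4 * d

    # 1) RLA preconditioning: QR of the small sketch S @ A yields R such that
    #    A @ inv(R) is nearly orthonormal (well-conditioned).
    S = rng.standard_normal((s, n)) / np.sqrt(s)
    _, R = np.linalg.qr(S @ A)
    AR = A @ np.linalg.inv(R)                     # preconditioned system, n x d

    # 2) Importance sampling: squared row norms of A @ inv(R) approximate
    #    the leverage scores of A.
    probs = np.einsum('ij,ij->i', AR, AR)
    probs /= probs.sum()

    # 3) Weighted SGD on min_y ||AR y - b||^2, sampling rows by probs and
    #    reweighting each gradient by 1/probs[i] so the estimate is unbiased.
    y = np.zeros(d)
    for t in range(1, n_iters + 1):
        i = rng.choice(n, p=probs)
        grad = ((AR[i] @ y - b[i]) / probs[i]) * AR[i]
        y -= (step0 / np.sqrt(t)) * grad
    return np.linalg.solve(R, y)                  # undo preconditioning: x = inv(R) y
```

    Because $AR^{-1}$ is well-conditioned, the SGD phase's convergence depends on $d$ rather than on the conditioning of the original $A$, which is the intuition behind the $\text{poly}(d)$ terms in the bounds above.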

    Aligning Points to Lines: Provable Approximations

    We suggest a new optimization technique for minimizing the sum $\sum_{i=1}^n f_i(x)$ of $n$ non-convex real functions that satisfy a property we call piecewise log-Lipschitz. We do this by forging links between techniques in computational geometry, combinatorics, and convex optimization. As an example application, we provide the first constant-factor approximation algorithms whose running time is polynomial in $n$ for the fundamental problem of \emph{Points-to-Lines alignment}: given $n$ points $p_1,\cdots,p_n$ and $n$ lines $\ell_1,\cdots,\ell_n$ in the plane and $z>0$, compute the matching $\pi:[n]\to[n]$ and alignment (rotation matrix $R$ and translation vector $t$) that minimize the sum of Euclidean distances $\sum_{i=1}^n \mathrm{dist}(Rp_i-t,\ell_{\pi(i)})^z$ between each point and its corresponding line. This problem is non-trivial even if $z=1$ and the matching $\pi$ is given. If $\pi$ is given, the running time of our algorithms is $O(n^3)$, and even near-linear in $n$ using core-sets that support streaming, dynamic, and distributed parallel computations in poly-logarithmic update time. Generalizations for handling, e.g., outliers or pseudo-distances such as $M$-estimators are also provided. Experimental results and open-source code show that our provable algorithms improve on existing heuristics in practice as well. A companion demonstration video in the context of Augmented Reality shows how such algorithms may be used in real-time systems.
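    For intuition about the objective being minimized, the short Python sketch below (all names are our own; the line representation as an anchor point plus unit direction is an assumption for illustration) just evaluates $\sum_{i=1}^n \mathrm{dist}(Rp_i-t,\ell_{\pi(i)})^z$ for one candidate alignment. The paper's actual contribution, the provable constant-factor search over alignments, is not reproduced here.

```python
import numpy as np

def alignment_cost(points, lines, theta, t, pi, z=1.0):
    # Objective of Points-to-Lines alignment for one candidate solution.
    # points: (n, 2) array of p_i; lines: list of (q_j, u_j) with q_j a point
    # on line l_j and u_j its UNIT direction; pi[i] = line matched to point i.
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, -s], [s, c]])               # rotation matrix from angle theta
    total = 0.0
    for i, p in enumerate(points):
        q, u = lines[pi[i]]
        v = R @ p - t - q                         # moved point relative to line anchor
        dist = abs(v[0] * u[1] - v[1] * u[0])     # perpendicular distance via 2D cross product
        total += dist ** z
    return total
```

    Evaluating this cost for a fixed candidate is easy; the hardness lies in minimizing it jointly over the rotation, translation, and matching, which is what the constant-factor approximation algorithms address.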