Search CORE

219 research outputs found

Approximating Edit Distance Within Constant Factor in Truly Sub-Quadratic Time

Author: Chakraborty Diptarka
Das Debarati
Goldenberg Elazar
Koucky Michal
Saks Michael
Publication venue
Publication date: 08/10/2018
Field of study

Edit distance is a measure of similarity of two strings based on the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. The edit distance can be computed exactly using a dynamic programming algorithm that runs in quadratic time. Andoni, Krauthgamer and Onak (2010) gave a nearly linear time algorithm that approximates edit distance within approximation factor

\text{poly}(\log n)

. In this paper, we provide an algorithm with running time

\tilde{O}(n^{2-2/7})

that approximates the edit distance within a constant factor

arXiv.org e-Print Archive

Copenhagen University Research Information System

Training Overparametrized Neural Networks in Sublinear Time

Author: Deng Yichuan
Hu Hang
Song Zhao
Weinstein Omri
Zhuo Danyang
Publication venue
Publication date: 07/02/2024
Field of study

The success of deep learning comes at a tremendous computational and energy cost, and the scalability of training massively overparametrized neural networks is becoming a real barrier to the progress of artificial intelligence (AI). Despite the popularity and low cost-per-iteration of traditional backpropagation via gradient decent, stochastic gradient descent (SGD) has prohibitive convergence rate in non-convex settings, both in theory and practice. To mitigate this cost, recent works have proposed to employ alternative (Newton-type) training methods with much faster convergence rate, albeit with higher cost-per-iteration. For a typical neural network with

m=\mathrm{poly}(n)

parameters and input batch of

n

datapoints in

\mathbb{R}^d

, the previous work of [Brand, Peng, Song, and Weinstein, ITCS'2021] requires

\sim mnd + n^3

time per iteration. In this paper, we present a novel training method that requires only

m^{1-\alpha} n d + n^3

amortized time in the same overparametrized regime, where

\alpha \in (0.01,1)

is some fixed constant. This method relies on a new and alternative view of neural networks, as a set of binary search trees, where each iteration corresponds to modifying a small subset of the nodes in the tree. We believe this view would have further applications in the design and analysis of deep neural networks (DNNs)

arXiv.org e-Print Archive

Space- and Time-Efficient Algorithm for Maintaining Dense Subgraphs on One-Pass Dynamic Streams

Author: Epasto A.
Gibson D.
Goldberg A. V.
Lawler E.
Matula D.
Publication venue
Publication date: 01/01/2015
Field of study

While in many graph mining applications it is crucial to handle a stream of updates efficiently in terms of {\em both} time and space, not much was known about achieving such type of algorithm. In this paper we study this issue for a problem which lies at the core of many graph mining applications called {\em densest subgraph problem}. We develop an algorithm that achieves time- and space-efficiency for this problem simultaneously. It is one of the first of its kind for graph problems to the best of our knowledge. In a graph

G = (V, E)

, the "density" of a subgraph induced by a subset of nodes

S \subseteq V

is defined as

|E(S)|/|S|

, where

E(S)

is the set of edges in

E

with both endpoints in

S

. In the densest subgraph problem, the goal is to find a subset of nodes that maximizes the density of the corresponding induced subgraph. For any

\epsilon>0

, we present a dynamic algorithm that, with high probability, maintains a

(4+\epsilon)

-approximation to the densest subgraph problem under a sequence of edge insertions and deletions in a graph with

n

nodes. It uses

\tilde O(n)

space, and has an amortized update time of

\tilde O(1)

and a query time of

\tilde O(1)

. Here,

\tilde O

hides a O(\poly\log_{1+\epsilon} n) term. The approximation ratio can be improved to

(2+\epsilon)

at the cost of increasing the query time to

\tilde O(n)

. It can be extended to a

(2+\epsilon)

-approximation sublinear-time algorithm and a distributed-streaming algorithm. Our algorithm is the first streaming algorithm that can maintain the densest subgraph in {\em one pass}. The previously best algorithm in this setting required

O(\log n)

passes [Bahmani, Kumar and Vassilvitskii, VLDB'12]. The space required by our algorithm is tight up to a polylogarithmic factor.Comment: A preliminary version of this paper appeared in STOC 201

arXiv.org e-Print Archive

Publikationer från KTH

CiteSeerX

Crossref

Warwick Research Archives Portal Repository

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Algorithmic linear dimension reduction in the l_1 norm for sparse vectors

Author: Gilbert A. C.
Strauss M. J.
Tropp J. A.
Vershynin R.
Publication venue
Publication date: 18/08/2006
Field of study

This paper develops a new method for recovering m-sparse signals that is simultaneously uniform and quick. We present a reconstruction algorithm whose run time, O(m log^2(m) log^2(d)), is sublinear in the length d of the signal. The reconstruction error is within a logarithmic factor (in m) of the optimal m-term approximation error in l_1. In particular, the algorithm recovers m-sparse signals perfectly and noisy signals are recovered with polylogarithmic distortion. Our algorithm makes O(m log^2 (d)) measurements, which is within a logarithmic factor of optimal. We also present a small-space implementation of the algorithm. These sketching techniques and the corresponding reconstruction algorithms provide an algorithmic dimension reduction in the l_1 norm. In particular, vectors of support m in dimension d can be linearly embedded into O(m log^2 d) dimensions with polylogarithmic distortion. We can reconstruct a vector from its low-dimensional sketch in time O(m log^2(m) log^2(d)). Furthermore, this reconstruction is stable and robust under small perturbations

arXiv.org e-Print Archive

Caltech Authors