Search CORE

352 research outputs found

Sketch-based Randomized Algorithms for Dynamic Graph Regression

Author: Chehreghani Mostafa Haghir
Publication venue
Publication date: 04/06/2019
Field of study

A well-known problem in data science and machine learning is {\em linear regression}, which is recently extended to dynamic graphs. Existing exact algorithms for updating the solution of dynamic graph regression problem require at least a linear time (in terms of

n

: the size of the graph). However, this time complexity might be intractable in practice. In the current paper, we utilize {\em subsampled randomized Hadamard transform} and \textsf{CountSketch} to propose the first randomized algorithms. Suppose that we are given an

n\times m

matrix embedding

M

of the graph, where

m \ll n

. Let

r

be the number of samples required for a guaranteed approximation error, which is a sublinear function of

n

. Our first algorithm reduces time complexity of pre-processing to

O(n(m + 1) + 2n(m + 1) \log_2(r + 1) + rm^2)

. Then after an edge insertion or an edge deletion, it updates the approximate solution in

O(rm)

time. Our second algorithm reduces time complexity of pre-processing to

O \left( nnz(M) + m^3 \epsilon^{-2} \log^7(m/\epsilon) \right)

, where

nnz(M)

is the number of nonzero elements of

M

. Then after an edge insertion or an edge deletion or a node insertion or a node deletion, it updates the approximate solution in

O(qm)

time, with

q=O\left(\frac{m^2}{\epsilon^2} \log^6(m/\epsilon) \right)

. Finally, we show that under some assumptions, if

\ln n < \epsilon^{-1}

our first algorithm outperforms our second algorithm and if

\ln n \geq \epsilon^{-1}

our second algorithm outperforms our first algorithm

arXiv.org e-Print Archive

Improved analysis of the subsampled randomized Hadamard transform

Author: Tropp Joel A.
Publication venue
Publication date: 01/01/2011
Field of study

This paper presents an improved analysis of a structured dimension-reduction map called the subsampled randomized Hadamard transform. This argument demonstrates that the map preserves the Euclidean geometry of an entire subspace of vectors. The new proof is much simpler than previous approaches, and it offers---for the first time---optimal constants in the estimate on the number of dimensions required for the embedding.Comment: 8 pages. To appear, Advances in Adaptive Data Analysis, special issue "Sparse Representation of Data and Images." v2--v4 include minor correction

arXiv.org e-Print Archive

CiteSeerX

Caltech Authors

Randomized Dynamic Mode Decomposition

Author: Brunton Steven L.
Erichson N. Benjamin
Kutz J. Nathan
Mathelin Lionel
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 26/11/2019
Field of study

This paper presents a randomized algorithm for computing the near-optimal low-rank dynamic mode decomposition (DMD). Randomized algorithms are emerging techniques to compute low-rank matrix approximations at a fraction of the cost of deterministic algorithms, easing the computational challenges arising in the area of `big data'. The idea is to derive a small matrix from the high-dimensional data, which is then used to efficiently compute the dynamic modes and eigenvalues. The algorithm is presented in a modular probabilistic framework, and the approximation quality can be controlled via oversampling and power iterations. The effectiveness of the resulting randomized DMD algorithm is demonstrated on several benchmark examples of increasing complexity, providing an accurate and efficient approach to extract spatiotemporal coherent structures from big data in a framework that scales with the intrinsic rank of the data, rather than the ambient measurement dimension. For this work we assume that the dynamics of the problem under consideration is evolving on a low-dimensional subspace that is well characterized by a fast decaying singular value spectrum

arXiv.org e-Print Archive

Optimal approximate matrix product in terms of stable rank

Author: Cohen Michael B.
Nelson Jelani
Woodruff David P.
Publication venue
Publication date: 01/01/2016
Field of study

We prove, using the subspace embedding guarantee in a black box way, that one can achieve the spectral norm guarantee for approximate matrix multiplication with a dimensionality-reducing map having

m = O(\tilde{r}/\varepsilon^2)

rows. Here

\tilde{r}

is the maximum stable rank, i.e. squared ratio of Frobenius and operator norms, of the two matrices being multiplied. This is a quantitative improvement over previous work of [MZ11, KVZ14], and is also optimal for any oblivious dimensionality-reducing map. Furthermore, due to the black box reliance on the subspace embedding property in our proofs, our theorem can be applied to a much more general class of sketching matrices than what was known before, in addition to achieving better bounds. For example, one can apply our theorem to efficient subspace embeddings such as the Subsampled Randomized Hadamard Transform or sparse subspace embeddings, or even with subspace embedding constructions that may be developed in the future. Our main theorem, via connections with spectral error matrix multiplication shown in prior work, implies quantitative improvements for approximate least squares regression and low rank approximation. Our main result has also already been applied to improve dimensionality reduction guarantees for

k

-means clustering [CEMMP14], and implies new results for nonparametric regression [YPW15]. We also separately point out that the proof of the "BSS" deterministic row-sampling result of [BSS12] can be modified to show that for any matrices

A, B

of stable rank at most

\tilde{r}

, one can achieve the spectral norm guarantee for approximate matrix multiplication of

A^T B

by deterministically sampling

O(\tilde{r}/\varepsilon^2)

rows that can be found in polynomial time. The original result of [BSS12] was for rank instead of stable rank. Our observation leads to a stronger version of a main theorem of [KMST10].Comment: v3: minor edits; v2: fixed one step in proof of Theorem 9 which was wrong by a constant factor (see the new Lemma 5 and its use; final theorem unaffected

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server