Answering Conjunctive Queries under Updates

Author: Abiteboul S.
Berkholz C.
Brault-Baron J.
Chen H.
Moret B. M. E.
Segoufin L.
Veldhuizen T. L.
Williams R.
Zeume T.
Publication date: 21/02/2017

We consider the task of enumerating and counting answers to $k$-ary conjunctive queries against relational databases that may be updated by inserting or deleting tuples. We exhibit a new notion of q-hierarchical conjunctive queries and show that these can be maintained efficiently in the following sense. During a linear time preprocessing phase, we can build a data structure that enables constant delay enumeration of the query results; and when the database is updated, we can update the data structure and restart the enumeration phase within constant time. For the special case of self-join free conjunctive queries we obtain a dichotomy: if a query is not q-hierarchical, then query enumeration with sublinear$^\ast$ delay and sublinear update time (and arbitrary preprocessing time) is impossible. For answering Boolean conjunctive queries and for the more general problem of counting the number of solutions of $k$-ary queries we obtain complete dichotomies: if the query's homomorphic core is q-hierarchical, then the size of the query result can be computed in linear time and maintained with constant update time. Otherwise, the size of the query result cannot be maintained with sublinear update time. All our lower bounds rely on the OMv-conjecture, a conjecture on the hardness of online matrix-vector multiplication that has recently emerged in the field of fine-grained complexity to characterise the hardness of dynamic problems. The lower bound for the counting problem additionally relies on the orthogonal vectors conjecture, which in turn is implied by the strong exponential time hypothesis.

$^\ast$) By sublinear we mean $O(n^{1-\varepsilon})$ for some $\varepsilon>0$, where $n$ is the size of the active domain of the current database.
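
To make "constant update time" for counting concrete, here is a minimal Python sketch for one illustrative query, Q(x, y) :- R(x), S(x, y). The query, the class name `DynamicCount`, and its method names are assumptions introduced here, not taken from the paper, and we have not checked the query against the paper's formal definition of q-hierarchical. The sketch only shows the flavour of the result: each single-tuple insert or delete adjusts the answer count with a constant number of hash-table operations.

```python
from collections import defaultdict


class DynamicCount:
    """Maintain |Q(D)| for Q(x, y) :- R(x), S(x, y) under single-tuple updates.

    Illustrative only: a hand-rolled structure for one example query,
    not the general data structure from the paper.
    """

    def __init__(self):
        self.r = set()               # current tuples of the unary relation R
        self.s = set()               # current tuples of the binary relation S
        self.deg = defaultdict(int)  # deg[x] = number of y with (x, y) in S
        self.count = 0               # current number of query answers

    def insert_r(self, x):
        if x not in self.r:
            self.r.add(x)
            self.count += self.deg[x]  # every S-tuple starting at x becomes an answer

    def delete_r(self, x):
        if x in self.r:
            self.r.remove(x)
            self.count -= self.deg[x]

    def insert_s(self, x, y):
        if (x, y) not in self.s:
            self.s.add((x, y))
            self.deg[x] += 1
            if x in self.r:
                self.count += 1  # (x, y) is a new answer only if R(x) holds

    def delete_s(self, x, y):
        if (x, y) in self.s:
            self.s.remove((x, y))
            self.deg[x] -= 1
            if x in self.r:
                self.count -= 1
```

The count is never recomputed from scratch: each update touches one degree counter and one membership test, which is expected constant time with hash-based sets and dictionaries.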

arXiv.org e-Print Archive

Crossref

Sketch-based Randomized Algorithms for Dynamic Graph Regression

Author: Chehreghani Mostafa Haghir
Publication date: 04/06/2019

A well-known problem in data science and machine learning is linear regression, which has recently been extended to dynamic graphs. Existing exact algorithms for updating the solution of the dynamic graph regression problem require at least linear time (in terms of $n$, the size of the graph). However, this time complexity might be intractable in practice. In the current paper, we utilize the subsampled randomized Hadamard transform and CountSketch to propose the first randomized algorithms. Suppose that we are given an $n\times m$ matrix embedding $M$ of the graph, where $m \ll n$. Let $r$ be the number of samples required for a guaranteed approximation error, which is a sublinear function of $n$. Our first algorithm reduces the time complexity of pre-processing to $O(n(m + 1) + 2n(m + 1) \log_2(r + 1) + rm^2)$. Then after an edge insertion or an edge deletion, it updates the approximate solution in $O(rm)$ time. Our second algorithm reduces the time complexity of pre-processing to $O \left( nnz(M) + m^3 \epsilon^{-2} \log^7(m/\epsilon) \right)$, where $nnz(M)$ is the number of nonzero elements of $M$. Then after an edge insertion, an edge deletion, a node insertion or a node deletion, it updates the approximate solution in $O(qm)$ time, with $q=O\left(\frac{m^2}{\epsilon^2} \log^6(m/\epsilon) \right)$. Finally, we show that under some assumptions, if $\ln n < \epsilon^{-1}$ our first algorithm outperforms our second algorithm, and if $\ln n \geq \epsilon^{-1}$ our second algorithm outperforms our first algorithm.
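
As a rough illustration of why CountSketch suits dynamic settings, the following Python sketch applies a generic CountSketch to a dense row-list matrix and patches the sketch when one row changes. This is the textbook CountSketch construction, not the paper's algorithm. The function names (`make_count_sketch`, `apply_sketch`, `update_row`) and the sketch size `r` used here are assumptions for illustration. The regression solve and the paper's actual update procedure are not shown.

```python
import random


def make_count_sketch(n, r, seed=0):
    """Draw a bucket in [0, r) and a sign in {-1, +1} for each of the n rows."""
    rng = random.Random(seed)
    buckets = [rng.randrange(r) for _ in range(n)]
    signs = [rng.choice((-1, 1)) for _ in range(n)]
    return buckets, signs


def apply_sketch(M, buckets, signs, r):
    """Compute S*M by adding each signed row of M into its bucket row.

    Only nonzero entries contribute to the arithmetic.
    """
    m = len(M[0])
    SM = [[0.0] * m for _ in range(r)]
    for i, row in enumerate(M):
        b, s = buckets[i], signs[i]
        for j, v in enumerate(row):
            if v:
                SM[b][j] += s * v
    return SM


def update_row(SM, M, i, new_row, buckets, signs):
    """Replace row i of M and patch the single affected row of S*M in O(m)."""
    b, s = buckets[i], signs[i]
    for j, v in enumerate(new_row):
        SM[b][j] += s * (v - M[i][j])
    M[i] = list(new_row)
```

Because each row of $M$ lands in exactly one bucket, changing one row of $M$ changes only one row of the sketched matrix, so the sketch can be kept current without re-sketching the whole matrix.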

arXiv.org e-Print Archive

A Randomized Sublinear Time Parallel GCD Algorithm for the EREW PRAM

Author: Bach
Beame
Bernstein
Borodin
Chor
Crandall
Davida
Greenlaw
Hildebrand
Jonathan P. Sorenson
Kannan
Karp
Meyer Eikenberry
Parsell
Purdom
Schönhage
Sedjelmaci
Sorenson
Tenenbaum
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010

We present a randomized parallel algorithm that computes the greatest common divisor of two integers of $n$ bits in length with probability $1-o(1)$ that takes $O(n \log\log n / \log n)$ expected time using $n^{6+\epsilon}$ processors on the EREW PRAM parallel model of computation. We believe this to be the first randomized sublinear time algorithm on the EREW PRAM for this problem.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Digital Commons @ Butler University