Approximate Clustering via Metric Partitioning
In this paper we consider two metric covering/clustering problems: the
\textit{Minimum Cost Covering Problem} (MCC) and $k$-clustering. In the MCC
problem, we are given two point sets $X$ (clients) and $Y$ (servers), and a
metric on $X \cup Y$. We would like to cover the clients by balls centered at
the servers. The objective function to minimize is the sum of the $\alpha$-th
powers of the radii of the balls. Here $\alpha \geq 1$ is a parameter of the
problem (but not of a problem instance). MCC is closely related to the
$k$-clustering problem. The main difference between $k$-clustering and MCC is
that in $k$-clustering one needs to select at most $k$ balls to cover the clients.
For any $\varepsilon > 0$, we describe quasi-polynomial time $(1+\varepsilon)$-approximation
algorithms for both of the problems. However, in the case of
$k$-clustering the algorithm uses $(1+\varepsilon)k$ balls. Prior to our work, a $3^{\alpha}$
and a $c^{\alpha}$ approximation were achieved by
polynomial-time algorithms for MCC and $k$-clustering, respectively, where $c > 1$ is an absolute constant. These two problems are thus interesting examples of
metric covering/clustering problems that admit a $(1+\varepsilon)$-approximation
(using $(1+\varepsilon)k$ balls in the case of $k$-clustering), if one is willing to
settle for quasi-polynomial time. In contrast, for the variant of MCC where
$\alpha$ is part of the input, we show under standard assumptions that no
polynomial-time algorithm can achieve an approximation factor better than
$O(\log |X|)$ for $\alpha \geq \log |X|$.
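To make the MCC objective concrete, here is a small Python sketch that evaluates the cost of a candidate solution in the Euclidean plane (an illustrative special case of a general metric). The function and variable names are ours, not from the paper.

```python
import math

def mcc_cost(clients, servers, radii, alpha=2.0):
    """Evaluate the MCC objective for a candidate solution: the sum of the
    alpha-th powers of the ball radii, after verifying that every client
    lies in some ball. Points are in the Euclidean plane, a special case
    of a general metric; all names here are illustrative."""
    def dist(p, q):
        return math.hypot(p[0] - q[0], p[1] - q[1])

    for c in clients:
        if not any(dist(c, s) <= r + 1e-9 for s, r in zip(servers, radii)):
            raise ValueError(f"client {c} is not covered")
    return sum(r ** alpha for r in radii)

# two balls: radius 0.5 around (0.5, 0) covers the first two clients,
# radius 0 around (4, 0) covers the last one; cost = 0.5^2 + 0^2 = 0.25
clients = [(0, 0), (1, 0), (4, 0)]
servers = [(0.5, 0), (4, 0)]
print(mcc_cost(clients, servers, [0.5, 0.0], alpha=2.0))  # 0.25
```

Raising $\alpha$ penalizes large balls more heavily, which is why the hardness of the problem depends on whether $\alpha$ is fixed or part of the input.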
The Parameterized Complexity of Finding a 2-Sphere in a Simplicial Complex
We consider the problem of finding a subcomplex $K'$ of a simplicial complex $K$ such that $K'$ is homeomorphic to the 2-dimensional sphere, $S^2$. We study two variants of this problem. The first asks if there exists such a $K'$ with at most $k$ triangles, and we show that this variant is W[1]-hard and, assuming ETH, admits no $n^{o(\sqrt{k})}$ time algorithm. We also give an algorithm that is tight with respect to this lower bound. The second problem is the dual of the first, and asks if $K'$ can be found by removing at most $k$ triangles from $K$. This variant has an immediate $O(3^k \mathrm{poly}(|K|))$ time algorithm, and we show that it admits a polynomial kernelization to $O(k^2)$ triangles, as well as a polynomial compression to a weighted version with bit-size $O(k \log k)$.
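As a naive companion to the problem statement, the following Python sketch tests whether a given set of triangles triangulates $S^2$, using the classification of closed surfaces (a connected closed surface with Euler characteristic 2 is a 2-sphere). This is an illustrative brute-force recognizer of the target object, not the paper's parameterized algorithm; all names are ours.

```python
from collections import defaultdict

def is_two_sphere(triangles):
    """Naive check that a set of triangles (triples of vertex ids) forms a
    triangulated 2-sphere: every edge lies in exactly two triangles, every
    vertex link is a single cycle, the complex is connected, and the Euler
    characteristic V - E + F equals 2 (classification of closed surfaces)."""
    tris = [tuple(sorted(t)) for t in triangles]
    if len(set(tris)) != len(tris):
        return False
    edge_count = defaultdict(int)
    link = defaultdict(list)      # vertex -> opposite edges of its triangles
    for a, b, c in tris:
        for e in ((a, b), (a, c), (b, c)):
            edge_count[e] += 1
        link[a].append((b, c))
        link[b].append((a, c))
        link[c].append((a, b))
    if any(m != 2 for m in edge_count.values()):
        return False
    for v, segments in link.items():
        adj = defaultdict(list)
        for x, y in segments:
            adj[x].append(y)
            adj[y].append(x)
        if any(len(ns) != 2 for ns in adj.values()):
            return False
        start = segments[0][0]    # walk the link; it must be one cycle
        prev, cur, visited = None, start, 1
        while True:
            nxt = adj[cur][0] if adj[cur][0] != prev else adj[cur][1]
            prev, cur = cur, nxt
            if cur == start:
                break
            visited += 1
        if visited != len(adj):
            return False
    parent = {v: v for v in link}  # union-find for connectivity
    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v
    for a, b, c in tris:
        parent[find(b)] = find(a)
        parent[find(c)] = find(a)
    if len({find(v) for v in parent}) != 1:
        return False
    return len(link) - len(edge_count) + len(tris) == 2

# boundary of a tetrahedron: the smallest triangulation of S^2
tetra = [(1, 2, 3), (1, 2, 4), (1, 3, 4), (2, 3, 4)]
print(is_two_sphere(tetra))  # True
```

The hard part of the problems above is of course the search over subcomplexes, not this recognition step.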
Compressibility-Aware Quantum Algorithms on Strings
Sublinear time quantum algorithms have been established for many fundamental
problems on strings. This work demonstrates that new, faster quantum algorithms
can be designed when the string is highly compressible. We focus on two popular
and theoretically significant compression algorithms -- the Lempel-Ziv77
algorithm (LZ77) and the Run-length-encoded Burrows-Wheeler Transform (RL-BWT),
and obtain the results below.
We first provide a quantum algorithm running in $\tilde{O}(\sqrt{zn})$ time
for finding the LZ77 factorization of an input string $T[1..n]$ with $z$
factors. Combined with multiple existing results, this yields an
$\tilde{O}(\sqrt{rn})$ time quantum algorithm for finding the RL-BWT encoding
with $r$ BWT runs. Note that $r = \tilde{\Theta}(z)$. We complement these
results with lower bounds proving that our algorithms are optimal (up to
polylog factors).
Next, we study the problem of compressed indexing, where we provide a
$\tilde{O}(\sqrt{zn})$ time quantum algorithm for constructing a recently
designed $\tilde{O}(z)$ space structure with equivalent capabilities as the
suffix tree. This data structure is then applied to numerous problems to obtain
sublinear time quantum algorithms when the input is highly compressible. For
example, we show that the longest common substring of two strings of total
length $n$ can be computed in $\tilde{O}(\sqrt{zn})$ time, where $z$ is the
number of factors in the LZ77 factorization of their concatenation. This beats
the best known $\tilde{O}(n^{2/3})$ time quantum algorithm when $z$ is
sufficiently small.
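For reference, the object the quantum algorithm computes can be pinned down with a classical sketch: a greedy LZ77 factorization (self-referential variant), written here in quadratic time in Python purely to make "factor" and the parameter $z$ concrete. The naming is ours; real LZ77 implementations run in linear time.

```python
def lz77_factorize(s):
    """Greedy LZ77 factorization (self-referential variant), written in
    quadratic time for clarity: each factor is the longest prefix of the
    remaining suffix that also occurs starting at an earlier position,
    or a single fresh character. z is the number of factors returned."""
    factors = []
    i, n = 0, len(s)
    while i < n:
        best = 0
        for j in range(i):                 # try every earlier start
            length = 0
            while i + length < n and s[j + length] == s[i + length]:
                length += 1
            best = max(best, length)
        if best == 0:
            factors.append(s[i])           # fresh character
            i += 1
        else:
            factors.append(s[i:i + best])  # copied factor
            i += best
    return factors

print(lz77_factorize("abababab"))  # ['a', 'b', 'ababab']
```

Highly repetitive strings have tiny $z$ relative to $n$, which is exactly the regime where the compressibility-aware bounds above improve on the general-purpose ones.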
Persistent Laplacians: properties, algorithms and implications
We present a thorough study of the theoretical properties and devise
efficient algorithms for the \emph{persistent Laplacian}, an extension of the
standard combinatorial Laplacian to the setting of pairs (or, in more
generality, sequences) of simplicial complexes $K \hookrightarrow L$, which was
recently introduced by Wang, Nguyen, and Wei. In particular, in analogy with
the non-persistent case, we first prove that the nullity of the $q$-th
persistent Laplacian $\Delta_q^{K,L}$ equals the $q$-th persistent Betti number
$\beta_q^{K,L}$ of the inclusion $K \hookrightarrow L$. We then present an initial algorithm
for finding a matrix representation of $\Delta_q^{K,L}$, which itself helps
interpret the persistent Laplacian. We exhibit a novel relationship between the
persistent Laplacian and the notion of Schur complement of a matrix which has
several important implications. In the graph case, it both uncovers a link with
the notion of effective resistance and leads to a persistent version of the
Cheeger inequality. This relationship also yields an additional, very simple
algorithm for finding (a matrix representation of) the $q$-th persistent
Laplacian, which in turn leads to a novel and fundamentally different algorithm
for computing the $q$-th persistent Betti number for a pair $K \hookrightarrow L$, which can
be significantly more efficient than standard algorithms. Finally, we study
persistent Laplacians for simplicial filtrations and present novel stability
results for their eigenvalues. Our work brings methods from spectral graph
theory, circuit theory, and persistent homology together with a topological
view of the combinatorial Laplacian on simplicial complexes.
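The graph case of the Schur-complement relationship can be illustrated in a few lines of NumPy. The sketch below is our own naming and follows only the spirit of the paper's treatment: we build the 0-th persistent Laplacian of a graph pair $K \subseteq L$ by eliminating the vertices outside $K$ from the Laplacian of $L$ (a pseudoinverse stands in for a generalized inverse of the possibly singular inner block), and check that its nullity matches the 0-th persistent Betti number.

```python
import numpy as np

def laplacian(n, edges):
    """Combinatorial 0-th (graph) Laplacian on n vertices."""
    L = np.zeros((n, n))
    for u, v in edges:
        L[u, u] += 1.0
        L[v, v] += 1.0
        L[u, v] -= 1.0
        L[v, u] -= 1.0
    return L

def persistent_laplacian_0(n, edges_L, verts_K):
    """0-th persistent Laplacian of a graph pair K <= L via the Schur
    complement: eliminate the vertices of L outside K from L's Laplacian.
    A pseudoinverse stands in for the (possibly singular) inner block."""
    L = laplacian(n, edges_L)
    inside = sorted(verts_K)
    outside = [v for v in range(n) if v not in verts_K]
    A = L[np.ix_(inside, inside)]
    B = L[np.ix_(inside, outside)]
    D = L[np.ix_(outside, outside)]
    return A - B @ np.linalg.pinv(D) @ B.T

def nullity(M, tol=1e-9):
    """dim ker M, counted as near-zero eigenvalues of a symmetric matrix."""
    return int(np.sum(np.abs(np.linalg.eigvalsh(M)) < tol))

# K = vertices {0, 2} (no edges), L = the path 0-1-2: both vertices of K
# land in one component of L, so the 0-th persistent Betti number is 1,
# and so is the nullity of the persistent Laplacian.
S = persistent_laplacian_0(3, [(0, 1), (1, 2)], {0, 2})
print(nullity(S))  # 1
```

For graphs this elimination is exactly Kron reduction from circuit theory, which is the source of the effective-resistance connection mentioned above.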
Space Efficient Data Structures and Algorithms in the Word-RAM Model
In modern computation the volume of datasets has increased dramatically.
Since the majority of these datasets are stored in internal memory, reducing their storage requirements is an important research topic.
One way of reducing storage is to use succinct and compact data structures, which maintain the data in compressed form together with auxiliary structures that allow efficient access and queries.
In this thesis we study space-efficient data structures for various combinatorial objects.
We focus on succinct and compact data structures.
Succinct data structures are data structures whose size is within the information-theoretic lower bound plus a lower-order term,
whereas compact data structures are data structures whose size is within a constant factor of the information-theoretic lower bound.
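To ground the "data plus small auxiliary structure" idea, here is a toy Python rank-support bitvector: it keeps the raw bits and stores one prefix popcount per 64-bit block. All names are ours; a real succinct bitvector packs bits into machine words and uses two block levels to achieve o(n) extra bits with O(1)-time rank.

```python
class RankBitVector:
    """Toy rank-support bitvector: raw bits plus one prefix popcount per
    64-bit block. A real succinct structure packs the bits into machine
    words and adds a second block level to reach o(n) extra bits with
    O(1) query time; this sketch only conveys the idea."""
    BLOCK = 64

    def __init__(self, bits):
        self.bits = list(bits)
        self.super = [0]           # prefix popcounts at block boundaries
        acc = 0
        for i, b in enumerate(self.bits):
            acc += b
            if (i + 1) % self.BLOCK == 0:
                self.super.append(acc)

    def rank1(self, i):
        """Number of 1s in bits[0:i]: one lookup plus an in-block scan."""
        blk = i // self.BLOCK
        return self.super[blk] + sum(self.bits[blk * self.BLOCK:i])

bv = RankBitVector([1, 0, 1, 1] * 40)   # 160 bits, 120 ones
print(bv.rank1(160))  # 120
```

Rank (and its inverse, select) are the workhorse queries underlying most succinct representations of trees, graphs, and text indexes.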