61 research outputs found

    Approximate Online Pattern Matching in Sublinear Time


    Sparse Weight Tolerant Subgraph for Single Source Shortest Path

    In this paper we address the problem of computing a sparse subgraph of any weighted directed graph such that the exact distances from a designated source vertex to all other vertices are preserved under bounded weight increment. Finding a small subgraph that preserves distances between every pair of vertices is a well-studied problem. Since in the real world any network is prone to failures, it is natural to study the fault-tolerant version of the above problem. Unfortunately, it turns out that such a sparse subgraph may not always exist even under a single edge failure [Demetrescu et al. '08]. However, in real applications a link (edge) in a network does not always become completely faulty. Instead, some links may become more congested, which can be captured by increasing the weights of the corresponding edges. Thus it makes sense to construct a sparse distance-preserving subgraph under this weight-increment model, where the total increase in weight over the whole network (graph) is bounded by some parameter k. To the best of our knowledge this problem has not been studied so far. In this paper we show that given any weighted directed graph with n vertices and a source vertex, one can construct a subgraph of size at most e * (k-1)! * 2^k * n that preserves the distances between the source and all other vertices as long as the total weight increment is bounded by k, edge weights are integers (possibly negative), and each edge weight can only be increased by a positive integer. Next we show a lower bound of c * 2^k * n, for some constant c >= 5/4, on the size of such a subgraph. We further argue that the restrictions of integral weights and integral weight increments are essential: if either one is removed, we may need to store Omega(n^2) edges to preserve the distances.
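    The weight-increment model above can be illustrated on a toy graph (a hypothetical sketch, not the paper's construction; the graph and routes are invented for illustration). A subgraph that kept only one of two equally short source-to-sink routes would stop preserving the distance once an adversary spends part of its budget k on that route, which is why redundant routes must survive in the subgraph:

```python
import heapq

def dijkstra(adj, src):
    # Single-source shortest distances in a weighted digraph.
    dist = {v: float('inf') for v in adj}
    dist[src] = 0
    pq = [(0, src)]
    while pq:
        d, u = heapq.heappop(pq)
        if d > dist[u]:
            continue
        for v, w in adj[u]:
            if d + w < dist[v]:
                dist[v] = d + w
                heapq.heappush(pq, (d + w, v))
    return dist

# Toy digraph: two s->t routes of equal length 2.  An adversary with
# budget k = 1 can add 1 to any edge; keeping only the route through
# 'a' would no longer preserve d(s, t) after that increment.
G = {'s': [('a', 1), ('b', 1)], 'a': [('t', 1)], 'b': [('t', 1)], 't': []}
print(dijkstra(G, 's')['t'])            # 2, via either route

# After incrementing the weight of (s, a) by 1, the distance is still
# preserved -- but only because the subgraph also kept the b-route.
G_bumped = {'s': [('a', 2), ('b', 1)], 'a': [('t', 1)], 'b': [('t', 1)], 't': []}
print(dijkstra(G_bumped, 's')['t'])     # still 2, via the b-route
```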

    Approximating Edit Distance Within Constant Factor in Truly Sub-Quadratic Time

    Edit distance is a measure of similarity of two strings based on the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. The edit distance can be computed exactly using a dynamic programming algorithm that runs in quadratic time. Andoni, Krauthgamer and Onak (2010) gave a nearly linear time algorithm that approximates edit distance within approximation factor poly(log n). In this paper, we provide an algorithm with running time O~(n^{2-2/7}) that approximates the edit distance within a constant factor.
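    The quadratic-time exact baseline mentioned above is the textbook dynamic program (a standard sketch, not the paper's sub-quadratic algorithm):

```python
def edit_distance(a: str, b: str) -> int:
    # Classic O(|a|*|b|) dynamic program, keeping one row at a time:
    # after processing a[:i], prev[j] is the distance between a[:i] and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # delete ca
                           cur[j - 1] + 1,               # insert cb
                           prev[j - 1] + (ca != cb)))    # substitute (or match)
        prev = cur
    return prev[-1]

print(edit_distance("kitten", "sitting"))  # 3
```

The O(n^{2-2/7})-time constant-factor approximation in the paper avoids filling this whole table; the quadratic table is the benchmark it improves upon.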

    Space-Optimal Quasi-Gray Codes with Logarithmic Read Complexity

    A quasi-Gray code of dimension n and length l over an alphabet Sigma is a sequence of distinct words w_1,w_2,...,w_l from Sigma^n such that any two consecutive words differ in at most c coordinates, for some fixed constant c>0. In this paper we are interested in the read and write complexity of quasi-Gray codes in the bit-probe model, where we measure the number of symbols read and written in order to transform any word w_i into its successor w_{i+1}. We present a construction of quasi-Gray codes of dimension n and length 3^n over the ternary alphabet {0,1,2} with worst-case read complexity O(log n) and write complexity 2. This generalizes to arbitrary odd-size alphabets. For the binary alphabet, we present quasi-Gray codes of dimension n and length at least 2^n - 20n with worst-case read complexity 6 + log n and write complexity 2. This complements a recent result by Raskin [Raskin '17] who shows that any quasi-Gray code over the binary alphabet of length 2^n has read complexity Omega(n). Our results significantly improve on previously known constructions, and for odd-size alphabets we break the Omega(n) worst-case barrier for space-optimal (non-redundant) quasi-Gray codes with a constant number of writes. We obtain our results via a novel application of algebraic tools together with the principles of catalytic computation [Buhrman et al. '14, Ben-Or and Cleve '92, Barrington '89, Coppersmith and Grossman '75].
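    The definition can be checked concretely on the classic binary reflected Gray code, which satisfies the condition with c = 1 (write complexity 1, but reading a standard counter is expensive, which is the tension the paper addresses; this is an illustrative sketch, not the paper's construction):

```python
def gray_code(n):
    # Binary reflected Gray code: word i is i ^ (i >> 1), as an n-bit tuple
    # (most significant bit first).
    return [tuple((i ^ (i >> 1)) >> b & 1 for b in reversed(range(n)))
            for i in range(2 ** n)]

words = gray_code(4)
# All 2^n words are distinct, and every pair of consecutive words
# differs in exactly one coordinate: a quasi-Gray code with c = 1.
diffs = [sum(x != y for x, y in zip(w, v)) for w, v in zip(words, words[1:])]
print(len(set(words)), all(d == 1 for d in diffs))  # 16 True
```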

    Awn Reduction and the Domestication of Asian Rice: A Syndrome or Crop Improvement Trait?


    Approximate Trace Reconstruction via Median String (In Average-Case)

    We consider an approximate version of the trace reconstruction problem, where the goal is to recover an unknown string s in {0,1}^n from m traces (each trace is generated independently by passing s through a probabilistic insertion-deletion channel with rate p). We present a deterministic near-linear time algorithm for the average-case model, where s is random, that uses only three traces. It runs in near-linear time O~(n) and with high probability reports a string within edit distance O(epsilon * p * n) from s, for epsilon = O~(p), which significantly improves over the straightforward bound of O(pn). Technically, our algorithm computes a (1+epsilon)-approximate median of the three input traces. To prove its correctness, our probabilistic analysis shows that an approximate median is indeed close to the unknown s. To achieve a near-linear time bound, we have to bypass the well-known dynamic programming algorithm that computes an optimal median in time O(n^3).
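    Generating the input to such an algorithm can be sketched as follows (a simplified insertion-deletion channel for illustration; the paper's exact channel model may differ in details):

```python
import random

def indel_channel(s, p, rng):
    # Simplified channel: each source bit is deleted with probability p,
    # and with probability p a uniformly random bit is inserted before it.
    out = []
    for c in s:
        if rng.random() < p:
            continue                      # deletion
        if rng.random() < p:
            out.append(rng.choice('01'))  # insertion
        out.append(c)
    return ''.join(out)

rng = random.Random(0)
s = ''.join(rng.choice('01') for _ in range(30))  # random s: average case
# Three independent traces of s -- the entire input the algorithm needs.
traces = [indel_channel(s, 0.1, rng) for _ in range(3)]
print(len(traces))  # 3
```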

    Clustering Permutations: New Techniques with Streaming Applications

    10.4230/LIPIcs.ITCS.2023.3125