
    Why is it hard to beat O(n^2) for Longest Common Weakly Increasing Subsequence?

    The Longest Common Weakly Increasing Subsequence problem (LCWIS) is a variant of the classic Longest Common Subsequence problem (LCS). Both problems can be solved with simple quadratic-time algorithms. A recent line of research led to a number of matching conditional lower bounds for LCS and other related problems; however, the status of LCWIS remained open. In this paper we show that LCWIS cannot be solved in strongly subquadratic time unless the Strong Exponential Time Hypothesis (SETH) is false. The ideas we develop can also be used to obtain a lower bound based on the safer assumption of NC-SETH, i.e., a version of SETH that concerns NC circuits instead of the less expressive CNF formulas.
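
    As a concrete baseline, the quadratic algorithm mentioned above is a short dynamic program. The sketch below (illustrative Python, not the paper's construction) maintains, for each position j of the second sequence, the length of the longest common weakly increasing subsequence ending exactly at b[j]:

    def lcwis(a, b):
        # O(len(a) * len(b)) DP for the Longest Common Weakly Increasing
        # Subsequence; dp[j] = length of an LCWIS of the processed prefix
        # of a and of b that ends exactly at b[j].
        dp = [0] * len(b)
        for x in a:
            best = 0  # max dp[k] over k < j with b[k] <= x, updated as j advances
            for j, y in enumerate(b):
                if y == x:
                    old = dp[j]
                    dp[j] = max(dp[j], best + 1)
                    best = max(best, old)  # an equal value may precede a later match
                elif y < x:
                    best = max(best, dp[j])
        return max(dp, default=0)

    For example, lcwis([1, 2, 2], [1, 2, 2]) returns 3; what the paper rules out under SETH is any algorithm improving this quadratic behaviour to strongly subquadratic time.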

    Tight Conditional Lower Bounds for Longest Common Increasing Subsequence

    We consider the canonical generalization of the well-studied Longest Increasing Subsequence problem to multiple sequences, called k-LCIS: given k integer sequences X_1,...,X_k of length at most n, the task is to determine the length of the longest common subsequence of X_1,...,X_k that is also strictly increasing. Especially for the case of k=2 (called LCIS for short), several algorithms have been proposed that require quadratic time in the worst case. Assuming the Strong Exponential Time Hypothesis (SETH), we prove a tight lower bound: no algorithm solves LCIS in (strongly) subquadratic time. Interestingly, the proof makes no use of the normalization tricks common to hardness proofs for similar problems such as LCS. We further strengthen this lower bound to rule out O((nL)^{1-epsilon}) time algorithms for LCIS, where L denotes the solution size, and to rule out O(n^{k-epsilon}) time algorithms for k-LCIS. We obtain the same conditional lower bounds for the related Longest Common Weakly Increasing Subsequence problem.
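
    For comparison with the weak variant sketched earlier, the classic quadratic dynamic program for (strict) LCIS differs only in which earlier values a match may extend; again an illustrative sketch, not the paper's contribution:

    def lcis(a, b):
        # O(len(a) * len(b)) DP for the Longest Common (strictly) Increasing
        # Subsequence; dp[j] = length of an LCIS ending exactly at b[j].
        dp = [0] * len(b)
        for x in a:
            best = 0  # max dp[k] over k < j with b[k] < x (strictly smaller)
            for j, y in enumerate(b):
                if y == x:
                    dp[j] = max(dp[j], best + 1)
                elif y < x:
                    best = max(best, dp[j])
        return max(dp, default=0)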

    A Faster Subquadratic Algorithm for the Longest Common Increasing Subsequence Problem

    The Longest Common Increasing Subsequence (LCIS) is a variant of the classical Longest Common Subsequence (LCS), in which we additionally require the common subsequence to be strictly increasing. While the well-known "Four Russians" technique can be used to find LCS in subquadratic time, it does not seem applicable to LCIS. Recently, Duraj [STACS 2020] used a completely different method based on the combinatorial properties of LCIS to design an O(n^2 (log log n)^2 / log^{1/6} n) time algorithm. We show that an approach based on exploiting tabulation can be used to construct an asymptotically faster O(n^2 log log n / sqrt(log n)) time algorithm. As our solution avoids using the specific combinatorial properties of LCIS, it can also be adapted for the Longest Common Weakly Increasing Subsequence (LCWIS).
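
    Since the abstract invokes the tabulation ("Four Russians") idea without detail, here is a purely didactic sketch of that idea applied to plain LCS (not the paper's LCIS algorithm): adjacent entries of the LCS table differ by 0 or 1, so a t x t block's output boundary depends only on the two substrings and the 0/1 difference vectors on its input boundary, letting identical blocks be computed once and reused.

    from functools import lru_cache

    def lcs_blocked(A, B, t=4):
        # Didactic "Four Russians"-style LCS. D[i][j] = LCS(A[:i], B[:j]) has
        # adjacent differences in {0, 1}; we cut it into t x t blocks and
        # memoize each block's boundary-to-boundary map.
        # Pad to multiples of t with characters assumed absent from the inputs;
        # the two pads cannot match each other, so the LCS is unchanged.
        A += '\x00' * (-len(A) % t)
        B += '\x01' * (-len(B) % t)

        @lru_cache(maxsize=None)
        def block(a, b, top, left):
            # top/left: 0/1 diffs along the block's input row/column, relative
            # to the shared corner (the recurrence is translation-invariant,
            # so absolute values never matter).
            D = [[0] * (t + 1) for _ in range(t + 1)]
            for j in range(t):
                D[0][j + 1] = D[0][j] + top[j]
            for i in range(t):
                D[i + 1][0] = D[i][0] + left[i]
            for i in range(1, t + 1):
                for j in range(1, t + 1):
                    D[i][j] = max(D[i - 1][j], D[i][j - 1],
                                  D[i - 1][j - 1] + (a[i - 1] == b[j - 1]))
            bottom = tuple(D[t][j + 1] - D[t][j] for j in range(t))
            right = tuple(D[i + 1][t] - D[i][t] for i in range(t))
            return bottom, right

        cols = len(B) // t
        horiz = [(0,) * t] * cols      # diffs on the table's top edge: all 0
        for bi in range(len(A) // t):
            vert = (0,) * t            # diffs on the table's left edge: all 0
            for bj in range(cols):
                horiz[bj], vert = block(A[bi * t:(bi + 1) * t],
                                        B[bj * t:(bj + 1) * t],
                                        horiz[bj], vert)
        # D[n][0] = 0, so the LCS length is the sum of diffs along the bottom row.
        return sum(map(sum, horiz))

    For example, lcs_blocked("ABCBDAB", "BDCABA") returns 4. A genuine asymptotic speedup requires a small alphabet and block size t on the order of log n, so that few distinct blocks exist; as the abstract notes, making a tabulation approach work for LCIS, where this simple 0/1 structure does not directly apply, is the paper's contribution.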

    Permutation classes

    This is a survey on permutation classes for the upcoming book Handbook of Enumerative Combinatorics.

    Optimizing Dynamic Time Warping’s Window Width for Time Series Data Mining Applications

    Dynamic Time Warping (DTW) is a highly competitive distance measure for most time series data mining problems. Obtaining the best performance from DTW requires setting its only parameter, the maximum amount of warping (w). In the supervised case with ample data, w is typically set by cross-validation in the training stage. However, this method is likely to yield suboptimal results for small training sets. For the unsupervised case, learning via cross-validation is not possible because we do not have access to labeled data. Many practitioners have thus resorted to assuming that "the larger the better", and they use the largest value of w permitted by the computational resources. However, as we will show, in most circumstances this is a naïve approach that produces inferior clusterings. Moreover, the best warping window width is generally not transferable between the two tasks, i.e., for a single dataset, practitioners cannot simply apply the best w learned for classification to clustering, or vice versa. In addition, we will demonstrate that the appropriate amount of warping depends not only on the data structure, but also on the dataset size. Thus, even if a practitioner knows the best setting for a given dataset, they will likely be at a loss if they apply that setting to a larger version of that data. All these issues seem largely unknown, or at least unappreciated, in the community. In this work, we demonstrate the importance of setting DTW's warping window width correctly, and we also propose novel methods to learn this parameter in both supervised and unsupervised settings. The algorithms we propose to learn w can produce significant improvements in classification accuracy and clustering quality. We demonstrate the correctness of our novel observations and the utility of our ideas by testing them with more than one hundred publicly available datasets. Our results allow us to make a perhaps unexpected claim: an underappreciated "low-hanging fruit" in optimizing DTW's performance can produce improvements that make it an even stronger baseline, closing most or all of the improvement gap to the more sophisticated methods proposed in recent years.
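
    To make the role of w concrete, the following sketch (illustrative Python using a common squared-cost convention; not the paper's code) computes DTW restricted to a Sakoe-Chiba band of half-width w, so that w = 0 degenerates to lockstep alignment and a sufficiently large w gives unconstrained DTW:

    import math

    def dtw_banded(x, y, w):
        # DTW distance between 1-D sequences under a Sakoe-Chiba band:
        # cell (i, j) may be used only if |i - j| <= w.
        n, m = len(x), len(y)
        w = max(w, abs(n - m))  # the band must cover the length difference
        D = [[math.inf] * (m + 1) for _ in range(n + 1)]
        D[0][0] = 0.0
        for i in range(1, n + 1):
            for j in range(max(1, i - w), min(m, i + w) + 1):
                cost = (x[i - 1] - y[j - 1]) ** 2
                D[i][j] = cost + min(D[i - 1][j],       # expand x
                                     D[i][j - 1],       # expand y
                                     D[i - 1][j - 1])   # step both
        return math.sqrt(D[n][m])

    Besides controlling how much alignment flexibility the distance allows, a smaller w also shrinks the evaluated band from O(nm) cells to O(nw), which is why constrained DTW is the norm in practice.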

    27th Annual European Symposium on Algorithms: ESA 2019, September 9-11, 2019, Munich/Garching, Germany


    LIPIcs, Volume 274, ESA 2023, Complete Volume


    36th International Symposium on Theoretical Aspects of Computer Science: STACS 2019, March 13-16, 2019, Berlin, Germany


    Exploiting the Computational Power of Ternary Content Addressable Memory

    Ternary Content Addressable Memory, or TCAM for short, is a special type of memory that can execute a certain set of operations in parallel on all of its words. Because of its power consumption and relatively small storage capacity, it has only been used in special environments. Over the past few years its cost has been reduced and its storage capacity has increased significantly, and these exponential trends are continuing. Hence it can be used in more general environments and for larger problems. In this research we study how to exploit its computational power in order to speed up fundamental problems, and needless to say, we have barely scratched the surface. The main problems addressed in our research are Boolean matrix multiplication, approximate subset queries using Bloom filters, fixed-universe priority queues, and network flow classification. For Boolean matrix multiplication, our simple algorithm has a running time of O(d N^2 / w), where N is the size of the square matrices, w is the number of bits in each word of TCAM, and d is the maximum number of ones in a row of one of the matrices. For the fixed-universe priority queue problem we propose two data structures: one with constant time complexity and O((1/Δ) n U^Δ) space, and the other with linear space and amortized time complexity O((lg lg U)/(lg lg lg U)), which beats the best possible data structure in the RAM model, namely y-fast tries. Considering each word of TCAM as a Bloom filter, we modify the hash functions of the Bloom filter and propose a data structure which can use the information capacity of each word of TCAM more efficiently by exploiting the co-occurrence probability of possible members. Finally, in the last chapter we propose a novel technique for network flow classification using TCAM.
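
    To give a flavour of how TCAM semantics support flow classification, the following toy sketch (a sequential software model; real TCAM performs all comparisons in parallel in a single cycle, and the rule patterns below are hypothetical) matches a key against prioritised ternary patterns over {0, 1, *}:

    def tcam_match(patterns, key):
        # Return the index of the first (highest-priority) pattern that
        # agrees with `key` on every non-'*' bit, or None if none matches.
        for idx, pat in enumerate(patterns):
            if all(p in ('*', k) for p, k in zip(pat, key)):
                return idx
        return None

    rules = [("10*1", "drop"),      # hypothetical classification rules,
             ("1***", "forward_a"), # ordered from highest to lowest priority
             ("****", "default")]
    idx = tcam_match([p for p, _ in rules], "1011")
    print(rules[idx][1])  # -> "drop"

    The don't-care bits are what make a single TCAM word act like a whole family of exact-match entries, which is the information capacity the Bloom filter chapter sets out to exploit.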