15 research outputs found

    Enumeration of three term arithmetic progressions in fixed density sets

    Additive combinatorics is built around the famous theorem of Szemerédi, which asserts the existence of arithmetic progressions of any length in every set of integers of positive density. Several different proofs of the theorem exist, based on very different techniques. Szemerédi's theorem is an existence statement, whereas the ultimate goal in combinatorics is always to make enumeration statements. In this article we develop new methods based on real algebraic geometry to obtain several quantitative statements on the number of arithmetic progressions in fixed density sets. We further discuss the possibility of a generalization of Szemerédi's theorem using methods from real algebraic geometry.
    Comment: 62 pages. Update v2: Corrected some references. Update v3: Incorporated feedback.
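    As a concrete illustration of the enumeration question this abstract raises (not of the paper's real-algebraic-geometry machinery), here is a minimal brute-force Python sketch that counts three-term arithmetic progressions in a finite set; the function name and example set are ours, purely for illustration.

```python
from itertools import combinations

def count_3aps(s: set[int]) -> int:
    """Count 3-term arithmetic progressions (a, a+d, a+2d) with d >= 1
    whose three elements all lie in s."""
    # A pair (a, c) with a < c extends to a 3-AP iff its midpoint is in s.
    return sum(1 for a, c in combinations(sorted(s), 2)
               if (a + c) % 2 == 0 and (a + c) // 2 in s)

# The even numbers below 20 form a density-1/2 subset of {0, ..., 19}.
print(count_3aps(set(range(0, 20, 2))))
```

    Quantitative statements of the kind the paper proves bound such counts uniformly over all sets of a given density, which brute force can only spot-check.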

    Cocyclic Hadamard Matrices: An Efficient Search Based Algorithm

    This dissertation serves as the culmination of three papers. “Counting the decimation classes of binary vectors with relatively prime fixed-density” presents the first non-exhaustive decimation class counting algorithm. “A Novel Approach to Relatively Prime Fixed Density Bracelet Generation in Constant Amortized Time” presents a novel lexicon for binary vectors based upon the Discrete Fourier Transform, and develops a bracelet generation method based upon the same. “A Novel Legendre Pair Generation Algorithm” expands upon the bracelet generation algorithm and includes additional constraints imposed by Legendre Pairs. It further presents an efficient sorting and comparison algorithm based upon symmetric functions, as well as multiple unique Legendre Pairs.
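    To make the Discrete Fourier Transform connection concrete, here is a hedged Python sketch checking the standard power-spectral-density characterization of a Legendre pair: two {+1, -1} sequences of odd length L whose power spectra sum to 2L + 2 at every nonzero frequency. We are assuming this common definition from the literature, since the abstract itself does not define the term, and the checker is ours, not the dissertation's generation algorithm.

```python
import numpy as np

def is_legendre_pair(a, b, tol: float = 1e-8) -> bool:
    """Check the PSD characterization of a Legendre pair: for sequences
    over {+1, -1} of odd length L, the power spectra must satisfy
    |A(k)|^2 + |B(k)|^2 == 2L + 2 at every nonzero frequency k."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    L = len(a)
    if len(b) != L or L % 2 == 0:
        return False
    psd = np.abs(np.fft.fft(a)) ** 2 + np.abs(np.fft.fft(b)) ** 2
    return bool(np.allclose(psd[1:], 2 * L + 2, atol=tol))

# Smallest example: length 3, pairing the sequence (-1, 1, 1) with itself.
print(is_legendre_pair([-1, 1, 1], [-1, 1, 1]))  # True
```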

    Bubble-Flip---A New Generation Algorithm for Prefix Normal Words

    We present a new recursive generation algorithm for prefix normal words. These are binary strings with the property that no substring has more 1s than the prefix of the same length. The new algorithm uses two operations on binary strings which exploit certain properties of prefix normal words in a smart way. We introduce infinite prefix normal words and show that one of the operations used by the algorithm, if applied repeatedly to extend the string, produces an ultimately periodic infinite word which is prefix normal. Moreover, based on the original finite word, we can predict both the length and the density of an ultimate period of this infinite word.
    Comment: 30 pages, 3 figures, accepted in Theoret. Comput. Sci. This is the journal version of the paper with the same title at LATA 2018 (12th International Conference on Language and Automata Theory and Applications, Tel Aviv, April 9-11, 2018).
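    Since the definition of prefix normality is completely elementary, a short Python checker may help fix the idea; this naive quadratic test is ours for illustration and is unrelated to the paper's Bubble-Flip algorithm.

```python
def is_prefix_normal(w: str) -> bool:
    """True iff for every length l, no length-l substring of w has more
    1s than the length-l prefix of w."""
    n = len(w)
    for l in range(1, n + 1):
        prefix_ones = w[:l].count("1")
        # Compare every window of length l against the prefix of length l.
        if any(w[i:i + l].count("1") > prefix_ones
               for i in range(n - l + 1)):
            return False
    return True

print(is_prefix_normal("1101"))   # True
print(is_prefix_normal("10011"))  # False: "11" has more 1s than prefix "10"
```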

    Loopless Algorithms to Generate Maximum Length Gray Cycles wrt. k-Character Substitution

    Given a binary word relation τ onto A* and a finite language X ⊆ A*, a τ-Gray cycle over X consists of a permutation (w[i])_{0 ≤ i ≤ |X|−1} of X such that each word w[i] is an image under τ of the previous word w[i−1]. We define the complexity measure λ_{A,τ}(n), equal to the largest cardinality of a language X having words of length at most n and such that some τ-Gray cycle over X exists. The present paper is concerned with τ = σ_k, the so-called k-character substitution, such that (u,v) ∈ σ_k holds if, and only if, the Hamming distance of u and v is k. We present loopless (resp. constant amortized time) algorithms for computing specific maximum-length σ_k-Gray cycles.
    Comment: arXiv admin note: text overlap with arXiv:2108.1365
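    For readers new to the notation, a small Python verifier (our own illustrative sketch, not one of the paper's loopless generators) makes the definition concrete; we assume the cycle closes up, i.e., the first word must also be an image of the last, in line with the "cycle" terminology.

```python
def hamming(u: str, v: str) -> int:
    """Number of positions where two equal-length words differ."""
    return sum(a != b for a, b in zip(u, v))

def is_sigma_k_gray_cycle(words: list[str], X: set[str], k: int) -> bool:
    """Check that `words` is a sigma_k-Gray cycle over X: a permutation of X
    in which each word has the same length as, and Hamming distance exactly
    k from, its cyclic predecessor."""
    if sorted(words) != sorted(X):
        return False  # not a permutation of X
    return all(len(u) == len(v) and hamming(u, v) == k
               for u, v in zip(words, words[1:] + words[:1]))

# With k = 1 this is the classic binary Gray code condition.
cycle = ["00", "01", "11", "10"]
print(is_sigma_k_gray_cycle(cycle, set(cycle), 1))  # True
```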

    Algorithms and Data Structures for Coding, Indexing, and Mining of Sequential Data

    In recent years, the production of sequential data has been rapidly increasing. This requires solving challenging problems about how to represent information, how to retrieve information, and how to extract knowledge from sequential data. These questions belong to the areas of coding, indexing, and mining, respectively. In this thesis, we investigate problems from those three areas.

    Coding refers to the way in which information is represented. Coding aims at generating optimal codes, that is, codes of minimum expected length. Codes can be generated for different purposes, from data compression to error detection/correction. The Lempel-Ziv 77 parsing produces an asymptotically optimal code in terms of compression. We study algorithms to efficiently decompress strings from the Lempel-Ziv 77 parsing, using memory proportional to the size of the parsing itself. We provide the first implementation of an algorithm by Bille et al., the only work we are aware of on this problem, and present a practical evaluation of this approach together with several optimizations which improve the performance on all datasets we tested. Through the Ulam-Rényi game, it is possible to provide optimal adaptive error-correcting codes. The game consists of discovering an unknown m-bit number by asking membership questions, the answers to which can be erroneous. Questions are formulated knowing the answers to all previous ones. We want to find an optimal strategy, i.e., one that can identify any m-bit number using the theoretical minimum number of questions. We studied the case where questions are a union of up to a fixed number of intervals and up to three answers can be erroneous. We first show that, for any sufficiently large m, there exists a strategy to identify an initially unknown m-bit number which uses at most four intervals per question. We further refine our main tool to turn this asymptotic result into a complete characterization of those instances of the Ulam-Rényi game that admit optimal strategies.

    Indexing refers to the way in which information is retrieved. An index for texts permits finding all occurrences of any substring without traversing the whole text. Many applications require looking for approximate substrings. One of these is the problem of jumbled pattern matching, where two strings match if one is a permutation of the other. We study combinatorial aspects of prefix normal words, a class of binary words introduced in this context. These words can be used as indices for the Indexed Binary Jumbled Pattern Matching problem. We present a new recursive generation algorithm for prefix normal words that is competitive with the previous one but also lists all prefix normal words sharing the same prefix. This yields new insights that may help solve the problem of counting the prefix normal words of a given length. We then introduce infinite prefix normal words, and we show that one of the operations used by the algorithm, when repeatedly applied to extend a word, produces an infinite prefix normal word. This motivates the search for other operations that produce infinite prefix normal words. We found that one of these operations establishes a connection between prefix normal words and Sturmian words. We also explored the relationship between prefix normal words and Abelian complexity, as well as between prefix normal words and lexicographic order.
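    As a pointer to why binary words matter here, recall the folklore interval trick behind Indexed Binary Jumbled Pattern Matching: over a fixed window length, the number of 1s changes by at most one as the window slides, so a jumbled pattern with x 0s and y 1s occurs iff y lies between the minimum and maximum 1s-count over windows of length x + y. The Python sketch below (our illustration, with a quadratic-time construction rather than the faster ones from the literature) realizes exactly this index.

```python
def build_jumbled_index(s: str) -> dict[int, tuple[int, int]]:
    """For each window length l, store (min, max) number of 1s over all
    length-l substrings of the binary string s."""
    n = len(s)
    ones = [0] * (n + 1)  # prefix sums of 1s
    for i, c in enumerate(s):
        ones[i + 1] = ones[i] + (c == "1")
    index = {}
    for l in range(1, n + 1):
        counts = [ones[i + l] - ones[i] for i in range(n - l + 1)]
        index[l] = (min(counts), max(counts))
    return index

def occurs(index: dict[int, tuple[int, int]], zeros: int, ones_: int) -> bool:
    """A jumbled pattern occurs iff its 1s-count falls in the interval
    recorded for its total length (the sliding-window interval property)."""
    l = zeros + ones_
    return l in index and index[l][0] <= ones_ <= index[l][1]

idx = build_jumbled_index("101100")
print(occurs(idx, 1, 2))  # True: "110" (among others) is a permutation match
```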
    Mining refers to the way in which information is converted into knowledge. The process of knowledge discovery covers several processing steps, including knowledge extraction. We analyze the problem of mining assertions for an embedded system from its simulation traces. This problem can be modeled as a pattern discovery problem on colored strings. We consider two such problems: discovering patterns for one color only, and for all colors at the same time. We present two suffix-tree-based algorithms. The first solves both the one-color and the all-colors problem. We then introduce modifications which improve the performance of the algorithm both on synthetic and on real data. We implemented and evaluated the proposed approaches, highlighting the time trade-offs that can be obtained.

    A different way of extracting knowledge is based on the information-theoretic perspective of Pearl's model of causality. It has been postulated that the true causal direction between two phenomena A and B is related to the problem of finding the minimum entropy joint distribution of A and B. This problem is known to be NP-hard, and greedy algorithms have recently been proposed. We provide a novel analysis of one of the proposed heuristics, showing that it guarantees an additive approximation of 1 bit. We then provide a general criterion for guaranteeing an additive approximation factor of 1. This criterion may be of independent interest in other contexts where couplings are used.
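    To illustrate the kind of greedy heuristic at stake, the following Python sketch builds a coupling by repeatedly pairing the largest remaining probability masses of the two marginals and assigning their minimum to a joint cell. This is a common variant from the minimum-entropy-coupling literature; whether it is exactly the algorithm analyzed in the thesis is an assumption on our part.

```python
import heapq
from math import log2

def greedy_coupling(p: list[float], q: list[float]):
    """Greedy coupling of marginals p and q: repeatedly pop the largest
    remaining mass from each side, put the minimum of the two into a joint
    cell, and push back the leftover. Returns a list of ((i, j), mass)."""
    P = [(-x, i) for i, x in enumerate(p) if x > 0]  # max-heaps via negation
    Q = [(-x, j) for j, x in enumerate(q) if x > 0]
    heapq.heapify(P)
    heapq.heapify(Q)
    cells = []
    while P and Q:
        np_, i = heapq.heappop(P)
        nq_, j = heapq.heappop(Q)
        m = min(-np_, -nq_)
        cells.append(((i, j), m))
        if -np_ - m > 1e-12:  # push back p's leftover mass
            heapq.heappush(P, (np_ + m, i))
        if -nq_ - m > 1e-12:  # push back q's leftover mass
            heapq.heappush(Q, (nq_ + m, j))
    return cells

def joint_entropy(cells) -> float:
    return -sum(m * log2(m) for _, m in cells)

cells = greedy_coupling([0.5, 0.5], [0.5, 0.25, 0.25])
print(joint_entropy(cells))  # 1.5 bits
```

    Any coupling's entropy is at least max(H(p), H(q)) = 1.5 bits on this toy input, so here the greedy output is optimal; the general result discussed above is the 1-bit additive guarantee.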