1,543 research outputs found

    A decomposition method for global evaluation of Shannon entropy and local estimations of algorithmic complexity

    We investigate the properties of a Block Decomposition Method (BDM), which extends the power of a Coding Theorem Method (CTM) that approximates local estimations of algorithmic complexity based on Solomonoff–Levin’s theory of algorithmic probability, providing a closer connection to algorithmic complexity than previous attempts based on statistical regularities such as popular lossless compression schemes. The strategy behind BDM is to find small computer programs that produce the components of a larger, decomposed object. The set of short computer programs can then be artfully arranged in sequence so as to produce the original object. We show that the method provides efficient estimations of algorithmic complexity but that it performs like Shannon entropy when it loses accuracy. We estimate errors and study the behaviour of BDM for different boundary conditions, all of which are compared and assessed in detail. The measure may be adapted for use with objects of higher dimension than strings, such as arrays and tensors. To test the measure we demonstrate the power of CTM on low algorithmic-randomness objects that are assigned maximal entropy (e.g., π) but whose numerical approximations are closer to the theoretical low algorithmic-randomness expectation. We also test the measure on larger objects including dual, isomorphic and cospectral graphs for which we know that algorithmic randomness is low. We also release implementations of the methods in most major programming languages, namely Wolfram Language (Mathematica), Matlab, R, Perl, Python, Pascal, C++, and Haskell, as well as an online algorithmic complexity calculator. Swedish Research Council (Vetenskapsrådet
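    The decomposition idea in this abstract can be illustrated with a minimal sketch. It assumes the commonly stated form of BDM, in which each distinct block contributes its CTM value plus the base-2 logarithm of its multiplicity, and it drops a trailing partial block (only one of several possible boundary conditions). The names `blocks`, `block_entropy` and `bdm` are hypothetical, and the `ctm` table is a caller-supplied placeholder: real CTM values come from precomputed tables that are not reproduced here.

```python
from collections import Counter
from math import log2


def blocks(s, size):
    """Split s into non-overlapping blocks, dropping a trailing partial block."""
    return [s[i:i + size] for i in range(0, len(s) - size + 1, size)]


def block_entropy(s, size):
    """Shannon entropy (in bits) of the empirical block distribution."""
    counts = Counter(blocks(s, size))
    total = sum(counts.values())
    return -sum(c / total * log2(c / total) for c in counts.values())


def bdm(s, size, ctm):
    """Sum CTM(block) + log2(multiplicity) over the distinct blocks of s.

    `ctm` maps each block to a complexity estimate; the values used with
    this sketch are placeholders, not real CTM data.
    """
    counts = Counter(blocks(s, size))
    return sum(ctm[b] + log2(n) for b, n in counts.items())
```

    With a placeholder table such as `{"0101": 10.0}`, the call `bdm("0101" * 3, 4, {"0101": 10.0})` charges the block once and adds `log2(3)` for its three repetitions, whereas the block entropy of the same string is zero.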

    Periodic-orbit theory of universal level correlations in quantum chaos

    Using Gutzwiller's semiclassical periodic-orbit theory we demonstrate universal behaviour of the two-point correlator of the density of levels for quantum systems whose classical limit is fully chaotic. We go beyond previous work in establishing the full correlator such that its Fourier transform, the spectral form factor, is determined for all times, below and above the Heisenberg time. We cover dynamics with and without time reversal invariance (from the orthogonal and unitary symmetry classes). A key step in our reasoning is to sum the periodic-orbit expansion in terms of a matrix integral, like the one known from the sigma model of random-matrix theory. Comment: 44 pages, 11 figures, changed title; final version published in New J. Phys. + additional appendices B-F not included in the journal version

    Optimal-Time Dictionary-Compressed Indexes

    We describe the first self-indexes able to count and locate pattern occurrences in optimal time within a space bounded by the size of the most popular dictionary compressors. To achieve this result we combine several recent findings, including \emph{string attractors} (new combinatorial objects encompassing most known compressibility measures for highly repetitive texts) and grammars based on \emph{locally-consistent parsing}. In more detail, let $\gamma$ be the size of the smallest attractor for a text $T$ of length $n$. The measure $\gamma$ is an (asymptotic) lower bound to the size of dictionary compressors based on Lempel-Ziv, context-free grammars, and many others. The smallest known text representations in terms of attractors use space $O(\gamma\log(n/\gamma))$, and our lightest indexes work within the same asymptotic space. Let $\epsilon>0$ be a suitably small constant fixed at construction time, $m$ be the pattern length, and $occ$ be the number of its text occurrences. Our index counts pattern occurrences in $O(m+\log^{2+\epsilon}n)$ time, and locates them in $O(m+(occ+1)\log^\epsilon n)$ time. These times already outperform those of most dictionary-compressed indexes, while obtaining the least asymptotic space for any index searching within $O((m+occ)\,\textrm{polylog}\,n)$ time. Further, by increasing the space to $O(\gamma\log(n/\gamma)\log^\epsilon n)$, we reduce the locating time to the optimal $O(m+occ)$, and within $O(\gamma\log(n/\gamma)\log n)$ space we can also count in optimal $O(m)$ time. No dictionary-compressed index had obtained this time before. All our indexes can be constructed in $O(n)$ space and $O(n\log n)$ expected time. As a byproduct of independent interest..
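    The abstract does not define string attractors. As a hedged illustration, the sketch below assumes the usual definition: a set of text positions is an attractor if every distinct substring has at least one occurrence that covers one of those positions. The function name `is_attractor` and the 0-based positions are choices made for this sketch; it is a brute-force check for tiny inputs and has nothing to do with the indexes described in the paper.

```python
def is_attractor(text, positions):
    """Brute-force check that `positions` (0-based) is a string attractor of `text`."""
    pos = set(positions)
    n = len(text)
    for length in range(1, n + 1):
        # Every distinct substring of this length needs a covering occurrence.
        for sub in {text[i:i + length] for i in range(n - length + 1)}:
            covered = False
            start = text.find(sub)
            while start != -1:
                if any(start <= p < start + length for p in pos):
                    covered = True
                    break
                # Advance by one so overlapping occurrences are also examined.
                start = text.find(sub, start + 1)
            if not covered:
                return False
    return True
```

    For example, `is_attractor("abab", {0, 1})` holds, while `{0}` alone fails because no occurrence of `"b"` covers position 0.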

    Bibliographie

    One-way definability of two-way word transducers

    Functional transductions realized by two-way transducers (or, equally, by streaming transducers or MSO transductions) are the natural and standard notion of `regular' mappings from words to words. It was shown in 2013 that it is decidable if such a transduction can be implemented by some one-way transducer, but the given algorithm has non-elementary complexity. We provide an algorithm of a different flavor solving the above question, which has doubly exponential space complexity. In the special case of sweeping transducers the complexity is one exponential less. We also show how to construct an equivalent one-way transducer, whenever it exists, in doubly or triply exponential time, again depending on whether the input transducer is sweeping or two-way. In the sweeping case our construction is shown to be optimal.

    Automatic Sequences and Decidable Properties: Implementation and Applications

    In 1912 Axel Thue sparked the study of combinatorics on words when he showed that the Thue-Morse sequence contains no overlaps, that is, factors of the form ayaya. Since then, many interesting properties of sequences have been discovered and studied. In this thesis, we consider a class of infinite sequences generated by automata, called the k-automatic sequences. In particular, we present a logical theory in which many properties of k-automatic sequences can be expressed as predicates and we show that such predicates are decidable. Our main contribution is the implementation of a theorem prover capable of practically characterizing many commonly sought-after properties of k-automatic sequences. We showcase a panoply of results achieved using our method. We give new explicit descriptions of the recurrence and appearance functions of a list of well-known k-automatic sequences. We define a related function, called the condensation function, and give explicit descriptions for it as well. We re-affirm known results on the critical exponent of some sequences and determine it for others where it was previously unknown. On the more theoretical side, we show that the subword complexity p(n) of k-automatic sequences is k-synchronized, i.e., the language of pairs (n, p(n)) (expressed in base k) is accepted by an automaton. Furthermore, we prove that the Lyndon factorization of k-automatic sequences is also k-automatic and explicitly compute the factorization for several sequences. Finally, we show that while the number of unbordered factors of length n is not k-synchronized, it is k-regular.
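    Thue's overlap-freeness result can be checked empirically on a finite prefix. The sketch below assumes the standard characterization of the Thue-Morse sequence (the n-th term is the parity of the number of 1 bits in n) and treats an overlap as a factor of length 2p+1 with period p, which matches the form ayaya with a a letter and y a word of length p-1. The names `thue_morse` and `has_overlap` are hypothetical, and this brute-force search is unrelated to the automaton-based theorem prover the thesis describes.

```python
def thue_morse(n):
    """First n terms of the Thue-Morse sequence as a string over {0, 1}."""
    return "".join(str(bin(i).count("1") % 2) for i in range(n))


def has_overlap(s):
    """Return True if s contains a factor of length 2p+1 with period p (p >= 1)."""
    n = len(s)
    for p in range(1, n // 2 + 1):
        for i in range(n - 2 * p):
            # The factor s[i : i + 2p + 1] has period p iff these p+1 pairs agree.
            if all(s[i + j] == s[i + j + p] for j in range(p + 1)):
                return True
    return False
```

    A finite check such as `has_overlap(thue_morse(256))` only confirms the property on a prefix; the decision procedure in the thesis is what settles such statements for the whole infinite sequence.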

    "Daleki svijet muzikom dokučen" ("A distant world touched by music"): a contextual and critical study of Yugoslavian music as exemplified in the life and music of Josip Stolcer Slavenski (1896-1955)

    The core of this study is a contextual, critical and analytical study of the life and work of Josip Stolcer Slavenski (1896-1955). It consists of a brief outline of nineteenth-century socio-political, cultural and musical trends in the former Yugoslavia which serves as a broader context for the period of Moderna (avant-garde). This movement, which attempted to embrace new ideas and developments of mainstream western Europe, emerged at the end of the nineteenth century and played a major role in Slavenski's musical development. Slavenski's particular circumstances are examined in the biographical chapter, and his periods of study in Budapest, Prague and Paris are discussed, as are the important recognition of his music at the Donaueschingen Festival in the 1920s and the way the changing political climate of Yugoslavia, from monarchy to republic, influenced Slavenski's political and national affiliations. With this contextual backdrop, which includes reference to a number of Slavenski's contemporaries (namely Kodaly, Bartok, Suk, Novak, Hindemith, composers of the Second Viennese School, and his own Croatian, Serbian and Slovenian compatriots), the dissertation focusses on particular aspects of Slavenski's work with special emphasis on his use of Balkan folk music, his autodidactic theories of 'Astroacoustics', the expressionist fabric of his music, and the socialist realism of his later works during the 1940s and 1950s. In attempting to illuminate the nature of the 'Slavenski phenomena', this work also offers certain suggestions as to what might have been the reasons for the neglect and creative aridity of his later years. Furthermore, it is hoped that the conclusions of this study may in themselves provide an impetus for further research into Slavenski's work, which has received little attention in English-speaking musicology.

    Privileged Words and Sturmian Words

    This dissertation has two almost unrelated themes: privileged words and Sturmian words. Privileged words are a new class of words introduced recently. A word is privileged if it is a complete first return to a shorter privileged word, the shortest privileged words being letters and the empty word. Here we give and prove almost all results on privileged words known to date. On the other hand, the study of Sturmian words is a well-established topic in combinatorics on words. In this dissertation, we focus on questions concerning repetitions in Sturmian words, reproving old results and giving new ones, and on establishing completely new research directions. The study of privileged words presented in this dissertation aims to derive their basic properties and to answer basic questions regarding them. We explore a connection between privileged words and palindromes and seek out answers to questions on context-freeness, computability, and enumeration. It turns out that the language of privileged words is not context-free, but privileged words are recognizable by a linear-time algorithm. A lower bound on the number of binary privileged words of given length is proven. The main interest, however, lies in the privileged complexity functions of the Thue-Morse word and Sturmian words. We derive recurrences for computing the privileged complexity function of the Thue-Morse word, and we prove that Sturmian words are characterized by their privileged complexity function. As a slightly separate topic, we give an overview of a certain method of automated theorem-proving and show how it can be applied to study privileged factors of automatic words. The second part of this dissertation is devoted to Sturmian words. We extensively exploit the interpretation of Sturmian words as irrational rotation words. The essential tools are continued fractions and elementary, but powerful, results of Diophantine approximation theory. 
    With these tools at our disposal, we reprove old results on powers occurring in Sturmian words with emphasis on the fractional index of a Sturmian word. Further, we consider abelian powers and abelian repetitions and characterize the maximum exponents of abelian powers with given period occurring in a Sturmian word in terms of the continued fraction expansion of its slope. We define the notion of abelian critical exponent for Sturmian words and explore its connection to the Lagrange spectrum of irrational numbers. The results obtained are often specialized for the Fibonacci word; for instance, we show that the minimum abelian period of a factor of the Fibonacci word is a Fibonacci number. In addition, we propose a completely new research topic: the square root map. We prove that the square root map preserves the language of any Sturmian word. Moreover, we construct a family of non-Sturmian optimal squareful words whose language the square root map also preserves. This construction yields examples of aperiodic infinite words whose square roots are periodic.
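    The recursive definition of privileged words given in this abstract can be turned into a direct check. The sketch below reads "complete first return to u" as: u is both a prefix and a suffix of the word and occurs in it exactly twice, overlaps allowed, and it considers only nonempty u. The names `occurrences`, `is_privileged` and `count_binary_privileged` are hypothetical, and this naive recursion is not the linear-time recognition algorithm the dissertation refers to.

```python
from itertools import product


def occurrences(w, u):
    """Number of (possibly overlapping) occurrences of u in w."""
    return sum(w.startswith(u, i) for i in range(len(w) - len(u) + 1))


def is_privileged(w):
    """Naive recursive test of the definition; words of length <= 1 are privileged."""
    if len(w) <= 1:
        return True
    for k in range(1, len(w)):
        u = w[:k]
        # w is a complete first return to u: prefix, suffix, exactly two occurrences.
        if w.endswith(u) and occurrences(w, u) == 2 and is_privileged(u):
            return True
    return False


def count_binary_privileged(n):
    """Count privileged words of length n over the alphabet {a, b} by enumeration."""
    return sum(is_privileged("".join(t)) for t in product("ab", repeat=n))
```

    For instance, `is_privileged("aba")` holds because "aba" is a complete first return to "a", while "abab" is rejected because its only border, "ab", is not itself privileged.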