4 research outputs found

    Average Redundancy for Known Sources: Ubiquitous Trees in Source Coding

    Get PDF
    Analytic information theory aims at studying problems of information theory using analytic techniques of computer science and combinatorics. Following Hadamard's precept, these problems are tackled by complex analysis methods such as generating functions, Mellin transform, Fourier series, saddle point method, analytic poissonization and depoissonization, and singularity analysis. This approach lies at the crossroad of computer science and information theory. In this survey we concentrate on one facet of information theory (i.e., source coding better known as data compression), namely the redundancy rate\textit{redundancy rate} problem. The redundancy rate problem determines by how much the actual code length exceeds the optimal code length. We further restrict our interest to the average\textit{average} redundancy for known\textit{known} sources, that is, when statistics of information sources are known. We present precise analyses of three types of lossless data compression schemes, namely fixed-to-variable (FV) length codes, variable-to-fixed (VF) length codes, and variable-to-variable (VV) length codes. In particular, we investigate average redundancy of Huffman, Tunstall, and Khodak codes. These codes have succinct representations as trees\textit{trees}, either as coding or parsing trees, and we analyze here some of their parameters (e.g., the average path from the root to a leaf)

    Enumeration of General t-ary Trees and Universal Types

    Get PDF
    We consider t-ary trees characterized by their numbers of nodes and their total path length. When t=2 these are called binary trees, and in such trees a parent node may have up to t child nodes. We give asymptotic expansions for the total number of trees with nodes and path length p, when n and p are large. We consider several different ranges of n and p. For n→∞ and p=O(n^{3/2}) we recover the Airy distribution for the path length in trees with many nodes, and also obtain higher order asymptotic results. For p→∞ and an appropriate range of n we obtain a limiting Gaussian distribution for the number of nodes in trees with large path lengths. The mean and variance are expressed in terms of the maximal root of the Airy function. Singular perturbation methods, such as asymptotic matching and WKB type expansions, are used throughout, and they are combined with more standard methods of analytic combinatorics, such as generating functions, singularity analysis, saddle point method, etc. The results are applicable to problems in information theory, that involve data compression schemes which parse long sequence into shorter phrases. Numerical studies show the accuracy of the various asymptotic approximations. Key Words: Trees; Universal Types; Asymptotics; Path Length; Singular Perturbation

    On the Number of t-Ary Trees with a Given Path Length

    No full text

    On the Number of t-Ary Trees with a Given Path Length

    No full text
    We show that the number of t-ary trees with path length equal to p is h(t -1 ) tp log 2 p , where h(x)=-x log 2 x-(1-x) log 2 (1-x) is the binary entropy function. Besides its intrinsic combinatorial interest, the question recently arose in the context of information theory, where the number of t-ary trees with path length p estimates the number of universal types, or, equivalently, the number of di#erent possible Lempel-Ziv'78 dictionaries for sequences of length p over an alphabet of size t
    corecore