46 research outputs found

    Bi-Lipschitz Bijection between the Boolean Cube and the Hamming Ball

    Full text link
    We construct a bi-Lipschitz bijection from the Boolean cube to the Hamming ball of equal volume. More precisely, we show that for all even n there exists an explicit bijection f from the n-dimensional Boolean cube to the Hamming ball of equal volume embedded in (n+1)-dimensional Boolean cube, such that for all x and y it holds that distance(x,y) / 5 <= distance(f(x),f(y)) <= 4 distance(x,y) where distance(,) denotes the Hamming distance. In particular, this implies that the Hamming ball is bi-Lipschitz transitive. This result gives a strong negative answer to an open problem of Lovett and Viola [CC 2012], who raised the question in the context of sampling distributions in low-level complexity classes. The conceptual implication is that the problem of proving lower bounds in the context of sampling distributions will require some new ideas beyond the sensitivity-based structural results of Boppana [IPL 97]. We study the mapping f further and show that it (and its inverse) are computable in DLOGTIME-uniform TC0, but not in AC0. Moreover, we prove that f is "approximately local" in the sense that all but the last output bit of f are essentially determined by a single input bit

    Lipschitz bijections between boolean functions

    Full text link
    We answer four questions from a recent paper of Rao and Shinkar on Lipschitz bijections between functions from {0,1}n\{0,1\}^n to {0,1}\{0,1\}. (1) We show that there is no O(1)O(1)-bi-Lipschitz bijection from Dictator\mathrm{Dictator} to XOR\mathrm{XOR} such that each output bit depends on O(1)O(1) input bits. (2) We give a construction for a mapping from XOR\mathrm{XOR} to Majority\mathrm{Majority} which has average stretch O(n)O(\sqrt{n}), matching a previously known lower bound. (3) We give a 3-Lipschitz embedding ϕ:{0,1}n{0,1}2n+1\phi : \{0,1\}^n \to \{0,1\}^{2n+1} such that XOR(x)=Majority(ϕ(x))\mathrm{XOR}(x) = \mathrm{Majority}(\phi(x)) for all x{0,1}nx \in \{0,1\}^n. (4) We show that with high probability there is a O(1)O(1)-bi-Lipschitz mapping from Dictator\mathrm{Dictator} to a uniformly random balanced function

    The combinatorics of adinkras

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Department of Mathematics, 2013.Cataloged from PDF version of thesis.Includes bibliographical references (pages 67-69).Adinkras are graphical tools created to study representations of supersymmetry algebras. Besides having inherent interest for physicists, the study of adinkras has already shown nontrivial connections with coding theory and Clifford algebras. Furthermore, adinkras offer many easy-to-state and accessible mathematical problems of algebraic, combinatorial, and computational nature. In this work, we make a self-contained treatment of the mathematical foundations of adinkras that slightly generalizes the existing literature. Then, we make new connections to other areas including homological algebra, theory of polytopes, Pfaffian orientations, graph coloring, and poset theory. Selected results include the enumeration of odd dashings for all adinkraizable chromotopologies, the notion of Stiefel-Whitney classes for codes and their vanishing conditions, and the enumeration of all Hamming cube adinkras up through dimension 5.by Yan Zhang.Ph.D

    Juntas in the 1\ell^{1}-grid and Lipschitz maps between discrete tori

    Full text link
    We show that if A[k]nA \subset [k]^n, then AA is ϵ\epsilon-close to a junta depending upon at most exp(O(A/(kn1ϵ)))\exp(O(|\partial A|/(k^{n-1}\epsilon))) coordinates, where A\partial A denotes the edge-boundary of AA in the 1\ell^1-grid. This is sharp up to the value of the absolute constant in the exponent. This result can be seen as a generalisation of the Junta theorem for the discrete cube, from [E. Friedgut, Boolean functions with low average sensitivity depend on few coordinates, Combinatorica 18 (1998), 27-35], or as a characterization of large subsets of the 1\ell^1-grid whose edge-boundary is small. We use it to prove a result on the structure of Lipschitz functions between two discrete tori; this can be seen as a discrete, quantitative analogue of a recent result of Austin [T. Austin, On the failure of concentration for the \ell^{\infty}-ball, preprint]. We also prove a refined version of our junta theorem, which is sharp in a wider range of cases.Comment: 29 pages. A mistake in Example 2 (pointed out by an anonymous referee) has now been correcte

    Concentration of measure, negative association, and machine learning

    Full text link
    In this thesis we consider concentration inequalities and the concentration of measure phenomenon from a variety of angles. Sharp tail bounds on the deviation of Lipschitz functions of independent random variables about their mean are well known. We consider variations on this theme for dependent variables on the Boolean cube. In recent years negatively associated probability distributions have been studied as potential generalizations of independent random variables. Results on this class of distributions have been sparse at best, even when restricting to the Boolean cube. We consider the class of negatively associated distributions topologically, as a subset of the general class of probability measures. Both the weak (distributional) topology and the total variation topology are considered, and the simpler notion of negative correlation is investigated. The concentration of measure phenomenon began with Milman's proof of Dvoretzky's theorem, and is therefore intimately connected to the field of high-dimensional convex geometry. Recently this field has found application in the area of compressed sensing. We consider these applications and in particular analyze the use of Gordon's min-max inequality in various compressed sensing frameworks, including the Dantzig selector and the matrix uncertainty selector. Finally we consider the use of concentration inequalities in developing a theoretically sound anomaly detection algorithm. Our method uses a ranking procedure based on KNN graphs of given data. We develop a max-margin learning-to-rank framework to train limited complexity models to imitate these KNN scores. The resulting anomaly detector is shown to be asymptotically optimal in that for any false alarm rate α, its decision region converges to the α-percentile minimum volume level set of the unknown underlying density

    Tree-like graphings, wallings, and median graphings of equivalence relations

    Full text link
    We prove several results showing that every locally finite Borel graph whose large-scale geometry is "tree-like" induces a treeable equivalence relation. In particular, our hypotheses hold if each component of the original graph either has bounded tree-width or is quasi-isometric to a tree. In the latter case, we moreover show that there exists a Borel quasi-isometry to a Borel forest, under the additional assumption of (componentwise) bounded degree. We also extend these results on quasi-treeings to Borel proper metric spaces. In fact, our most general result shows treeability of countable Borel equivalence relations equipped with an abstract wallspace structure on each class obeying some local finiteness conditions, which we call a proper walling. The proof is based on the Stone duality between proper wallings and median graphs, i.e., CAT(0) cube complexes.Comment: 38 page

    Helly groups

    Full text link
    Helly graphs are graphs in which every family of pairwise intersecting balls has a non-empty intersection. This is a classical and widely studied class of graphs. In this article we focus on groups acting geometrically on Helly graphs -- Helly groups. We provide numerous examples of such groups: all (Gromov) hyperbolic, CAT(0) cubical, finitely presented graphical C(4)-T(4) small cancellation groups, and type-preserving uniform lattices in Euclidean buildings of type CnC_n are Helly; free products of Helly groups with amalgamation over finite subgroups, graph products of Helly groups, some diagram products of Helly groups, some right-angled graphs of Helly groups, and quotients of Helly groups by finite normal subgroups are Helly. We show many properties of Helly groups: biautomaticity, existence of finite dimensional models for classifying spaces for proper actions, contractibility of asymptotic cones, existence of EZ-boundaries, satisfiability of the Farrell-Jones conjecture and of the coarse Baum-Connes conjecture. This leads to new results for some classical families of groups (e.g. for FC-type Artin groups) and to a unified approach to results obtained earlier

    Doctor of Philosophy in Computing

    Get PDF
    dissertationIn the last two decades, an increasingly large amount of data has become available. Massive collections of videos, astronomical observations, social networking posts, network routing information, mobile location history and so forth are examples of real world data requiring processing for applications ranging from classi?cation to predictions. Computational resources grow at a far more constrained rate, and hence the need for ef?cient algorithms that scale well. Over the past twenty years high quality theoretical algorithms have been developed for two central problems: nearest neighbor search and dimensionality reduction over Euclidean distances in worst case distributions. These two tasks are interesting in their own right. Nearest neighbor corresponds to a database query lookup, while dimensionality reduction is a form of compression on massive data. Moreover, these are also subroutines in algorithms ranging from clustering to classi?cation. However, many highly relevant settings and distance measures have not received similar attention to that of worst case point sets in Euclidean space. The Bregman divergences include the information theoretic distances, such as entropy, of most relevance in many machine learning applications and yet prior to this dissertation lacked ef?cient dimensionality reductions, nearest neighbor algorithms, or even lower bounds on what could be possible. Furthermore, even in the Euclidean setting, theoretical algorithms do not leverage that almost all real world datasets have signi?cant low-dimensional substructure. In this dissertation, we explore different models and techniques for similarity search and dimensionality reduction. What upper bounds can be obtained for nearest neighbors for Bregman divergences? What upper bounds can be achieved for dimensionality reduction for information theoretic measures? Are these problems indeed intrinsically of harder computational complexity than in the Euclidean setting? Can we improve the state of the art nearest neighbor algorithms for real world datasets in Euclidean space? These are the questions we investigate in this dissertation, and that we shed some new insight on. In the ?rst part of our dissertation, we focus on Bregman divergences. We exhibit nearest neighbor algorithms, contingent on a distributional constraint on the datasets. We next show lower bounds suggesting that is in some sense inherent to the problem complexity. After this we explore dimensionality reduction techniques for the Jensen-Shannon and Hellinger distances, two popular information theoretic measures. In the second part, we show that even for the more well-studied Euclidean case, worst case nearest neighbor algorithms can be improved upon sharply for real world datasets with spectral structure
    corecore