5,246 research outputs found

    Choosing Book Friends

    Get PDF
    PDF pages: 3

    Sharp asymptotic and finite-sample rates of convergence of empirical measures in Wasserstein distance

    Full text link
    The Wasserstein distance between two probability measures on a metric space is a measure of closeness with applications in statistics, probability, and machine learning. In this work, we consider the fundamental question of how quickly the empirical measure obtained from nn independent samples from μ\mu approaches μ\mu in the Wasserstein distance of any order. We prove sharp asymptotic and finite-sample results for this rate of convergence for general measures on general compact metric spaces. Our finite-sample results show the existence of multi-scale behavior, where measures can exhibit radically different rates of convergence as nn grows

    Uncoupled isotonic regression via minimum Wasserstein deconvolution

    Full text link
    Isotonic regression is a standard problem in shape-constrained estimation where the goal is to estimate an unknown nondecreasing regression function ff from independent pairs (xi,yi)(x_i, y_i) where E[yi]=f(xi),i=1,…n\mathbb{E}[y_i]=f(x_i), i=1, \ldots n. While this problem is well understood both statistically and computationally, much less is known about its uncoupled counterpart where one is given only the unordered sets {x1,…,xn}\{x_1, \ldots, x_n\} and {y1,…,yn}\{y_1, \ldots, y_n\}. In this work, we leverage tools from optimal transport theory to derive minimax rates under weak moments conditions on yiy_i and to give an efficient algorithm achieving optimal rates. Both upper and lower bounds employ moment-matching arguments that are also pertinent to learning mixtures of distributions and deconvolution.Comment: To appear in Information and Inference: a Journal of the IM

    Entropic optimal transport is maximum-likelihood deconvolution

    Get PDF
    We give a statistical interpretation of entropic optimal transport by showing that performing maximum-likelihood estimation for Gaussian deconvolution corresponds to calculating a projection with respect to the entropic optimal transport distance. This structural result gives theoretical support for the wide adoption of these tools in the machine learning community
    • …
    corecore