Statistical aspects of optimal transport

Abstract

Optimal transport (OT) is a flexible framework for contrasting and interpolating probability measures which has recently been applied throughout science, including in machine learning, statistics, graphics, economics, biology, and more. In this thesis, we study several statistical problems at the forefront of applied optimal transport, prioritizing statistically and computationally practical results. We begin by considering one of the most popular applications of OT in practice, the barycenter problem, providing dimension-free rates of statistical estimation. In the Gaussian case, we analyze first-order methods for computing barycenters, and develop global, dimension-free rates of convergence despite the non-convexity of the problem. Extending beyond the Gaussian case, however, is challenging due to the fundamental curse of dimensionality for OT, which motivates the study of a regularized, and in fact more computationally feasible, form of optimal transport, dubbed entropic optimal transport (entropic OT). Recent work has suggested that entropic OT may escape the curse of dimensionality of un-regularized OT, and in this thesis we develop a refined theory of the statistical behavior of entropic OT by showing that entropic OT does attain truly dimension-free rates of convergence in the large regularization regime, as well as automatically adapts to the intrinsic dimension of the data in the small regularization regime. We also consider the rate of approximation of entropic OT in the semi-discrete case, and complement these results by considering the problem of trajectory reconstruction, proposing two practical methods based off both un-regularized and entropic OT.Ph.D

    Similar works

    Full text

    thumbnail-image

    Available Versions