16,387 research outputs found

    Computing Bi-Lipschitz Outlier Embeddings into the Line

    Get PDF
    The problem of computing a bi-Lipschitz embedding of a graphical metric into the line with minimum distortion has received a lot of attention. The best-known approximation algorithm computes an embedding with distortion O(c2)O(c^2), where cc denotes the optimal distortion [B\u{a}doiu \etal~2005]. We present a bi-criteria approximation algorithm that extends the above results to the setting of \emph{outliers}. Specifically, we say that a metric space (X,ρ)(X,\rho) admits a (k,c)(k,c)-embedding if there exists KβŠ‚XK\subset X, with ∣K∣=k|K|=k, such that (Xβˆ–K,ρ)(X\setminus K, \rho) admits an embedding into the line with distortion at most cc. Given kβ‰₯0k\geq 0, and a metric space that admits a (k,c)(k,c)-embedding, for some cβ‰₯1c\geq 1, our algorithm computes a (poly(k,c,log⁑n),poly(c))({\mathsf p}{\mathsf o}{\mathsf l}{\mathsf y}(k, c, \log n), {\mathsf p}{\mathsf o}{\mathsf l}{\mathsf y}(c))-embedding in polynomial time. This is the first algorithmic result for outlier bi-Lipschitz embeddings. Prior to our work, comparable outlier embeddings where known only for the case of additive distortion

    Composition of nested embeddings with an application to outlier removal

    Full text link
    We study the design of embeddings into Euclidean space with outliers. Given a metric space (X,d)(X,d) and an integer kk, the goal is to embed all but kk points in XX (called the "outliers") into β„“2\ell_2 with the smallest possible distortion cc. Finding the optimal distortion cc for a given outlier set size kk, or alternately the smallest kk for a given target distortion cc are both NP-hard problems. In fact, it is UGC-hard to approximate kk to within a factor smaller than 22 even when the metric sans outliers is isometrically embeddable into β„“2\ell_2. We consider bi-criteria approximations. Our main result is a polynomial time algorithm that approximates the outlier set size to within an O(log⁑4k)O(\log^4 k) factor and the distortion to within a constant factor. The main technical component in our result is an approach for constructing a composition of two given embeddings from subsets of XX into β„“2\ell_2 which inherits the distortions of each to within small multiplicative factors. Specifically, given a low cSc_S distortion embedding from SβŠ‚XS\subset X into β„“2\ell_2 and a high(er) cXc_X distortion embedding from the entire set XX into β„“2\ell_2, we construct a single embedding that achieves the same distortion cSc_S over pairs of points in SS and an expansion of at most O(log⁑k)β‹…cXO(\log k)\cdot c_X over the remaining pairs of points, where k=∣Xβˆ–S∣k=|X\setminus S|. Our composition theorem extends to embeddings into arbitrary β„“p\ell_p metrics for pβ‰₯1p\ge 1, and may be of independent interest. While unions of embeddings over disjoint sets have been studied previously, to our knowledge, this is the first work to consider compositions of nested embeddings.Comment: 25 pages (including 2 appendices), 5 figure
    • …
    corecore