5 research outputs found

Tight Analysis of a Multiple-Swap Heuristic for Budgeted Red-Blue Median

Author: Friggstad Zachary
Zhang Yifeng
Publication date: 01/01/2016

Budgeted Red-Blue Median is a generalization of classic $k$-Median in that there are two sets of facilities, say $\mathcal{R}$ and $\mathcal{B}$, that can be used to serve clients located in some metric space. The goal is to open $k_r$ facilities in $\mathcal{R}$ and $k_b$ facilities in $\mathcal{B}$ for some given bounds $k_r, k_b$ and connect each client to their nearest open facility in a way that minimizes the total connection cost. We extend work by Hajiaghayi, Khandekar, and Kortsarz [2012] and show that a multiple-swap local search heuristic can be used to obtain a $(5+\epsilon)$-approximation for Budgeted Red-Blue Median for any constant $\epsilon > 0$. This is an improvement over their single swap analysis and beats the previous best approximation guarantee of 8 by Swamy [2014]. We also present a matching lower bound showing that for every $p \geq 1$, there are instances of Budgeted Red-Blue Median with local optimum solutions for the $p$-swap heuristic whose cost is $5 + \Omega\left(\frac{1}{p}\right)$ times the optimum solution cost. Thus, our analysis is tight up to the lower order terms. In particular, for any $\epsilon > 0$ we show the single-swap heuristic admits local optima whose cost can be as bad as $7-\epsilon$ times the optimum solution cost.
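As a rough illustration of the swap-based local search this abstract refers to, the Python sketch below runs a single-swap variant on a tiny instance. The function names, the initial solution, the best-improvement rule, and the neighbourhood (exchange one open facility for a closed one of the same colour) are all assumptions made for illustration. The abstract does not specify them, so this should not be read as the procedure the authors analyze.

```python
def connection_cost(clients, open_facilities, dist):
    """Sum over clients of the distance to the nearest open facility."""
    return sum(min(dist(c, f) for f in open_facilities) for c in clients)


def swap_local_search(clients, red, blue, k_r, k_b, dist):
    """Hypothetical single-swap local search for Budgeted Red-Blue Median.

    Assumes red and blue are disjoint lists, k_r <= len(red),
    k_b <= len(blue), and k_r + k_b >= 1.
    """
    # Arbitrary feasible start: the first k_r red and first k_b blue facilities.
    open_red, open_blue = set(red[:k_r]), set(blue[:k_b])
    cost = connection_cost(clients, open_red | open_blue, dist)
    while True:
        move = None
        for opened, pool in ((open_red, red), (open_blue, blue)):
            other = open_blue if opened is open_red else open_red
            for out_f in opened:
                for in_f in pool:
                    if in_f in opened:
                        continue
                    # Same-colour swap keeps both budgets k_r and k_b satisfied.
                    trial = (opened - {out_f}) | {in_f} | other
                    trial_cost = connection_cost(clients, trial, dist)
                    if trial_cost < cost:
                        cost, move = trial_cost, (opened, out_f, in_f)
        if move is None:
            # No improving swap exists: the solution is a local optimum.
            return open_red, open_blue, cost
        opened, out_f, in_f = move
        opened.remove(out_f)
        opened.add(in_f)
```

A possible call on points of the real line, with absolute difference as the metric, is `swap_local_search([0, 1, 10, 11], [5.0, 0.5], [6.0, 10.5], 1, 1, lambda a, b: abs(a - b))`. A $p$-swap version would enlarge the neighbourhood to exchanges of several facilities at once, at a correspondingly higher cost per iteration.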

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Constant Approximation for $k$-Median and $k$-Means with Outliers via Iterative Rounding

Author: Arthur David
Charikar M.
Chawla Sanjay
Chen Ke
Cohen-Addad Vincent
Guha Sudipto
Korupolu Madhukar R.
Ott Lionel
Shi Li. A
Publication date: 06/04/2018

In this paper, we present a new iterative rounding framework for many clustering problems. Using this, we obtain an $(\alpha_1 + \epsilon \leq 7.081 + \epsilon)$-approximation algorithm for $k$-median with outliers, greatly improving upon the large implicit constant approximation ratio of Chen [Chen, SODA 2018]. For $k$-means with outliers, we give an $(\alpha_2+\epsilon \leq 53.002 + \epsilon)$-approximation, which is the first $O(1)$-approximation for this problem. The iterative algorithm framework is very versatile; we show how it can be used to give $\alpha_1$- and $(\alpha_1 + \epsilon)$-approximation algorithms for matroid and knapsack median problems respectively, improving upon the previous best approximation ratios of $8$ [Swamy, ACM Trans. Algorithms] and $17.46$ [Byrka et al, ESA 2015]. The natural LP relaxation for the $k$-median/$k$-means with outliers problem has an unbounded integrality gap. In spite of this negative result, our iterative rounding framework shows that we can round an LP solution to an almost-integral solution of small cost, in which we have at most two fractionally open facilities. Thus, the LP integrality gap arises due to the gap between almost-integral and fully-integral solutions. Then, using a pre-processing procedure, we show how to convert an almost-integral solution to a fully-integral solution losing only a constant factor in the approximation ratio. By further using a sparsification technique, the additive factor loss incurred by the conversion can be reduced to any $\epsilon > 0$.

arXiv.org e-Print Archive

Crossref

Diversity-aware $k$-median: Clustering with fair center representation

Author: Gionis Aristides
Ordozgoiti Bruno
Thejaswi Suhas
Publication date: 22/06/2021

We introduce a novel problem for diversity-aware clustering. We assume that the potential cluster centers belong to a set of groups defined by protected attributes, such as ethnicity, gender, etc. We then ask to find a minimum-cost clustering of the data into $k$ clusters so that a specified minimum number of cluster centers are chosen from each group. We thus require that all groups are represented in the clustering solution as cluster centers, according to specified requirements. More precisely, we are given a set of clients $C$, a set of facilities $\pazocal{F}$, a collection $\mathcal{F}=\{F_1,\dots,F_t\}$ of facility groups $F_i \subseteq \pazocal{F}$, budget $k$, and a set of lower-bound thresholds $R=\{r_1,\dots,r_t\}$, one for each group in $\mathcal{F}$. The diversity-aware $k$-median problem asks to find a set $S$ of $k$ facilities in $\pazocal{F}$ such that $|S \cap F_i| \geq r_i$, that is, at least $r_i$ centers in $S$ are from group $F_i$, and the $k$-median cost $\sum_{c \in C} \min_{s \in S} d(c,s)$ is minimized. We show that in the general case where the facility groups may overlap, the diversity-aware $k$-median problem is NP-hard, fixed-parameter intractable, and inapproximable to any multiplicative factor. On the other hand, when the facility groups are disjoint, approximation algorithms can be obtained by reduction to the matroid median and red-blue median problems. Experimentally, we evaluate our approximation methods for the tractable cases, and present a relaxation-based heuristic for the theoretically intractable case, which can provide high-quality and efficient solutions for real-world datasets. Comment: To appear in ECML-PKDD 202
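To make the problem definition in this abstract concrete, the following Python sketch enumerates every set $S$ of $k$ facilities, keeps those satisfying $|S \cap F_i| \geq r_i$ for each group, and returns the one with the smallest $k$-median cost. The function names and the exhaustive search are assumptions chosen only to mirror the definition. This is not one of the approximation methods or heuristics evaluated in the paper, and it takes exponential time, so it is usable only on toy instances.

```python
from itertools import combinations


def kmedian_cost(clients, S, d):
    """k-median cost: sum over clients c of min over s in S of d(c, s)."""
    return sum(min(d(c, s) for s in S) for c in clients)


def diversity_aware_kmedian_bruteforce(clients, facilities, groups, thresholds, k, d):
    """Exhaustive search over k-subsets of facilities (illustration only).

    groups is a list of sets F_i and thresholds the matching list of r_i.
    Returns (best_S, best_cost), or (None, inf) if no subset is feasible.
    """
    best_S, best_cost = None, float("inf")
    for candidate in combinations(facilities, k):
        S = set(candidate)
        # Feasibility: at least r_i chosen centers must come from group F_i.
        if all(len(S & F_i) >= r_i for F_i, r_i in zip(groups, thresholds)):
            cost = kmedian_cost(clients, S, d)
            if cost < best_cost:
                best_S, best_cost = S, cost
    return best_S, best_cost
```

On a hypothetical instance with clients `[0, 1, 10]`, facilities `[0, 1, 10, 20]`, groups `[{0, 1}, {20}]`, thresholds `[1, 1]`, `k = 2` and absolute difference as the distance, the representation constraint forces facility `20` into the solution even though it serves no client well, which shows how the lower bounds can raise the clustering cost.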

arXiv.org e-Print Archive

Local search heuristics for the mobile facility location problem

Author: Mustafa Sahin
Russell Halper
S Raghavan
Publication date: 23/04/2020

In the mobile facility location problem (MFLP), one seeks to relocate (or move) a set of existing facilities and assign clients to these facilities so that the sum of facility movement costs and the client travel costs (each to its assigned facility) is minimized. This paper studies formulations and develops local search heuristics for the MFLP. First, we develop an integer programming (IP) formulation for the MFLP by observing that for a given set of facility destinations the problem may be decomposed into two polynomially solvable subproblems. This IP formulation is quite compact in terms of the number of nonzero coefficients in the constraint matrix and the number of integer variables, and allows for the solution of large-scale MFLP instances. Using the decomposition observation, we propose two local search neighborhoods for the MFLP. We report on extensive computational tests of the new IP formulation and local search heuristics on a large range of instances. These tests demonstrate that the proposed formulation and local search heuristics significantly outperform the existing formulation and a previously developed local search heuristic for the problem.

CiteSeerX

Large-scale optimization for data placement problem

Author: Ansari Lazima
University of Lethbridge. Faculty of Arts and Science
Publication venue: 'University of Central Missouri, Department of Mathematics and Computer Science'
Publication date: 01/01/2017

Large-scale optimization of combinatorial problems is one of the most challenging areas. These problems are characterized by large sets of data (variables and constraints). In this thesis, we study large-scale optimization of the data placement problem with zero storage cost. The goal in the data placement problem is to find the placement of data objects in a set of fixed-capacity caches in a network to optimize the latency of access. The data placement problem arises naturally in the design of content distribution networks. We report on an empirical study of the upper bound and the lower bound of this problem for large-sized instances. We also study a semi-Lagrangean relaxation of a closely related $k$-median problem. In this thesis, we study the theory and practice of approximation algorithms for the data placement problem and the $k$-median problem.

OPUS: Open Uleth Scholarship - University of Lethbridge Research Repository