Search CORE

9,118 research outputs found

Approximation algorithms for clustering and facility location problems

Author: Gupta Shalmoli
Publication venue
Publication date: 01/12/2018
Field of study

In this thesis we design and analyze algorithms for various facility location and clustering problems. The problems we study are NP-Hard and therefore, assuming P is not equal NP, there do not exist polynomial time algorithms to solve them optimally. One approach to cope with the intractability of these problems is to design approximation algorithms which run in polynomial-time and output a near-optimal solution for all instances of the problem. However these algorithms do not always work well in practice. Often heuristics with no explicit approximation guarantee perform quite well. To bridge this gap between theory and practice, and to design algorithms that are tuned for instances arising in practice, there is an increasing emphasis on beyond worst-case analysis. In this thesis we consider both these approaches. In the first part we design worst case approximation algorithms for Uniform Submodular Facility Location (USFL), and Capacitated k-center (CapKCenter) problems. USFL is a generalization of the well-known Uncapacitated Facility Location problem. In USFL the cost of opening a facility is a submodular function of the clients assigned to it (the function is identical for all facilities). We show that a natural greedy algorithm (which gives constant factor approximation for Uncapacitated Facility Location and other facility location problems) has a lower bound of log(n), where n is the number of clients. We present an O(log^2 k) approximation algorithm where k is the number of facilities. The algorithm is based on rounding a convex relaxation. We further consider several special cases of the problem and give improved approximation bounds for them. The CapKCenter problem is an extension of the well-known k-center problem: each facility has a maximum capacity on the number of clients that can be assigned to it. We obtain a 9-approximation for this problem via a linear programming (LP) rounding procedure. Our result, combined with previously known lower bounds, almost settles the integrality gap for a natural LP relaxation. In the second part we consider several well-known clustering problems like k-center, k-median, k-means and their corresponding outlier variants. We use beyond worst-case analysis due to the practical relevance of these problems. In particular we show that when the input instances are 2-perturbation resilient (i.e. the optimal solution does not change when the distances change by a multiplicative factor of 2), the LP integrality gap for k-center (and also asymmetric k-center) is 1. We further introduce a model of perturbation resilience for clustering with outliers. Under this new model, we show that previous results (including our LP integrality result) known for clustering under perturbation resilience also extend for clustering with outliers. This leads to a dynamic programming based heuristic for k-means with outliers (k-means-outlier) which gives an optimal solution when the instance is 2-perturbation resilient. We propose two more algorithms for k-means-outlier — a sampling based algorithm which gives an O(1) approximation when the optimal clusters are not “too small”, and an LP rounding algorithm which gives an O(1) approximation at the expense of violating the number of clusters and outliers by a small constant. We empirically study our proposed algorithms on several clustering datasets

Illinois Digital Environment for Access to Learning and Scholarship Repository

Constant Factor Approximation for Capacitated k-Center with Outliers

Author: Cygan Marek
Kociumaka Tomasz
Publication venue
Publication date: 01/01/2014
Field of study

The

k

-center problem is a classic facility location problem, where given an edge-weighted graph

G = (V,E)

one is to find a subset of

k

vertices

S

, such that each vertex in

V

is "close" to some vertex in

S

. The approximation status of this basic problem is well understood, as a simple 2-approximation algorithm is known to be tight. Consequently different extensions were studied. In the capacitated version of the problem each vertex is assigned a capacity, which is a strict upper bound on the number of clients a facility can serve, when located at this vertex. A constant factor approximation for the capacitated

k

-center was obtained last year by Cygan, Hajiaghayi and Khuller [FOCS'12], which was recently improved to a 9-approximation by An, Bhaskara and Svensson [arXiv'13]. In a different generalization of the problem some clients (denoted as outliers) may be disregarded. Here we are additionally given an integer

p

and the goal is to serve exactly

p

clients, which the algorithm is free to choose. In 2001 Charikar et al. [SODA'01] presented a 3-approximation for the

k

-center problem with outliers. In this paper we consider a common generalization of the two extensions previously studied separately, i.e. we work with the capacitated

k

-center with outliers. We present the first constant factor approximation algorithm with approximation ratio of 25 even for the case of non-uniform hard capacities.Comment: 15 pages, 3 figures, accepted to STACS 201

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

The Non-Uniform k-Center Problem

Author: Chakrabarty Deeparnab
Goyal Prachi
Krishnaswamy Ravishankar
Publication venue
Publication date: 01/01/2016
Field of study

In this paper, we introduce and study the Non-Uniform k-Center problem (NUkC). Given a finite metric space

(X,d)

and a collection of balls of radii

\{r_1\geq \cdots \ge r_k\}

, the NUkC problem is to find a placement of their centers on the metric space and find the minimum dilation

\alpha

, such that the union of balls of radius

\alpha\cdot r_i

around the

i

th center covers all the points in

X

. This problem naturally arises as a min-max vehicle routing problem with fleets of different speeds. The NUkC problem generalizes the classic

k

-center problem when all the

k

radii are the same (which can be assumed to be

1

after scaling). It also generalizes the

k

-center with outliers (kCwO) problem when there are

k

balls of radius

1

and

\ell

balls of radius

0

. There are

2

-approximation and

3

-approximation algorithms known for these problems respectively; the former is best possible unless P=NP and the latter remains unimproved for 15 years. We first observe that no

O(1)

-approximation is to the optimal dilation is possible unless P=NP, implying that the NUkC problem is more non-trivial than the above two problems. Our main algorithmic result is an

(O(1),O(1))

-bi-criteria approximation result: we give an

O(1)

-approximation to the optimal dilation, however, we may open

\Theta(1)

centers of each radii. Our techniques also allow us to prove a simple (uni-criteria), optimal

2

-approximation to the kCwO problem improving upon the long-standing

3

-factor. Our main technical contribution is a connection between the NUkC problem and the so-called firefighter problems on trees which have been studied recently in the TCS community.Comment: Adjusted the figur

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

In-Network Outlier Detection in Wireless Sensor Networks

Author: A Beck
A Cerpa
Boleslaw Szymanski
Chris Giannella
D Apiletti
D Krivitski
G Tietjen
H Fan
Hillol Kargupta
IF Akyildiz
IF Akyildiz
Joel W. Branch
K Bhaduri
K Das
K Holger
L Chen
M Bawa
M Mehyar
M Otey
P Gupta
R Wolff
R Wolff
Ran Wolff
S Basu
S Chong
S Mukherjee
V Barnett
V Hodge
W Mebane
X Sheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/09/2009
Field of study

To address the problem of unsupervised outlier detection in wireless sensor networks, we develop an approach that (1) is flexible with respect to the outlier definition, (2) computes the result in-network to reduce both bandwidth and energy usage,(3) only uses single hop communication thus permitting very simple node failure detection and message reliability assurance mechanisms (e.g., carrier-sense), and (4) seamlessly accommodates dynamic updates to data. We examine performance using simulation with real sensor data streams. Our results demonstrate that our approach is accurate and imposes a reasonable communication load and level of power consumption.Comment: Extended version of a paper appearing in the Int'l Conference on Distributed Computing Systems 200

arXiv.org e-Print Archive

Crossref

Matroid and Knapsack Center Problems

Author: A. Schrijver
D. Hochbaum
D.Z. Chen
F. Grandoni
G.N. Frederickson
J. Chuzhoy
J. Edmonds
J. Li
M. Charikar
M. Hajiaghayi
R. Cole
R. Matthew McCutchen
S. Chechik
S. Khuller
S. Khuller
T. Gonzalez
Publication venue
Publication date: 01/01/2013
Field of study

In the classic

k

-center problem, we are given a metric graph, and the objective is to open

k

nodes as centers such that the maximum distance from any vertex to its closest center is minimized. In this paper, we consider two important generalizations of

k

-center, the matroid center problem and the knapsack center problem. Both problems are motivated by recent content distribution network applications. Our contributions can be summarized as follows: 1. We consider the matroid center problem in which the centers are required to form an independent set of a given matroid. We show this problem is NP-hard even on a line. We present a 3-approximation algorithm for the problem on general metrics. We also consider the outlier version of the problem where a given number of vertices can be excluded as the outliers from the solution. We present a 7-approximation for the outlier version. 2. We consider the (multi-)knapsack center problem in which the centers are required to satisfy one (or more) knapsack constraint(s). It is known that the knapsack center problem with a single knapsack constraint admits a 3-approximation. However, when there are at least two knapsack constraints, we show this problem is not approximable at all. To complement the hardness result, we present a polynomial time algorithm that gives a 3-approximate solution such that one knapsack constraint is satisfied and the others may be violated by at most a factor of

1+\epsilon

. We also obtain a 3-approximation for the outlier version that may violate the knapsack constraint by

1+\epsilon

.Comment: A preliminary version of this paper is accepted to IPCO 201

arXiv.org e-Print Archive

CiteSeerX

Crossref