
92 research outputs found

The reverse greedy algorithm for the metric k-median problem

Author: Charikar
Claire Kenyon
Jain
Marek Chrobak
Mettu
Neal Young
Publication venue: 'Elsevier BV'
Publication date: 01/01/2005
Field of study

The Reverse Greedy algorithm (RGreedy) for the k-median problem works as follows. It starts by placing facilities on all nodes. At each step, it removes a facility to minimize the resulting total distance from the customers to the remaining facilities. It stops when k facilities remain. We prove that, if the distance function is metric, then the approximation ratio of RGreedy is between Ω(log n / log log n) and O(log n). Comment: to appear in IPL. Preliminary version in COCOON '0
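The procedure described in this abstract can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: it assumes every node is both a customer and a candidate facility, that `dist` is a full symmetric distance matrix, and that ties are broken by the smallest node index. The names `total_cost` and `reverse_greedy` are hypothetical.

```python
from typing import Sequence, Set


def total_cost(dist: Sequence[Sequence[float]], facilities: Set[int]) -> float:
    """Sum, over all customers, of the distance to the nearest open facility."""
    return sum(min(dist[c][f] for f in facilities) for c in range(len(dist)))


def reverse_greedy(dist: Sequence[Sequence[float]], k: int) -> Set[int]:
    """Start with a facility on every node and remove one at a time,
    always removing the facility whose removal yields the lowest total cost,
    until exactly k facilities remain."""
    n = len(dist)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= number of nodes")
    facilities = set(range(n))
    while len(facilities) > k:
        # Assumption: ties are broken by the smallest node index.
        victim = min(
            sorted(facilities),
            key=lambda f: total_cost(dist, facilities - {f}),
        )
        facilities.remove(victim)
    return facilities


if __name__ == "__main__":
    # Four points on a line at positions 0, 1, 10, 11.
    xs = [0, 1, 10, 11]
    dist = [[abs(a - b) for b in xs] for a in xs]
    chosen = reverse_greedy(dist, k=2)
    print(sorted(chosen), total_cost(dist, chosen))
```

Each removal step re-evaluates the full cost for every candidate, so this sketch favors clarity over speed.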

arXiv.org e-Print Archive

CiteSeerX

Crossref

eScholarship - University of California

Center-based Clustering under Perturbation Stability

Author: Awasthi Pranjal
Blum Avrim
Sheffet Or
Publication venue
Publication date: 11/08/2011
Field of study

Clustering under most popular objective functions is NP-hard, even to approximate well, and so unlikely to be efficiently solvable in the worst case. Recently, Bilu and Linial [Bilu09] suggested an approach aimed at bypassing this computational barrier by using properties of instances one might hope to hold in practice. In particular, they argue that instances in practice should be stable to small perturbations in the metric space and give an efficient algorithm for clustering instances of the Max-Cut problem that are stable to perturbations of size $O(n^{1/2})$. In addition, they conjecture that instances stable to as little as $O(1)$ perturbations should be solvable in polynomial time. In this paper we prove that this conjecture is true for any center-based clustering objective (such as $k$-median, $k$-means, and $k$-center). Specifically, we show we can efficiently find the optimal clustering assuming only stability to factor-3 perturbations of the underlying metric in spaces without Steiner points, and stability to factor $2+\sqrt{3}$ perturbations for general metrics. In particular, we show for such instances that the popular Single-Linkage algorithm combined with dynamic programming will find the optimal clustering. We also present NP-hardness results under a weaker but related condition.

arXiv.org e-Print Archive

Fault Tolerant Clustering Revisited

Author: Kumar Nirman
Raichel Benjamin
Publication venue
Publication date: 01/01/2013
Field of study

In discrete k-center and k-median clustering, we are given a set of points $P$ in a metric space $M$, and the task is to output a set $C \subseteq P$, $|C| = k$, such that the cost of clustering $P$ using $C$ is as small as possible. For k-center, the cost is the furthest a point has to travel to its nearest center, whereas for k-median, the cost is the sum of all point-to-nearest-center distances. In the fault-tolerant versions of these problems, we are given an additional parameter $1 \leq \ell \leq k$, such that when computing the cost of clustering, points are assigned to their $\ell$-th nearest neighbor in $C$, instead of their nearest neighbor. We provide constant-factor approximation algorithms for these problems that are both conceptually simple and highly practical from an implementation standpoint.
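The two fault-tolerant objectives defined in this abstract can be evaluated directly for a given center set. The sketch below only computes the costs and is not the paper's approximation algorithm. It assumes `dist` is a full distance matrix indexed by point, and the name `fault_tolerant_costs` is hypothetical.

```python
from typing import Sequence, Tuple


def fault_tolerant_costs(
    dist: Sequence[Sequence[float]],
    centers: Sequence[int],
    ell: int,
) -> Tuple[float, float]:
    """Return (k-center cost, k-median cost) when every point is charged
    the distance to its ell-th nearest center instead of its nearest one."""
    if not 1 <= ell <= len(centers):
        raise ValueError("ell must satisfy 1 <= ell <= number of centers")
    charged = [
        sorted(dist[p][c] for c in centers)[ell - 1]  # ell-th smallest distance
        for p in range(len(dist))
    ]
    return max(charged), sum(charged)


if __name__ == "__main__":
    # Four points on a line at positions 0, 1, 10, 11; centers at 0 and 10.
    xs = [0, 1, 10, 11]
    dist = [[abs(a - b) for b in xs] for a in xs]
    print(fault_tolerant_costs(dist, centers=[0, 2], ell=1))
    print(fault_tolerant_costs(dist, centers=[0, 2], ell=2))
```

Setting `ell = 1` recovers the ordinary k-center and k-median costs.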

arXiv.org e-Print Archive

University of Memphis Digital Commons

Robust Fault Tolerant uncapacitated facility location

Author: Chechik Shiri
Peleg David
Publication venue
Publication date: 01/01/2010
Field of study

In the uncapacitated facility location problem, given a graph, a set of demands and opening costs, it is required to find a set of facilities $R$, so as to minimize the sum of the cost of opening the facilities in $R$ and the cost of assigning all node demands to open facilities. This paper concerns the robust fault-tolerant version of the uncapacitated facility location problem (RFTFL). In this problem, one or more facilities might fail, and each demand should be supplied by the closest open facility that did not fail. It is required to find a set of facilities $R$, so as to minimize the sum of the cost of opening the facilities in $R$ and the cost of assigning all node demands to open facilities that did not fail, after the failure of up to $\alpha$ facilities. We present a polynomial time algorithm that yields a 6.5-approximation for this problem with at most one failure and a $(1.5 + 7.5\alpha)$-approximation for the problem with at most $\alpha > 1$ failures. We also show that the RFTFL problem is NP-hard even on trees, and even in the case of a single failure.
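To make the objective concrete, the following sketch evaluates one facility set by brute force. It rests on assumptions the abstract does not spell out: the failure set is taken to be the worst case over all subsets of at most `alpha` facilities, opening costs are paid for every facility in R whether or not it fails, and assignment cost is demand times distance. The name `rftfl_cost` is hypothetical, and this is an evaluator only, not the paper's approximation algorithm.

```python
from itertools import combinations
from typing import Sequence


def rftfl_cost(
    dist: Sequence[Sequence[float]],
    demand: Sequence[float],
    open_cost: Sequence[float],
    facilities: Sequence[int],
    alpha: int,
) -> float:
    """Opening cost of `facilities` plus the worst-case assignment cost
    after any set of at most `alpha` of them fails (assumed reading)."""
    chosen = sorted(set(facilities))
    if len(chosen) <= alpha:
        raise ValueError("need more than alpha facilities so one always survives")
    opening = sum(open_cost[f] for f in chosen)
    worst = 0
    for size in range(alpha + 1):
        for failed in combinations(chosen, size):
            alive = [f for f in chosen if f not in failed]
            # Each node is served by its closest surviving facility.
            assign = sum(
                demand[v] * min(dist[v][f] for f in alive)
                for v in range(len(dist))
            )
            worst = max(worst, assign)
    return opening + worst


if __name__ == "__main__":
    xs = [0, 1, 10, 11]
    dist = [[abs(a - b) for b in xs] for a in xs]
    print(rftfl_cost(dist, [1, 1, 1, 1], [2, 2, 2, 2], [0, 2], alpha=1))
```

The enumeration is exponential in `alpha`, so it is only suitable for checking small instances.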

arXiv.org e-Print Archive

CiteSeerX

Dagstuhl Research Online Publication Server

Incremental Medians via Online Bidding

Author: B. Kalyanasundaram
C. Chekuri
Claire Kenyon
E. Koutsoupias
G. Lin
J.-H. Lin
John Noga
K. Jain
M. Charikar
M. Chrobak
M. Goemans
M.-Y. Kao
M.R. Korupolu
Marek Chrobak
N.E. Young
Neal E. Young
R. Fagin
R. Motwani
R.R. Mettu
S. Dasgupta
V. Arya
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/05/2020
Field of study

In the k-median problem we are given sets of facilities and customers, and distances between them. For a given set F of facilities, the cost of serving a customer u is the minimum distance between u and a facility in F. The goal is to find a set F of k facilities that minimizes the sum, over all customers, of their service costs. Following Mettu and Plaxton, we study the incremental medians problem, where k is not known in advance, and the algorithm produces a nested sequence of facility sets where the kth set has size k. The algorithm is c-cost-competitive if the cost of each set is at most c times the cost of the optimum set of size k. We give improved incremental algorithms for the metric version: an 8-cost-competitive deterministic algorithm, a (2e ≈ 5.44)-cost-competitive randomized algorithm, a (24+ε)-cost-competitive, poly-time deterministic algorithm, and a (6e+ε ≈ 16.31)-cost-competitive, poly-time randomized algorithm. The algorithm is s-size-competitive if the cost of the kth set is at most the minimum cost of any set of size k, and has size at most sk. The optimal size-competitive ratios for this problem are 4 (deterministic) and e (randomized). We present the first poly-time O(log m)-size-approximation algorithm for the offline problem and the first poly-time O(log m)-size-competitive algorithm for the incremental problem. Our proofs reduce incremental medians to the following online bidding problem: faced with an unknown threshold T, an algorithm submits "bids" until it submits a bid that is at least the threshold. It pays the sum of all its bids. We prove that folklore algorithms for online bidding are optimally competitive. Comment: conference version appeared in LATIN 2006 as "Oblivious Medians via Online Bidding".
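The online bidding problem at the end of this abstract has a well-known deterministic doubling strategy, sketched below as an illustration. The abstract does not describe the folklore algorithms it analyzes, so treating doubling as one of them is an assumption. The sketch also assumes the threshold is at least 1 and that bids start at 1. The name `doubling_bids` is hypothetical.

```python
from typing import List


def doubling_bids(threshold: float) -> List[int]:
    """Submit bids 1, 2, 4, ... until a bid is at least `threshold`.
    Returns the list of bids submitted; the algorithm pays their sum."""
    if threshold < 1:
        raise ValueError("this sketch assumes threshold >= 1")
    bids = []
    bid = 1
    while True:
        bids.append(bid)
        if bid >= threshold:
            return bids
        bid *= 2


if __name__ == "__main__":
    for t in (1, 5, 100):
        bids = doubling_bids(t)
        # The total paid stays below 4 times the threshold.
        print(t, bids, sum(bids), sum(bids) / t)
```

If the last bid is 2^i, the total paid is 2^(i+1) - 1, and the previous bid 2^(i-1) was below T, so the total is less than 4T. That is consistent with the deterministic ratio of 4 quoted in the abstract.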

arXiv.org e-Print Archive

Crossref

The Hardness of Approximation of Euclidean k-means

Author: Awasthi Pranjal
Charikar Moses
Krishnaswamy Ravishankar
Sinop Ali Kemal
Publication venue
Publication date: 01/01/2015
Field of study

The Euclidean $k$-means problem is a classical problem that has been extensively studied in the theoretical computer science, machine learning and the computational geometry communities. In this problem, we are given a set of $n$ points in Euclidean space $R^d$, and the goal is to choose $k$ centers in $R^d$ so that the sum of squared distances of each point to its nearest center is minimized. The best approximation algorithms for this problem include a polynomial time constant factor approximation for general $k$ and a $(1+\epsilon)$-approximation which runs in time $poly(n) 2^{O(k/\epsilon)}$. At the other extreme, the only known computational complexity result for this problem is NP-hardness [ADHP'09]. The main difficulty in obtaining hardness results stems from the Euclidean nature of the problem, and the fact that any point in $R^d$ can be a potential center. This gap in understanding left open the intriguing possibility that the problem might admit a PTAS for all $k,d$. In this paper we provide the first hardness of approximation for the Euclidean $k$-means problem. Concretely, we show that there exists a constant $\epsilon > 0$ such that it is NP-hard to approximate the $k$-means objective to within a factor of $(1+\epsilon)$. We show this via an efficient reduction from the vertex cover problem on triangle-free graphs: given a triangle-free graph, the goal is to choose the fewest number of vertices which are incident on all the edges. Additionally, we give a proof that the current best hardness results for vertex cover can be carried over to triangle-free graphs. To show this we transform $G$, a known hard vertex cover instance, by taking a graph product with a suitably chosen graph $H$, and showing that the size of the (normalized) maximum independent set is almost exactly preserved in the product graph using a spectral analysis, which might be of independent interest.
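The objective defined in this abstract (sum of squared distances from each point to its nearest center) can be written out directly. The sketch below only evaluates the objective for given centers and says nothing about how to find good ones. The name `kmeans_cost` is hypothetical, and points are assumed to be equal-length tuples of floats.

```python
from typing import Sequence


def kmeans_cost(
    points: Sequence[Sequence[float]],
    centers: Sequence[Sequence[float]],
) -> float:
    """Sum over all points of the squared Euclidean distance
    to the nearest center."""
    def sq_dist(p: Sequence[float], c: Sequence[float]) -> float:
        return sum((pi - ci) ** 2 for pi, ci in zip(p, c))

    return sum(min(sq_dist(p, c) for c in centers) for p in points)


if __name__ == "__main__":
    points = [(0.0, 0.0), (2.0, 0.0), (10.0, 0.0), (12.0, 0.0)]
    centers = [(1.0, 0.0), (11.0, 0.0)]  # centers need not be input points
    print(kmeans_cost(points, centers))
```

The centers in the example are not input points. The abstract notes that any point in R^d can be a center, which it identifies as a main difficulty in proving hardness.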

arXiv.org e-Print Archive

CiteSeerX

Princeton University Open Access Repository

Dagstuhl Research Online Publication Server

Performance Appraisal Research: A Critical Review of Work on “The Social Context and Politics of Appraisal”

Author: Jenkins Alan
Publication venue
Publication date
Field of study

This paper reviews existing literatures on the analysis of performance appraisal (PA), paying special attention to those which try to take into account the “social context” of appraisal systems and processes. The special place of political action within these processes is underlined, and the different levels at which politics need to be considered in research are outlined. Research on politics is considered and shown to lack an adequate consideration of the social relations involved in the reciprocal interactions between PA tools and processes and users' interpretation and manipulation of them. Keywords: Performance appraisal; Social context; Politics

Research Papers in Economics