6,325 research outputs found
Dependent randomized rounding for clustering and partition systems with knapsack constraints
Clustering problems are fundamental to unsupervised learning. There is an
increased emphasis on fairness in machine learning and AI; one representative
notion of fairness is that no single demographic group should be
over-represented among the cluster-centers. This, and much more general
clustering problems, can be formulated with "knapsack" and "partition"
constraints. We develop new randomized algorithms targeting such problems, and
study two in particular: multi-knapsack median and multi-knapsack center. Our
rounding algorithms give new approximation and pseudo-approximation algorithms
for these problems. One key technical tool, which may be of independent
interest, is a new tail bound analogous to Feige (2006) for sums of random
variables with unbounded variances. Such bounds are very useful in inferring
properties of large networks using few samples
An Improved Algorithm for Fixed-Hub Single Allocation Problem
This paper discusses the fixed-hub single allocation problem (FHSAP). In this
problem, a network consists of hub nodes and terminal nodes. Hubs are fixed and
fully connected; each terminal node is connected to a single hub which routes
all its traffic. The goal is to minimize the cost of routing the traffic in the
network. In this paper, we propose a linear programming (LP)-based rounding
algorithm. The algorithm is based on two ideas. First, we modify the LP
relaxation formulation introduced in Ernst and Krishnamoorthy (1996, 1999) by
incorporating a set of validity constraints. Then, after obtaining a fractional
solution to the LP relaxation, we make use of a geometric rounding algorithm to
obtain an integral solution. We show that by incorporating the validity
constraints, the strengthened LP often provides much tighter upper bounds than
the previous methods with a little more computational effort, and the solution
obtained often has a much smaller gap with the optimal solution. We also
formulate a robust version of the FHSAP and show that it can guard against data
uncertainty with little cost
Centrality of Trees for Capacitated k-Center
There is a large discrepancy in our understanding of uncapacitated and
capacitated versions of network location problems. This is perhaps best
illustrated by the classical k-center problem: there is a simple tight
2-approximation algorithm for the uncapacitated version whereas the first
constant factor approximation algorithm for the general version with capacities
was only recently obtained by using an intricate rounding algorithm that
achieves an approximation guarantee in the hundreds.
Our paper aims to bridge this discrepancy. For the capacitated k-center
problem, we give a simple algorithm with a clean analysis that allows us to
prove an approximation guarantee of 9. It uses the standard LP relaxation and
comes close to settling the integrality gap (after necessary preprocessing),
which is narrowed down to either 7, 8 or 9. The algorithm proceeds by first
reducing to special tree instances, and then solves such instances optimally.
Our concept of tree instances is quite versatile, and applies to natural
variants of the capacitated k-center problem for which we also obtain improved
algorithms. Finally, we give evidence to show that more powerful preprocessing
could lead to better algorithms, by giving an approximation algorithm that
beats the integrality gap for instances where all non-zero capacities are
uniform.Comment: 21 pages, 2 figure
Reallocating Multiple Facilities on the Line
We study the multistage -facility reallocation problem on the real line,
where we maintain facility locations over stages, based on the
stage-dependent locations of agents. Each agent is connected to the nearest
facility at each stage, and the facilities may move from one stage to another,
to accommodate different agent locations. The objective is to minimize the
connection cost of the agents plus the total moving cost of the facilities,
over all stages. -facility reallocation was introduced by de Keijzer and
Wojtczak, where they mostly focused on the special case of a single facility.
Using an LP-based approach, we present a polynomial time algorithm that
computes the optimal solution for any number of facilities. We also consider
online -facility reallocation, where the algorithm becomes aware of agent
locations in a stage-by-stage fashion. By exploiting an interesting connection
to the classical -server problem, we present a constant-competitive
algorithm for facilities
Constant Approximation for -Median and -Means with Outliers via Iterative Rounding
In this paper, we present a new iterative rounding framework for many
clustering problems. Using this, we obtain an -approximation algorithm for -median with outliers, greatly
improving upon the large implicit constant approximation ratio of Chen [Chen,
SODA 2018]. For -means with outliers, we give an -approximation, which is the first -approximation for
this problem. The iterative algorithm framework is very versatile; we show how
it can be used to give - and -approximation
algorithms for matroid and knapsack median problems respectively, improving
upon the previous best approximations ratios of [Swamy, ACM Trans.
Algorithms] and [Byrka et al, ESA 2015].
The natural LP relaxation for the -median/-means with outliers problem
has an unbounded integrality gap. In spite of this negative result, our
iterative rounding framework shows that we can round an LP solution to an
almost-integral solution of small cost, in which we have at most two
fractionally open facilities. Thus, the LP integrality gap arises due to the
gap between almost-integral and fully-integral solutions. Then, using a
pre-processing procedure, we show how to convert an almost-integral solution to
a fully-integral solution losing only a constant-factor in the approximation
ratio. By further using a sparsification technique, the additive factor loss
incurred by the conversion can be reduced to any
Lotsize optimization leading to a -median problem with cardinalities
We consider the problem of approximating the branch and size dependent demand
of a fashion discounter with many branches by a distributing process being
based on the branch delivery restricted to integral multiples of lots from a
small set of available lot-types. We propose a formalized model which arises
from a practical cooperation with an industry partner. Besides an integer
linear programming formulation and a primal heuristic for this problem we also
consider a more abstract version which we relate to several other classical
optimization problems like the p-median problem, the facility location problem
or the matching problem.Comment: 14 page
- …