Search CORE

4,202 research outputs found

Balanced Allocations and Double Hashing

Author: Alon N.
Dillinger P.C.
Heileman G.
Mitzenmacher M.
Mitzenmacher M.
Richa A.
Vadhan S.
Vvedenskaya N.D.
Publication venue
Publication date: 29/01/2014
Field of study

Double hashing has recently found more common usage in schemes that use multiple hash functions. In double hashing, for an item

x

, one generates two hash values

f(x)

and

g(x)

, and then uses combinations

(f(x) +k g(x)) \bmod n

for

k=0,1,2,...

to generate multiple hash values from the initial two. We first perform an empirical study showing that, surprisingly, the performance difference between double hashing and fully random hashing appears negligible in the standard balanced allocation paradigm, where each item is placed in the least loaded of

d

choices, as well as several related variants. We then provide theoretical results that explain the behavior of double hashing in this context.Comment: Further updated, small improvements/typos fixe

arXiv.org e-Print Archive

CiteSeerX

Crossref

More Analysis of Double Hashing for Balanced Allocations

Author: Mitzenmacher Michael
Publication venue
Publication date: 02/03/2015
Field of study

With double hashing, for a key

x

, one generates two hash values

f(x)

and

g(x)

, and then uses combinations

(f(x) +i g(x)) \bmod n

for

i=0,1,2,...

to generate multiple hash values in the range

[0,n-1]

from the initial two. For balanced allocations, keys are hashed into a hash table where each bucket can hold multiple keys, and each key is placed in the least loaded of

d

choices. It has been shown previously that asymptotically the performance of double hashing and fully random hashing is the same in the balanced allocation paradigm using fluid limit methods. Here we extend a coupling argument used by Lueker and Molodowitch to show that double hashing and ideal uniform hashing are asymptotically equivalent in the setting of open address hash tables to the balanced allocation setting, providing further insight into this phenomenon. We also discuss the potential for and bottlenecks limiting the use this approach for other multiple choice hashing schemes.Comment: 13 pages ; current draft ; will be submitted to conference shortl

arXiv.org e-Print Archive

Crossref

Tight Load Balancing via Randomized Local Search

Author: Berenbrink Petra
Kling Peter
Liaw Christopher
Mehrabian Abbas
Publication venue
Publication date: 29/06/2017
Field of study

We consider the following balls-into-bins process with

n

bins and

m

balls: each ball is equipped with a mutually independent exponential clock of rate 1. Whenever a ball's clock rings, the ball samples a random bin and moves there if the number of balls in the sampled bin is smaller than in its current bin. This simple process models a typical load balancing problem where users (balls) seek a selfish improvement of their assignment to resources (bins). From a game theoretic perspective, this is a randomized approach to the well-known Koutsoupias-Papadimitriou model, while it is known as randomized local search (RLS) in load balancing literature. Up to now, the best bound on the expected time to reach perfect balance was

O\left({(\ln n)}^2+\ln(n)\cdot n^2/m\right)

due to Ganesh, Lilienthal, Manjunath, Proutiere, and Simatos (Load balancing via random local search in closed and open systems, Queueing Systems, 2012). We improve this to an asymptotically tight

O\left(\ln(n)+n^2/m\right)

. Our analysis is based on the crucial observation that performing "destructive moves" (reversals of RLS moves) cannot decrease the balancing time. This allows us to simplify problem instances and to ignore "inconvenient moves" in the analysis.Comment: 24 pages, 3 figures, preliminary version appeared in proceedings of 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS'17

arXiv.org e-Print Archive

Crossref

Balanced Allocation on Graphs: A Random Walk Approach

Author: B Vöcking
C Cooper
M Mitzenmacher
N Alon
N Alon
P Berenbrink
X Dahan
Y Azar
Publication venue
Publication date: 27/02/2016
Field of study

In this paper we propose algorithms for allocating

n

sequential balls into

n

bins that are interconnected as a

d

-regular

n

-vertex graph

G

, where

d\ge3

can be any integer.Let

l

be a given positive integer. In each round

t

1\le t\le n

, ball

t

picks a node of

G

uniformly at random and performs a non-backtracking random walk of length

l

from the chosen node.Then it allocates itself on one of the visited nodes with minimum load (ties are broken uniformly at random). Suppose that

G

has a sufficiently large girth and

d=\omega(\log n)

. Then we establish an upper bound for the maximum number of balls at any bin after allocating

n

balls by the algorithm, called {\it maximum load}, in terms of

l

with high probability. We also show that the upper bound is at most an

O(\log\log n)

factor above the lower bound that is proved for the algorithm. In particular, we show that if we set

l=\lfloor(\log n)^{\frac{1+\epsilon}{2}}\rfloor

, for every constant

\epsilon\in (0, 1)

, and

G

has girth at least

\omega(l)

, then the maximum load attained by the algorithm is bounded by

O(1/\epsilon)

with high probability.Finally, we slightly modify the algorithm to have similar results for balanced allocation on

d

-regular graph with

d\in[3, O(\log n)]

and sufficiently large girth

arXiv.org e-Print Archive

Crossref

Parallel Balanced Allocations: The Heavily Loaded Case

Author: Berenbrink Petra
Bertrand Pierre
Esseen C.G.
Mitzenmacher Michael
Talwar Kunal
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2019
Field of study

We study parallel algorithms for the classical balls-into-bins problem, in which

m

balls acting in parallel as separate agents are placed into

n

bins. Algorithms operate in synchronous rounds, in each of which balls and bins exchange messages once. The goal is to minimize the maximal load over all bins using a small number of rounds and few messages. While the case of

m=n

balls has been extensively studied, little is known about the heavily loaded case. In this work, we consider parallel algorithms for this somewhat neglected regime of

m\gg n

. The naive solution of allocating each ball to a bin chosen uniformly and independently at random results in maximal load

m/n+\Theta(\sqrt{m/n\cdot \log n})

(for

m\geq n \log n

) w.h.p. In contrast, for the sequential setting Berenbrink et al (SIAM J. Comput 2006) showed that letting each ball join the least loaded bin of two randomly selected bins reduces the maximal load to

m/n+O(\log\log m)

w.h.p. To date, no parallel variant of such a result is known. We present a simple parallel threshold algorithm that obtains a maximal load of

m/n+O(1)

w.h.p. within

O(\log\log (m/n)+\log^* n)

rounds. The algorithm is symmetric (balls and bins all "look the same"), and balls send

O(1)

messages in expectation per round. The additive term of

O(\log^* n)

in the complexity is known to be tight for such algorithms (Lenzen and Wattenhofer Distributed Computing 2016). We also prove that our analysis is tight, i.e., algorithms of the type we provide must run for

\Omega(\min\{\log\log (m/n),n\})

rounds w.h.p. Finally, we give a simple asymmetric algorithm (i.e., balls are aware of a common labeling of the bins) that achieves a maximal load of

m/n + O(1)

in a constant number of rounds w.h.p. Again, balls send only a single message per round, and bins receive

(1+o(1))m/n+O(\log n)

messages w.h.p

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Unbalanced Allocations

Author: Redlich Amanda
Publication venue
Publication date: 31/12/2013
Field of study

We consider the unbalanced allocation of

m

balls into

n

bins by a randomized algorithm using the "power of two choices". For each ball, we select a set of bins at random, then place the ball in the fullest bin within the set. Applications of this generic algorithm range from cost minimization to condensed matter physics. In this paper, we analyze the distribution of the bin loads produced by this algorithm, considering, for example, largest and smallest loads, loads of subsets of the bins, and the likelihood of bins having equal loads

arXiv.org e-Print Archive

CiteSeerX

Parallel Load Balancing on Constrained Client-Server Topologies

Author: Berenbrink P.
Berenbrink Petra
Bosek B.
Godfrey P. Brighten
Karp Richard M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 27/05/2020
Field of study

We study parallel \emph{Load Balancing} protocols for a client-server distributed model defined as follows. There is a set \sC of

n

clients and a set \sS of

n

servers where each client has (at most) a constant number

d \geq 1

of requests that must be assigned to some server. The client set and the server one are connected to each other via a fixed bipartite graph: the requests of client

v

can only be sent to the servers in its neighborhood

N(v)

. The goal is to assign every client request so as to minimize the maximum load of the servers. In this setting, efficient parallel protocols are available only for dense topolgies. In particular, a simple symmetric, non-adaptive protocol achieving constant maximum load has been recently introduced by Becchetti et al \cite{BCNPT18} for regular dense bipartite graphs. The parallel completion time is \bigO(\log n) and the overall work is \bigO(n), w.h.p. Motivated by proximity constraints arising in some client-server systems, we devise a simple variant of Becchetti et al's protocol \cite{BCNPT18} and we analyse it over almost-regular bipartite graphs where nodes may have neighborhoods of small size. In detail, we prove that, w.h.p., this new version has a cost equivalent to that of Becchetti et al's protocol (in terms of maximum load, completion time, and work complexity, respectively) on every almost-regular bipartite graph with degree

\Omega(\log^2n)

. Our analysis significantly departs from that in \cite{BCNPT18} for the original protocol and requires to cope with non-trivial stochastic-dependence issues on the random choices of the algorithmic process which are due to the worst-case, sparse topology of the underlying graph

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

ART