On the Insertion Time of Cuckoo Hashing
Cuckoo hashing is an efficient technique for creating large hash tables with high space utilization and guaranteed constant access times. There, each item can be placed in a location given by any one of k different hash functions. In this paper we investigate further the random walk heuristic for inserting new items into the hash table in an online fashion. Provided that k > 2 and that the number of items in the table is below (but arbitrarily close to) the theoretically achievable load threshold, we show a polylogarithmic bound for the maximum insertion time that holds with high probability.
Comment: 27 pages, final version accepted by the SIAM Journal on Computing
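The random walk heuristic analysed above is easy to state in code. The following is a minimal simulation sketch, not the paper's construction: the tabulated hash functions, table size, load, and step cap are illustrative assumptions of our own.

```python
import random

def insert_random_walk(table, item, hashes, max_steps=10_000):
    """Insert `item` using the random-walk heuristic: if all k candidate
    slots are occupied, evict a uniformly random occupant and try to
    re-place it, repeating until a free slot turns up."""
    current = item
    for _ in range(max_steps):
        slots = [h(current) for h in hashes]           # k candidate locations
        free = [s for s in slots if table[s] is None]
        if free:
            table[random.choice(free)] = current
            return True
        victim = random.choice(slots)                  # all full: random eviction
        table[victim], current = current, table[victim]
    return False                                       # step budget exhausted

# Toy usage: k = 3 tabulated "hash functions" over a table of 1000 slots,
# filled to load 0.8, below the k = 3 load threshold of roughly 0.917.
random.seed(1)
n, k = 1000, 3
table = [None] * n
maps = [[random.randrange(n) for _ in range(n)] for _ in range(k)]
hashes = [lambda x, m=m: m[x] for m in maps]
ok = all(insert_random_walk(table, x, hashes) for x in range(800))
```

With the load safely below the threshold, every insertion here terminates well within the step budget; the polylogarithmic guarantee of the paper concerns the worst insertion over the whole sequence.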
On the insertion time of random walk cuckoo hashing
Cuckoo Hashing is a hashing scheme invented by Pagh and Rodler. It uses k distinct hash functions to insert items into the hash table. It has been an open question for some time what the expected time for Random Walk Insertion to add items is. We show that if the number of hash functions k is sufficiently large, then the expected insertion time is constant per item.
Comment: 9 pages
Double Hashing Thresholds via Local Weak Convergence
A lot of interest has recently arisen in the analysis of multiple-choice "cuckoo hashing" schemes. In this context, a main performance criterion is the load threshold under which the hashing scheme is able to build a valid hash table with high probability in the limit of large systems; various techniques (differential equations, combinatorics, the cavity method) have successfully been used to answer this question for increasing levels of generality of the model. However, the hashing scheme analysed so far is quite utopian in that it requires generating a lot of independent, fully random choices. Schemes with reduced randomness exist, such as "double hashing", which is expected to provide asymptotic results similar to those of the ideal scheme, yet these schemes have so far been more resistant to analysis. In this paper, we point out that the approach via the cavity method extends quite naturally to the analysis of double hashing and allows us to compute the corresponding threshold. The path followed is to show that the graph induced by the double hashing scheme has the same local weak limit as the one obtained with full randomness.
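To make the reduced randomness concrete: double hashing derives all k choices from just two hash values, an offset and a step. A minimal sketch (the table size, hash functions, and names are illustrative assumptions):

```python
def double_hash_choices(x, k, n, h1, h2):
    """The k bin choices of item x under double hashing: instead of k
    independent hashes, use offset h1(x) and step h2(x) and take
    positions h1(x) + i*h2(x) (mod n) for i = 0, ..., k-1."""
    a, b = h1(x), h2(x)
    return [(a + i * b) % n for i in range(k)]

# Toy usage: with n prime and a nonzero step, the k positions are distinct.
n = 101
h1 = lambda x: hash(("offset", x)) % n
h2 = lambda x: 1 + hash(("step", x)) % (n - 1)
choices = double_hash_choices("some item", 5, n, h1, h2)
```

Only two random values per item are consumed, versus k in the fully random scheme; the paper's point is that the induced hypergraph nevertheless has the same local weak limit, hence the same load threshold.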
Multiple choice allocations with small maximum loads
The idea of using multiple choices to improve allocation schemes is now well understood and is often illustrated by the following example. Suppose n balls are allocated to n bins with each ball choosing a bin independently and uniformly at random. The \emph{maximum load}, or the number of balls in the most loaded bin, will then be approximately log n / log log n with high probability. Suppose now the balls are allocated sequentially by placing a ball in the least loaded bin among k ≥ 2 bins chosen independently and uniformly at random. Azar, Broder, Karlin, and Upfal showed that in this scenario, the maximum load drops to log log n / log k + O(1), with high probability, which is an exponential improvement over the previous case.
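The exponential drop is easy to observe empirically. A small Monte Carlo sketch (function names and parameters are our own, chosen only for illustration):

```python
import random

def max_load(n, d, seed=0):
    """Throw n balls into n bins; each ball picks d bins uniformly at
    random and goes into the currently least loaded of them.
    Returns the maximum load over all bins."""
    rng = random.Random(seed)
    bins = [0] * n
    for _ in range(n):
        picks = [rng.randrange(n) for _ in range(d)]
        bins[min(picks, key=bins.__getitem__)] += 1
    return max(bins)

single = max_load(100_000, 1)   # about log n / log log n
double = max_load(100_000, 2)   # about log log n / log 2 + O(1)
```

For n = 100000, the one-choice maximum load typically lands around 7 to 9, while two choices already bring it down to 3 or 4, matching the Azar, Broder, Karlin, and Upfal bound.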
In this thesis we investigate multiple choice allocations from a slightly different perspective. Instead of minimizing the maximum load, we fix the bin capacities and focus on maximizing the number of balls that can be allocated without overloading any bin. In the process that we consider we have m = ⌊cn⌋ balls and n bins. Each ball chooses k bins independently and uniformly at random. \emph{Is it possible to assign each ball to one of its choices such that no bin receives more than ℓ balls?} For all k ≥ 3 and ℓ ≥ 2 we give a critical value, c*_{k,ℓ}, such that when c < c*_{k,ℓ} such an assignment exists with high probability, and when c > c*_{k,ℓ} this is not the case.
In case such an allocation exists, \emph{how quickly can we find it?} Previous work on total allocation time for the case k ≥ 3 and ℓ = 1 has analyzed a \emph{breadth first strategy}, which is shown to be linear only in expectation. We give a simple and efficient algorithm, which we call \emph{local search allocation} (LSA), to find an allocation for all k ≥ 3 and ℓ = 1. Provided the number of balls is below (but arbitrarily close to) the theoretically achievable load threshold, we give a \emph{linear} bound for the total allocation time that holds with high probability.
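The label-based local search idea can be sketched as follows for the simplest case ℓ = 1 (unit-capacity bins). This is our reading of the approach with illustrative parameters, not the thesis's exact pseudocode: each bin keeps an integer label meant as a lower bound on its distance to a free bin, a ball always moves to its minimum-label choice, and the label of the chosen bin is raised past the ball's other options.

```python
import random

def lsa_allocate(m, n, k, seed=0):
    """Local-search-style allocation for unit-capacity bins (l = 1).
    A ball goes to its minimum-label choice, evicting any occupant; the
    bin's label then rises to one more than the smallest label among the
    ball's other choices, so labels strictly increase and the loop
    terminates. Returns the bin -> ball assignment, or None if a label
    reaching n certifies that no allocation exists."""
    rng = random.Random(seed)
    choices = [rng.sample(range(n), k) for _ in range(m)]
    label = [0] * n
    occupant = [None] * n
    for ball in range(m):
        current = ball
        while current is not None:
            cs = choices[current]
            target = min(cs, key=label.__getitem__)
            if label[target] >= n:                 # no allocation possible
                return None
            current, occupant[target] = occupant[target], current
            label[target] = 1 + min(label[j] for j in cs if j != target)
    return occupant

# Toy usage: m = 800 balls, n = 1000 bins, k = 3 choices -- load 0.8,
# below the (k = 3, l = 1) threshold of roughly 0.917.
assignment = lsa_allocate(800, 1000, 3, seed=7)
```

Since the chosen bin has the minimum label among the ball's choices, its new label exceeds its old one, which is what yields the termination and linear-time behavior the abstract refers to.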
We demonstrate, through simulations, an order of magnitude improvement for total and maximum allocation times when compared to the state-of-the-art method.
Our results find applications in many areas including hashing, load balancing, data management, orientability of random hypergraphs, and maximum matchings in a special class of bipartite graphs.