Linear Hashing is Awesome
We consider the hash function h(x) = ((ax + b) mod p) mod n, where a and b
are chosen uniformly at random from {0, 1, ..., p-1}. We prove that when we
use h in hashing with chaining to insert n elements into a table of size n,
the expected length of the longest chain is Õ(n^{1/3}). The proof also
generalises to give the same bound when we use the multiply-shift hash
function by Dietzfelbinger et al. [Journal of Algorithms 1997]. Comment: A
preliminary version appeared at FOCS'16
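The scheme this abstract studies is easy to demonstrate. The sketch below (illustrative only; the prime p, the table size n, and the structured key set are arbitrary choices, not taken from the paper) builds h(x) = ((ax + b) mod p) mod n and measures the longest chain it produces under chaining:

```python
import random

def make_linear_hash(p, n):
    """Draw a, b uniformly from {0, ..., p-1} and return
    h(x) = ((a*x + b) mod p) mod n, as in the abstract."""
    a = random.randrange(p)
    b = random.randrange(p)
    return lambda x: ((a * x + b) % p) % n

def longest_chain(keys, h, n):
    """Insert the keys into a chained hash table with n bins and
    report the length of the longest chain (the maxload)."""
    table = [0] * n
    for x in keys:
        table[h(x)] += 1
    return max(table)

p = 2**31 - 1          # a Mersenne prime larger than every key below
n = 1024               # table size equal to the number of inserted keys
keys = range(n)        # a structured input: a contiguous interval of keys
h = make_linear_hash(p, n)
print(longest_chain(keys, h, n))
```

The paper's Õ(n^{1/3}) bound concerns the expectation of this maxload over the random draw of a and b, for a worst-case choice of the key set.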
Theory and applications of hashing: report from Dagstuhl Seminar 17181
This report documents the program and the topics discussed at the 4-day Dagstuhl Seminar 17181 “Theory and Applications of Hashing”, which took place May 1–5, 2017. Four long and eighteen short talks covered a wide and diverse range of topics within the theme of the workshop. The program left sufficient space for informal discussions among the 40 participants.
How blockchain impacts cloud-based system performance: a case study for a groupware communication application
This paper examines the performance trade-off when implementing a blockchain architecture for a cloud-based groupware communication application. We measure the additional cloud-based resources and performance costs of the overhead required to implement a groupware collaboration system over a blockchain architecture. To evaluate our groupware application, we develop measuring instruments for testing the scalability and performance of computer systems deployed as cloud computing applications. While some details of our groupware collaboration application have been published in earlier work, in this paper we reflect on a generalized measuring method for blockchain-enabled applications, which may in turn lead to a general methodology for testing cloud-based system performance and scalability using blockchain. Response time and transaction throughput metrics are collected for both the blockchain and non-blockchain implementations, and some conclusions are drawn about the additional resources that a blockchain architecture for a groupware collaboration application imposes.
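As a rough illustration of the two metrics named above, response time and transaction throughput, here is a minimal timing harness (a sketch only; `operation` is a hypothetical stand-in for one groupware transaction, not the instrumentation used in the paper):

```python
import statistics
import time

def measure(operation, n_requests=100):
    """Issue n_requests calls to `operation` and report
    (mean response time in seconds, throughput in operations/second)."""
    latencies = []
    start = time.perf_counter()
    for _ in range(n_requests):
        t0 = time.perf_counter()
        operation()
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start
    return statistics.mean(latencies), n_requests / elapsed

# Example: time a trivial local computation standing in for a transaction.
mean_latency, throughput = measure(lambda: sum(range(1000)))
print(mean_latency, throughput)
```

Comparing these two numbers for a blockchain-backed and a plain implementation of the same operation is the shape of the comparison the abstract describes.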
Linear Hashing: No Shift, Non-Prime Modulus, For Real!
In classical Linear Hashing, items x are mapped to n bins by a function such
as h(x) = ((ax + b) mod p) mod n, for a prime p and randomly chosen integers
a and b. Despite h's simplicity, understanding the expected maxload, i.e.,
the number of elements in a fullest bin, of h for worst-case inputs is a
notoriously challenging open question. For hashing n items, the best known
lower bound falls far short of the best known upper bound of Õ(n^{1/3}) due
to Knudsen.
In this paper we consider three modifications of classic h: (1) h without the
"+b" shift term, resulting in loss of pairwise independence; (2) h with a
composite, rather than prime, modulus; (3) h in a continuous setting where
the multiplier "a" is chosen from [0, 1) rather than from the integers modulo
p. We show that h is fairly robust to these changes, in particular by
demonstrating analogs of known maxload bounds for these new variants.
These results give several new perspectives on h, in particular showing that
properties of h such as pairwise independence, a prime modulus, or even its
setting in the integers may not be fundamental. We believe that these new
perspectives, beyond being independently interesting, may also be useful in
future work towards understanding the maxload of h. Comment: 11 pages
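The three variants enumerated in this abstract are simple to write down side by side. A sketch (the moduli and parameter values below are illustrative assumptions, not taken from the paper):

```python
import math
import random

P = 2**31 - 1  # prime modulus for the classical scheme

def classic(a, b, x, n):
    """Classical Linear Hashing: h(x) = ((a*x + b) mod p) mod n."""
    return ((a * x + b) % P) % n

def no_shift(a, x, n):
    """Variant (1): drop the '+ b' shift term, giving up
    pairwise independence."""
    return (a * x % P) % n

def composite_modulus(a, b, x, n, m=2**31):
    """Variant (2): replace the prime p by a composite modulus m."""
    return ((a * x + b) % m) % n

def continuous(alpha, x, n):
    """Variant (3): the multiplier alpha is drawn from [0, 1);
    the key lands in the bin indexed by the fractional part of alpha*x."""
    return int(math.floor((alpha * x % 1.0) * n))

n = 16
a, b = random.randrange(1, P), random.randrange(P)
alpha = random.random()
x = 12345
print(classic(a, b, x, n), no_shift(a, x, n),
      composite_modulus(a, b, x, n), continuous(alpha, x, n))
```

Each function maps a key to one of n bins; the paper's question is how the maxload behaves when these are used in place of the classical h.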
The Whalesong
Student and community leaders meet at banquet -- Peru inspires and amazes UAS group -- Egan Library wing opens to rave reviews -- Count yourself lucky -- Notice to Stafford Loan Borrowers -- VideoVersity - what is it? -- Global ethics brought to UAS -- Success is up to you -- Student Spotlight: Augie Stiehr -- Long Live Narcissus -- Media and computer services merge -- Special election to be held -- Paint misbehavin' -- Get wet at Squire's -- Preview -- The best album you never heard.
Locally Uniform Hashing
Hashing is a common technique used in data processing, with a strong impact
on the time and resources spent on computation. Hashing also affects the
applicability of theoretical results that often assume access to (unrealistic)
uniform/fully-random hash functions. In this paper, we are concerned with
designing hash functions that are practical and come with strong theoretical
guarantees on their performance.
To this end, we present tornado tabulation hashing, which is simple, fast,
and exhibits a certain full, local randomness property that provably makes
diverse algorithms perform almost as if (abstract) fully-random hashing was
used. For example, this includes classic linear probing, the widely used
HyperLogLog algorithm of Flajolet, Fusy, Gandouet, Meunier [AofA'07] for
counting distinct elements, and the one-permutation hashing of Li, Owen, and
Zhang [NIPS 12] for large-scale machine learning. We also provide a very
efficient solution for the classical problem of obtaining fully-random hashing
on a fixed (but unknown to the hash function) set of n keys using linear
space. As a consequence, we get more efficient implementations of the splitting
trick of Dietzfelbinger and Rink [ICALP'09] and the succinct space uniform
hashing of Pagh and Pagh [SICOMP'08].
Tornado tabulation hashing is based on a simple method to systematically
break dependencies in tabulation-based hashing techniques. Comment: FOCS 2023
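Tornado tabulation itself is not specified in this abstract, but the tabulation-based family it builds on is classical. Below is plain simple tabulation hashing as background (a sketch of the underlying family, not the tornado construction; the character count, character width, and 32-bit output are arbitrary choices):

```python
import random

def make_simple_tabulation(c=4, bits=8):
    """Simple tabulation hashing: view a key as c characters of `bits`
    bits each, fill c small lookup tables with random 32-bit words,
    and XOR together one table entry per character."""
    tables = [[random.getrandbits(32) for _ in range(2**bits)]
              for _ in range(c)]
    mask = 2**bits - 1
    def h(x):
        out = 0
        for i in range(c):
            out ^= tables[i][(x >> (i * bits)) & mask]
        return out
    return h

h = make_simple_tabulation()
print(h(0xDEADBEEF))
```

Schemes in this family are fast because each evaluation is just a few table lookups and XORs; the paper's contribution is a way to break the dependencies that limit the randomness of such schemes.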
Sparse Nonnegative Convolution Is Equivalent to Dense Nonnegative Convolution
Computing the convolution a ⋆ b of two length-n vectors a and b is a ubiquitous computational primitive. Applications range from string problems to Knapsack-type problems, and from 3SUM to All-Pairs Shortest Paths. These applications often come in the form of nonnegative convolution, where the entries of a and b are nonnegative integers. The classical algorithm to compute a ⋆ b uses the Fast Fourier Transform and runs in time O(n log n). However, often a and b satisfy sparsity conditions, and hence one could hope for significant improvements. The ideal goal is an O(t log t)-time algorithm, where t is the number of non-zero elements in the output, i.e., the size of the support of a ⋆ b. This problem is referred to as sparse nonnegative convolution, and has received considerable attention in the literature; the fastest algorithms to date run in time O(t log^2 n). The main result of this paper is the first O(t log t)-time algorithm for sparse nonnegative convolution. Our algorithm is randomized and assumes that the length n and the largest entry of a and b are subexponential in t. Surprisingly, we can phrase our algorithm as a reduction from the sparse case to the dense case of nonnegative convolution, showing that, under some mild assumptions, sparse nonnegative convolution is equivalent to dense nonnegative convolution for constant-error randomized algorithms. Specifically, up to lower-order terms, the time to convolve two sparse nonnegative vectors with output size t matches the time to convolve two dense nonnegative length-t vectors at the same constant success probability. Our approach uses a variety of new techniques in combination with some old machinery from linear sketching and structured linear algebra, as well as new insights on linear hashing, the most classical hash function.
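The "classical algorithm" this abstract refers to, dense nonnegative convolution via the Fast Fourier Transform in O(n log n) time, can be sketched as follows (a textbook radix-2 implementation for illustration, not the paper's sparse algorithm):

```python
import cmath

def fft(a, invert=False):
    """Recursive radix-2 Cooley-Tukey FFT; len(a) must be a power of two.
    The inverse transform is unnormalized (caller divides by the length)."""
    n = len(a)
    if n == 1:
        return a[:]
    even = fft(a[0::2], invert)
    odd = fft(a[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        w = cmath.exp(sign * 2j * cmath.pi * k / n)
        out[k] = even[k] + w * odd[k]
        out[k + n // 2] = even[k] - w * odd[k]
    return out

def convolve(a, b):
    """Dense nonnegative convolution in O(n log n): pad to a power of two,
    transform, multiply pointwise, transform back, round to integers."""
    size = 1
    while size < len(a) + len(b) - 1:
        size *= 2
    fa = fft([complex(x) for x in a] + [0j] * (size - len(a)))
    fb = fft([complex(x) for x in b] + [0j] * (size - len(b)))
    inv = fft([x * y for x, y in zip(fa, fb)], invert=True)
    return [round(v.real / size) for v in inv][: len(a) + len(b) - 1]

print(convolve([1, 2, 3], [4, 5]))  # → [4, 13, 22, 15]
```

In the sparse setting, the support size t is the number of non-zero entries of the result, e.g. `len([x for x in convolve(a, b) if x])`, and the paper's goal is to pay O(t log t) rather than O(n log n).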