Search CORE

438 research outputs found

A unified approach to linear probing hashing with buckets

Author: Janson Svante
Viola Alfredo
Publication venue
Publication date: 22/10/2014
Field of study

We give a unified analysis of linear probing hashing with a general bucket size. We use both a combinatorial approach, giving exact formulas for generating functions, and a probabilistic approach, giving simple derivations of asymptotic results. Both approaches complement nicely, and give a good insight in the relation between linear probing and random walks. A key methodological contribution, at the core of Analytic Combinatorics, is the use of the symbolic method (based on q-calculus) to directly derive the generating functions to analyze.Comment: 49 page

arXiv.org e-Print Archive

CiteSeerX

Boosting Multi-Core Reachability Performance with Shared Hash Tables

Author: Laarman Alfons
van de Pol Jaco
Weber Michael
Publication venue
Publication date: 01/01/2010
Field of study

This paper focuses on data structures for multi-core reachability, which is a key component in model checking algorithms and other verification methods. A cornerstone of an efficient solution is the storage of visited states. In related work, static partitioning of the state space was combined with thread-local storage and resulted in reasonable speedups, but left open whether improvements are possible. In this paper, we present a scaling solution for shared state storage which is based on a lockless hash table implementation. The solution is specifically designed for the cache architecture of modern CPUs. Because model checking algorithms impose loose requirements on the hash table operations, their design can be streamlined substantially compared to related work on lockless hash tables. Still, an implementation of the hash table presented here has dozens of sensitive performance parameters (bucket size, cache line size, data layout, probing sequence, etc.). We analyzed their impact and compared the resulting speedups with related tools. Our implementation outperforms two state-of-the-art multi-core model checkers (SPIN and DiVinE) by a substantial margin, while placing fewer constraints on the load balancing and search algorithms.Comment: preliminary repor

arXiv.org e-Print Archive

CiteSeerX

University of Twente Research Information

Tradeoffs for nearest neighbors on the sphere

Author: Laarhoven Thijs
Publication venue
Publication date: 01/01/2015
Field of study

We consider tradeoffs between the query and update complexities for the (approximate) nearest neighbor problem on the sphere, extending the recent spherical filters to sparse regimes and generalizing the scheme and analysis to account for different tradeoffs. In a nutshell, for the sparse regime the tradeoff between the query complexity

n^{\rho_q}

and update complexity

n^{\rho_u}

for data sets of size

n

is given by the following equation in terms of the approximation factor

c

and the exponents

\rho_q

and

\rho_u

c^2\sqrt{\rho_q}+(c^2-1)\sqrt{\rho_u}=\sqrt{2c^2-1}.

For small

c=1+\epsilon

, minimizing the time for updates leads to a linear space complexity at the cost of a query time complexity

n^{1-4\epsilon^2}

. Balancing the query and update costs leads to optimal complexities

n^{1/(2c^2-1)}

, matching bounds from [Andoni-Razenshteyn, 2015] and [Dubiner, IEEE-TIT'10] and matching the asymptotic complexities of [Andoni-Razenshteyn, STOC'15] and [Andoni-Indyk-Laarhoven-Razenshteyn-Schmidt, NIPS'15]. A subpolynomial query time complexity

n^{o(1)}

can be achieved at the cost of a space complexity of the order

n^{1/(4\epsilon^2)}

, matching the bound

n^{\Omega(1/\epsilon^2)}

of [Andoni-Indyk-Patrascu, FOCS'06] and [Panigrahy-Talwar-Wieder, FOCS'10] and improving upon results of [Indyk-Motwani, STOC'98] and [Kushilevitz-Ostrovsky-Rabani, STOC'98]. For large

c

, minimizing the update complexity results in a query complexity of

n^{2/c^2+O(1/c^4)}

, improving upon the related exponent for large

c

of [Kapralov, PODS'15] by a factor

2

, and matching the bound

n^{\Omega(1/c^2)}

of [Panigrahy-Talwar-Wieder, FOCS'08]. Balancing the costs leads to optimal complexities

n^{1/(2c^2-1)}

, while a minimum query time complexity can be achieved with update complexity

n^{2/c^2+O(1/c^4)}

, improving upon the previous best exponents of Kapralov by a factor

2

.Comment: 16 pages, 1 table, 2 figures. Mostly subsumed by arXiv:1608.03580 [cs.DS] (along with arXiv:1605.02701 [cs.DS]

arXiv.org e-Print Archive

Repository TU/e

Pure OAI Repository

Analysing the Performance of GPU Hash Tables for State Space Exploration

Author: Cassee Nathan
Wijs Anton
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2017
Field of study

In the past few years, General Purpose Graphics Processors (GPUs) have been used to significantly speed up numerous applications. One of the areas in which GPUs have recently led to a significant speed-up is model checking. In model checking, state spaces, i.e., large directed graphs, are explored to verify whether models satisfy desirable properties. GPUexplore is a GPU-based model checker that uses a hash table to efficiently keep track of already explored states. As a large number of states is discovered and stored during such an exploration, the hash table should be able to quickly handle many inserts and queries concurrently. In this paper, we experimentally compare two different hash tables optimised for the GPU, one being the GPUexplore hash table, and the other using Cuckoo hashing. We compare the performance of both hash tables using random and non-random data obtained from model checking experiments, to analyse the applicability of the two hash tables for state space exploration. We conclude that Cuckoo hashing is three times faster than GPUexplore hashing for random data, and that Cuckoo hashing is five to nine times faster for non-random data. This suggests great potential to further speed up GPUexplore in the near future.Comment: In Proceedings GaM 2017, arXiv:1712.0834

arXiv.org e-Print Archive

Repository TU/e

Pure OAI Repository

Hybrid LSH: Faster Near Neighbors Reporting in High-dimensional Space

Author: Pham Ninh
Publication venue
Publication date: 01/01/2017
Field of study

We study the

r

-near neighbors reporting problem (

r

-NN), i.e., reporting \emph{all} points in a high-dimensional point set

S

that lie within a radius

r

of a given query point

q

. Our approach builds upon on the locality-sensitive hashing (LSH) framework due to its appealing asymptotic sublinear query time for near neighbor search problems in high-dimensional space. A bottleneck of the traditional LSH scheme for solving

r

-NN is that its performance is sensitive to data and query-dependent parameters. On datasets whose data distributions have diverse local density patterns, LSH with inappropriate tuning parameters can sometimes be outperformed by a simple linear search. In this paper, we introduce a hybrid search strategy between LSH-based search and linear search for

r

-NN in high-dimensional space. By integrating an auxiliary data structure into LSH hash tables, we can efficiently estimate the computational cost of LSH-based search for a given query regardless of the data distribution. This means that we are able to choose the appropriate search strategy between LSH-based search and linear search to achieve better performance. Moreover, the integrated data structure is time efficient and fits well with many recent state-of-the-art LSH-based approaches. Our experiments on real-world datasets show that the hybrid search approach outperforms (or is comparable to) both LSH-based search and linear search for a wide range of search radii and data distributions in high-dimensional space.Comment: Accepted as a short paper in EDBT 201

arXiv.org e-Print Archive

Copenhagen University Research Information System

A Distributed Hash Table for Shared Memory

Author: Dijk Tom van
Oortwijn Wytse
Pol Jaco van de
Publication venue: Springer
Publication date: 01/09/2015
Field of study

Distributed algorithms for graph searching require a high-performance CPU-efficient hash table that supports find-or-put. This operation either inserts data or indicates that it has already been added before. This paper focuses on the design and evaluation of such a hash table, targeting supercomputers. The latency of find-or-put is minimized by using one-sided RDMA operations. These operations are overlapped as much as possible to reduce waiting times for roundtrips. In contrast to existing work, we use linear probing and argue that this requires less roundtrips. The hash table is implemented in UPC. A peak-throughput of 114.9 million op/s is reached on an Infiniband cluster. With a load-factor of 0.9, find-or-put can be performed in 4.5μs on average. The hash table performance remains very high, even under high loads

University of Twente Research Information