74 research outputs found
Theoretically-Efficient and Practical Parallel DBSCAN
The DBSCAN method for spatial clustering has received significant attention
due to its applicability in a variety of data analysis tasks. There are fast
sequential algorithms for DBSCAN in Euclidean space that take work
for two dimensions, sub-quadratic work for three or more dimensions, and can be
computed approximately in linear work for any constant number of dimensions.
However, existing parallel DBSCAN algorithms require quadratic work in the
worst case, making them inefficient for large datasets. This paper bridges the
gap between theory and practice of parallel DBSCAN by presenting new parallel
algorithms for Euclidean exact DBSCAN and approximate DBSCAN that match the
work bounds of their sequential counterparts, and are highly parallel
(polylogarithmic depth). We present implementations of our algorithms along
with optimizations that improve their practical performance. We perform a
comprehensive experimental evaluation of our algorithms on a variety of
datasets and parameter settings. Our experiments on a 36-core machine with
hyper-threading show that we outperform existing parallel DBSCAN
implementations by up to several orders of magnitude, and achieve speedups by
up to 33x over the best sequential algorithms
Local, Smooth, and Consistent Jacobi Set Simplification
The relation between two Morse functions defined on a common domain can be
studied in terms of their Jacobi set. The Jacobi set contains points in the
domain where the gradients of the functions are aligned. Both the Jacobi set
itself as well as the segmentation of the domain it induces have shown to be
useful in various applications. Unfortunately, in practice functions often
contain noise and discretization artifacts causing their Jacobi set to become
unmanageably large and complex. While there exist techniques to simplify Jacobi
sets, these are unsuitable for most applications as they lack fine-grained
control over the process and heavily restrict the type of simplifications
possible.
In this paper, we introduce a new framework that generalizes critical point
cancellations in scalar functions to Jacobi sets in two dimensions. We focus on
simplifications that can be realized by smooth approximations of the
corresponding functions and show how this implies simultaneously simplifying
contiguous subsets of the Jacobi set. These extended cancellations form the
atomic operations in our framework, and we introduce an algorithm to
successively cancel subsets of the Jacobi set with minimal modifications
according to some user-defined metric. We prove that the algorithm is correct
and terminates only once no more local, smooth and consistent simplifications
are possible. We disprove a previous claim on the minimal Jacobi set for
manifolds with arbitrary genus and show that for simply connected domains, our
algorithm reduces a given Jacobi set to its simplest configuration.Comment: 24 pages, 19 figure
I/O-efficient algorithm for constrained Delaunay triangulation with applications to proximity search
Master'sMASTER OF SCIENC
Dynamic Connectivity in Disk Graphs
Let S ⊆ R2 be a set of n sites in the plane, so that every site s ∈ S has an associated
radius rs > 0. Let D(S) be the disk intersection graph defined by S, i.e., the graph
with vertex set S and an edge between two distinct sites s, t ∈ S if and only if the
disks with centers s, t and radii rs , rt intersect. Our goal is to design data structures
that maintain the connectivity structure of D(S) as sites are inserted and/or deleted
in S. First, we consider unit disk graphs, i.e., we fix rs = 1, for all sites s ∈ S.
For this case, we describe a data structure that has O(log2 n) amortized update time
and O(log n/ log log n) query time. Second, we look at disk graphs with bounded
radius ratio Ψ, i.e., for all s ∈ S, we have 1 ≤ rs ≤ Ψ, for a parameter Ψ that is
known in advance. Here, we not only investigate the fully dynamic case, but also the
incremental and the decremental scenario, where only insertions or only deletions of
sites are allowed. In the fully dynamic case, we achieve amortized expected update
time O(Ψ log4 n) and query time O(log n/ log log n). This improves the currently
best update time by a factor of Ψ. In the incremental case, we achieve logarithmic
dependency on Ψ, with a data structure that has O(α(n)) amortized query time and
O(log Ψ log4 n) amortized expected update time, where α(n) denotes the inverse Ackermann
function. For the decremental setting, we first develop an efficient decremental
disk revealing data structure: given two sets R and B of disks in the plane, we can delete
disks from B, and upon each deletion, we receive a list of all disks in R that no longer
intersect the union of B. Using this data structure, we get decremental data structures
with a query time of O(log n/ log log n) that supports deletions in O(n log Ψ log4 n)
overall expected time for disk graphs with bounded radius ratio Ψ and O(n log5 n)
overall expected time for disk graphs with arbitrary radii, assuming that the deletion
sequence is oblivious of the internal random choices of the data structures
LIPIcs, Volume 244, ESA 2022, Complete Volume
LIPIcs, Volume 244, ESA 2022, Complete Volum
- …