143 research outputs found
Dynamic Connectivity in Disk Graphs
Let S ⊆ R2 be a set of n sites in the plane, so that every site s ∈ S has an associated
radius rs > 0. Let D(S) be the disk intersection graph defined by S, i.e., the graph
with vertex set S and an edge between two distinct sites s, t ∈ S if and only if the
disks with centers s, t and radii rs , rt intersect. Our goal is to design data structures
that maintain the connectivity structure of D(S) as sites are inserted and/or deleted
in S. First, we consider unit disk graphs, i.e., we fix rs = 1, for all sites s ∈ S.
For this case, we describe a data structure that has O(log2 n) amortized update time
and O(log n/ log log n) query time. Second, we look at disk graphs with bounded
radius ratio Ψ, i.e., for all s ∈ S, we have 1 ≤ rs ≤ Ψ, for a parameter Ψ that is
known in advance. Here, we not only investigate the fully dynamic case, but also the
incremental and the decremental scenario, where only insertions or only deletions of
sites are allowed. In the fully dynamic case, we achieve amortized expected update
time O(Ψ log4 n) and query time O(log n/ log log n). This improves the currently
best update time by a factor of Ψ. In the incremental case, we achieve logarithmic
dependency on Ψ, with a data structure that has O(α(n)) amortized query time and
O(log Ψ log4 n) amortized expected update time, where α(n) denotes the inverse Ackermann
function. For the decremental setting, we first develop an efficient decremental
disk revealing data structure: given two sets R and B of disks in the plane, we can delete
disks from B, and upon each deletion, we receive a list of all disks in R that no longer
intersect the union of B. Using this data structure, we get decremental data structures
with a query time of O(log n/ log log n) that supports deletions in O(n log Ψ log4 n)
overall expected time for disk graphs with bounded radius ratio Ψ and O(n log5 n)
overall expected time for disk graphs with arbitrary radii, assuming that the deletion
sequence is oblivious of the internal random choices of the data structures
LIPIcs, Volume 251, ITCS 2023, Complete Volume
LIPIcs, Volume 251, ITCS 2023, Complete Volum
Applications of Molecular Dynamics simulations for biomolecular systems and improvements to density-based clustering in the analysis
Molecular Dynamics simulations provide a powerful tool to study biomolecular systems with atomistic detail. The key to better understand the function and behaviour of these molecules can often be found in their structural variability. Simulations can help to expose this information that is otherwise experimentally hard or impossible to attain. This work covers two application examples for which a sampling and a characterisation of the conformational ensemble could reveal the structural basis to answer a topical research question. For the fungal toxin phalloidin—a small bicyclic peptide—observed product ratios in different cyclisation reactions could be rationalised by assessing the conformational pre-organisation of precursor fragments. For the C-type lectin receptor langerin, conformational changes induced by different side-chain protonations could deliver an explanation
of the pH-dependency in the protein’s calcium-binding. The investigations were accompanied by the continued development of a density-based clustering protocol into a respective software package, which is generally well applicable for the use case of extracting conformational states from Molecular Dynamics data
One or Two Things We know about Concept Drift -- A Survey on Monitoring Evolving Environments
The world surrounding us is subject to constant change. These changes,
frequently described as concept drift, influence many industrial and technical
processes. As they can lead to malfunctions and other anomalous behavior, which
may be safety-critical in many scenarios, detecting and analyzing concept drift
is crucial. In this paper, we provide a literature review focusing on concept
drift in unsupervised data streams. While many surveys focus on supervised data
streams, so far, there is no work reviewing the unsupervised setting. However,
this setting is of particular relevance for monitoring and anomaly detection
which are directly applicable to many tasks and challenges in engineering. This
survey provides a taxonomy of existing work on drift detection. Besides, it
covers the current state of research on drift localization in a systematic way.
In addition to providing a systematic literature review, this work provides
precise mathematical definitions of the considered problems and contains
standardized experiments on parametric artificial datasets allowing for a
direct comparison of different strategies for detection and localization.
Thereby, the suitability of different schemes can be analyzed systematically
and guidelines for their usage in real-world scenarios can be provided.
Finally, there is a section on the emerging topic of explaining concept drift
Geometric Inhomogeneous Random Graphs for Algorithm Engineering
The design and analysis of graph algorithms is heavily based on the worst case.
In practice, however, many algorithms perform much better than the worst case would suggest.
Furthermore, various problems can be tackled more efficiently if one assumes the input to be, in a sense, realistic.
The field of network science, which studies the structure and emergence of real-world networks, identifies locality and heterogeneity as two frequently occurring properties.
A popular model that captures these properties are geometric inhomogeneous random graphs (GIRGs), which is a generalization of hyperbolic random graphs (HRGs).
Aside from their importance to network science, GIRGs can be an immensely valuable tool in algorithm engineering.
Since they convincingly mimic real-world networks, guarantees about quality and performance of an algorithm on instances of the model can be transferred to real-world applications.
They have model parameters to control the amount of heterogeneity and locality, which allows to evaluate those properties in isolation while keeping the rest fixed.
Moreover, they can be efficiently generated which allows for experimental analysis.
While realistic instances are often rare, generated instances are readily available.
Furthermore, the underlying geometry of GIRGs helps to visualize the network, e.g.,~for debugging or to improve understanding of its structure.
The aim of this work is to demonstrate the capabilities of geometric inhomogeneous random graphs in algorithm engineering and establish them as routine tools to replace previous models like the Erd\H{o}s-R{\\u27e}nyi model, where each edge exists with equal probability.
We utilize geometric inhomogeneous random graphs to design, evaluate, and optimize efficient algorithms for realistic inputs.
In detail, we provide the currently fastest sequential generator for GIRGs and HRGs and describe algorithms for maximum flow, directed spanning arborescence, cluster editing, and hitting set.
For all four problems, our implementations beat the state-of-the-art on realistic inputs.
On top of providing crucial benchmark instances, GIRGs allow us to obtain valuable insights.
Most notably, our efficient generator allows us to
experimentally show sublinear running time of our flow algorithm,
investigate the solution structure of cluster editing,
complement our benchmark set of arborescence instances with a density for which there are no real-world networks available,
and generate networks with adjustable locality and heterogeneity to reveal the effects of these properties on our algorithms
Distributed Spatial Data Sharing: a new era in sharing spatial data
The advancements in information and communications technology, including the widespread adoption of GPS-based sensors, improvements in computational data processing, and satellite imagery, have resulted in new data sources, stakeholders, and methods of producing, using, and sharing spatial data. Daily, vast amounts of data are produced by individuals interacting with digital content and through automated and semi-automated sensors deployed across the environment. A growing portion of this information contains geographic information directly or indirectly embedded within it. The widespread use of automated smart sensors and an increased variety of georeferenced media resulted in new individual data collectors. This raises a new set of social concerns around individual geopricacy and data ownership. These changes require new approaches to managing, sharing, and processing geographic data. With the appearance of distributed data-sharing technologies, some of these challenges may be addressed. This can be achieved by moving from centralized control and ownership of the data to a more distributed system. In such a system, the individuals are responsible for gathering and controlling access and storing data. Stepping into the new area of distributed spatial data sharing needs preparations, including developing tools and algorithms to work with spatial data in this new environment efficiently. Peer-to-peer (P2P) networks have become very popular for storing and sharing information in a decentralized approach. However, these networks lack the methods to process spatio-temporal queries. During the first chapter of this research, we propose a new spatio-temporal multi-level tree structure, Distributed Spatio-Temporal Tree (DSTree), which aims to address this problem. DSTree is capable of performing a range of spatio-temporal queries. We also propose a framework that uses blockchain to share a DSTree on the distributed network, and each user can replicate, query, or update it. Next, we proposed a dynamic k-anonymity algorithm to address geoprivacy concerns in distributed platforms. Individual dynamic control of geoprivacy is one of the primary purposes of the proposed framework introduced in this research. Sharing data within and between organizations can be enhanced by greater trust and transparency offered by distributed or decentralized technologies. Rather than depending on a central authority to manage geographic data, a decentralized framework would provide a fine-grained and transparent sharing capability. Users can also control the precision of shared spatial data with others. They are not limited to third-party algorithms to decide their privacy level and are also not limited to the binary levels of location sharing. As mentioned earlier, individuals and communities can benefit from distributed spatial data sharing. During the last chapter of this work, we develop an image-sharing platform, aka harvester safety application, for the Kakisa indigenous community in northern Canada. During this project, we investigate the potential of using a Distributed Spatial Data sharing (DSDS) infrastructure for small-scale data-sharing needs in indigenous communities. We explored the potential use case and challenges and proposed a DSDS architecture to allow users in small communities to share and query their data using DSDS. Looking at the current availability of distributed tools, the sustainable development of such applications needs accessible technology. We need easy-to-use tools to use distributed technologies on community-scale SDS. In conclusion, distributed technology is in its early stages and requires easy-to-use tools/methods and algorithms to handle, share and query geographic information. Once developed, it will be possible to contrast DSDS against other data systems and thereby evaluate the practical benefit of such systems. A distributed data-sharing platform needs a standard framework to share data between different entities. Just like the first decades of the appearance of the web, these tools need regulations and standards. Such can benefit individuals and small communities in the current chaotic spatial data-sharing environment controlled by the central bodies
LIPIcs, Volume 274, ESA 2023, Complete Volume
LIPIcs, Volume 274, ESA 2023, Complete Volum
Fully Dynamic Maximum Independent Sets of Disks in Polylogarithmic Update Time
A fundamental question in computational geometry is for a dynamic collection
of geometric objects in Euclidean space, whether it is possible to maintain a
maximum independent set in polylogarithmic update time. Already, for a set of
intervals, it is known that no dynamic algorithm can maintain an exact maximum
independent set with sublinear update time. Therefore, the typical objective is
to explore the trade-off between update time and solution size. Substantial
efforts have been made in recent years to understand this question for various
families of geometric objects, such as intervals, hypercubes, hyperrectangles,
and fat objects.
We present the first fully dynamic approximation algorithm for disks of
arbitrary radii in the plane that maintains a constant-factor approximate
maximum independent set in polylogarithmic update time. First, we show that for
a fully dynamic set of unit disks in the plane, a -approximate maximum
independent set can be maintained with worst-case update time ,
and optimal output-sensitive reporting. Moreover, this result generalizes to
fat objects of comparable sizes in any fixed dimension , where the
approximation ratio depends on the dimension and the fatness parameter. Our
main result is that for a fully dynamic set of disks of arbitrary radii in the
plane, an -approximate maximum independent set can be maintained in
polylogarithmic expected amortized update time.Comment: Abstract is shortened to meet Arxiv's requirement on the number of
character
LIPIcs, Volume 258, SoCG 2023, Complete Volume
LIPIcs, Volume 258, SoCG 2023, Complete Volum
- …