169 research outputs found
Scalability analysis of declustering methods for multidimensional range queries
Abstract—Efficient storage and retrieval of multiattribute data sets has become one of the essential requirements for many data-intensive applications. The Cartesian product file has been known as an effective multiattribute file structure for partial-match and best-match queries. Several heuristic methods have been developed to decluster Cartesian product files across multiple disks to obtain high performance for disk accesses. Although the scalability of the declustering methods becomes increasingly important for systems equipped with a large number of disks, no analytic studies have been done so far. In this paper, we derive formulas describing the scalability of two popular declustering methods¦Disk Modulo and Fieldwise Xor¦for range queries, which are the most common type of queries. These formulas disclose the limited scalability of the declustering methods, and this is corroborated by extensive simulation experiments. From the practical point of view, the formulas given in this paper provide a simple measure that can be used to predict the response time of a given range query and to guide the selection of a declustering method under various conditions
Adaptive Runtime Support for Direct Simulation Monte Carlo Methods on Distributed Memory Architectures
In highly adaptive irregular problems such as many
Particle-In-Cell (PICJ codes and Dimet Simulation Monte Carlo (DSMCJ
codes, data access patterns may vary from time step to time step. This
fluctuation may hinder efficient utilization of distributed memory
parallel computers because of the resulting overhead for data
redistribution and dynamic load balancing. To efficiently parallelize
such adaptive irregular problems on distributed memory parallel
computers, several issues such as effective methods for domain
partitioning and fast data transportation must be addressed. This paper
presents efficient runtime support methods for such problems. A simple
one-dimensional domain partitioning method is implemented and compared
with unstructured mesh partitioners such as recursive coordinate
bisection and recursive inertial bisection. A remapping decision policy
has been investigated for dynamic load balancing on S-dimensional DSMC
codes. Performance results are presented
(Also cross-referenced as UMIACS-TR-95-27
Scalability Analysis of Declustering Methods for Cartesian Product Files
Efficient storage and retrieval of multi-attribute datasets
has become one of the essential requirements for many data-intensive
applications. The Cartesian product file has been known as an effective
multi-attribute file structure for partial-match and best-match queries.
Several heuristic methods have been developed to decluster Cartesian
product files over multiple disks to obtain high performance for disk
accesses. Though the scalability of the declustering methods becomes
increasingly important for systems equipped with a large number of disks,
no analytic studies have been done so far.
In this paper we derive formulas describing the scalability
of two popular declustering methods Disk Modulo and Fieldwise Xor
for range queries, which are the most common type of queries.
These formulas disclose the limited scalability of the declustering methods
and are corroborated by extensive simulation experiments.
From the practical point of view,
the formulas given in this paper provide a simple measure
which can be used to predict the response time of a given range query
and to guide the selection of a declustering method
under various conditions.
(Also cross-referenced as UMIACS-TR-96-5
Study of Scalable Declustering Algorithms for Parallel Grid Files
Efficient storage and retrieval of large multidimensional datasets is
an important concern for large-scale scientific computations such as
long-running time-dependent simulations which periodically generate
snapshots of the state.
The main challenge for efficiently handling such datasets
is to minimize response time for multidimensional range queries.
The grid file is one of the well known access methods for
multidimensional and spatial data.
We investigate effective and scalable declustering techniques
for grid files with the primary goal of minimizing response time
and the secondary goal of maximizing the fairness of data distribution.
The main contributions of this paper are (1) analytic and experimental
evaluation of existing index-based declustering techniques and their
extensions for grid files, and (2) development of a proximity-based
declustering algorithm called {\em minimax} which is experimentally
shown to scale and to consistently achieve better response time
compared to available algorithms while maintaining perfect disk distribution.
(Also cross-referenced as UMIACS-TR-96-4
Index Translation Schemes for Adaptive Computations on Distributed Memory Multicomputers
Current research in parallel programming is focused on closing
the gap between globally indexed algorithms and the separate address
spaces of processors on distributed memory multicomputers. A set of index
translation schemes have been implemented as a part of CHAOS runtime
support library, so that the library functions can be used for
implementing a global indez space across a collection of separate local
index spaces. These schemes include also software-cached translation
schemes aimed at adaptive irregular problems as teen as a distributed
translation table technique for statically irregular problems. To
evaluate and demonstrate the efficiency of the softwDare-cached
translation schemes, experiments have been performed with an adaptively
irregular loop kernel and a full-fledped 3D DSMC code from NASA Langley
on the Intel Paragon and Cray T3D. This paper also discusses and analyzes
the operational conditions under which each scheme can produce optimal
performance.
(Also cross-referenced as UMIACS-TR-95-28
Analysis of the Clustering Properties of Hilbert Space-filling Curve
Several schemes for linear mapping of multidimensional space
have been proposed for many applications such as
access methods for spatio-temporal databases, image compression and so on.
In all these applications, one of the most desired properties
from such linear mappings is clustering, which means
the locality between objects in the multidimensional space
is preserved in the linear space.
It is widely believed that Hilbert space-filling curve
achieves the best clustering.
In this paper we provide closed-form formulas of
the number of clusters required by a given query region
of an arbitrary shape (e.g., polygons and polyhedra)
for Hilbert space-filling curve.
Both the asymptotic solution for a general case and the exact solution
for a special case generalize the previous work,
and they agree with the empirical results that
the number of clusters depends on the hyper-surface area of the query region
and not on its hyper-volume.
We have also shown that Hilbert curve achieves better clustering
than z-curve.
From the practical point of view,
the formulas given in this paper provide a simple measure
which can be used to predict the required disk access behaviors
and hence the total access time.
(Also cross-referenced as UMIACS-TR-96-20
SQLite Optimization with Phase Change Memory for Mobile Applications
ABSTRACT Given its pervasive use in smart mobile platforms, there is a compelling need to optimize the performance of sluggish SQLite databases. Popular mobile applications such as messenger, email and social network services rely on SQLite for their data management need. Those mobile applications tend to execute relatively short transactions in the autocommit mode for transactional consistency in databases. This often has adverse effect on the flash memory storage in mobile devices because the small random updates cause high write amplification and high write latency. In order to address this problem, we propose a new optimization strategy, called per-page logging (PPL), for mobile data management, and have implemented the key functions in SQLite/PPL. The hardware component of SQLite/PPL includes phase change memory (PCM) with a byte-addressable, persistent memory abstraction. By capturing an update in a physiological log record and adding it to the PCM log sector, SQLite/PPL can replace a multitude of successive page writes made to the same logical page with much smaller log writes done to PCM much more efficiently. We have observed that SQLite/PPL would potentially improve the performance of mobile applications by an order of magnitude while supporting transactional atomicity and durability
Run-time and compile-time support for adaptive irregular problems
In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access patterns change during computation. Implementing such problems on distributed memory machines requires support for dynamic data partitioning, efficient preprocessing and fast data migration. This research presents efficient runtime primitives for such problems. This new set of primitives is part of the CHAOS library. It subsumes the previous PARTI library which targeted only static irregular problems. To demonstrate the efficacy of the runtime support, two real adaptive irregular applications have been parallelized using CHAOS primitives: a molecular dynamics code (CHARMM) and a particle-in-cell code (DSMC). The paper also proposes extensions to Fortran D which can allow compilers to generate more efficient code for adaptive problems. These language extensions have been implemented in the Syracuse Fortran 90D/HPF prototype compiler. The performance of the compiler parallelized codes is compared with the hand parallelized versions
- …