140,120 research outputs found
Query processing of spatial objects: Complexity versus Redundancy
The management of complex spatial objects in applications, such as geography and cartography,
imposes stringent new requirements on spatial database systems, in particular on efficient
query processing. As shown before, the performance of spatial query processing can be improved
by decomposing complex spatial objects into simple components. Up to now, only decomposition
techniques generating a linear number of very simple components, e.g. triangles or trapezoids, have
been considered. In this paper, we will investigate the natural trade-off between the complexity of
the components and the redundancy, i.e. the number of components, with respect to its effect on
efficient query processing. In particular, we present two new decomposition methods generating
a better balance between the complexity and the number of components than previously known
techniques. We compare these new decomposition methods to the traditional undecomposed representation
as well as to the well-known decomposition into convex polygons with respect to their
performance in spatial query processing. This comparison points out that for a wide range of query
selectivity the new decomposition techniques clearly outperform both the undecomposed representation
and the convex decomposition method. More important than the absolute gain in performance
by a factor of up to an order of magnitude is the robust performance of our new decomposition
techniques over the whole range of query selectivity
Scheduling Storms and Streams in the Cloud
Motivated by emerging big streaming data processing paradigms (e.g., Twitter
Storm, Streaming MapReduce), we investigate the problem of scheduling graphs
over a large cluster of servers. Each graph is a job, where nodes represent
compute tasks and edges indicate data-flows between these compute tasks. Jobs
(graphs) arrive randomly over time, and upon completion, leave the system. When
a job arrives, the scheduler needs to partition the graph and distribute it
over the servers to satisfy load balancing and cost considerations.
Specifically, neighboring compute tasks in the graph that are mapped to
different servers incur load on the network; thus a mapping of the jobs among
the servers incurs a cost that is proportional to the number of "broken edges".
We propose a low complexity randomized scheduling algorithm that, without
service preemptions, stabilizes the system with graph arrivals/departures; more
importantly, it allows a smooth trade-off between minimizing average
partitioning cost and average queue lengths. Interestingly, to avoid service
preemptions, our approach does not rely on a Gibbs sampler; instead, we show
that the corresponding limiting invariant measure has an interpretation
stemming from a loss system.Comment: 14 page
From Instantly Decodable to Random Linear Network Coding
Our primary goal in this paper is to traverse the performance gap between two
linear network coding schemes: random linear network coding (RLNC) and
instantly decodable network coding (IDNC) in terms of throughput and decoding
delay. We first redefine the concept of packet generation and use it to
partition a block of partially-received data packets in a novel way, based on
the coding sets in an IDNC solution. By varying the generation size, we obtain
a general coding framework which consists of a series of coding schemes, with
RLNC and IDNC identified as two extreme cases. We then prove that the
throughput and decoding delay performance of all coding schemes in this coding
framework are bounded between the performance of RLNC and IDNC and hence
throughput-delay tradeoff becomes possible. We also propose implementations of
this coding framework to further improve its throughput and decoding delay
performance, to manage feedback frequency and coding complexity, or to achieve
in-block performance adaption. Extensive simulations are then provided to
verify the performance of the proposed coding schemes and their
implementations.Comment: 30 pages with double space, 14 color figure
- …