5,400 research outputs found
Improving Performance of Iterative Methods by Lossy Checkponting
Iterative methods are commonly used approaches to solve large, sparse linear
systems, which are fundamental operations for many modern scientific
simulations. When the large-scale iterative methods are running with a large
number of ranks in parallel, they have to checkpoint the dynamic variables
periodically in case of unavoidable fail-stop errors, requiring fast I/O
systems and large storage space. To this end, significantly reducing the
checkpointing overhead is critical to improving the overall performance of
iterative methods. Our contribution is fourfold. (1) We propose a novel lossy
checkpointing scheme that can significantly improve the checkpointing
performance of iterative methods by leveraging lossy compressors. (2) We
formulate a lossy checkpointing performance model and derive theoretically an
upper bound for the extra number of iterations caused by the distortion of data
in lossy checkpoints, in order to guarantee the performance improvement under
the lossy checkpointing scheme. (3) We analyze the impact of lossy
checkpointing (i.e., extra number of iterations caused by lossy checkpointing
files) for multiple types of iterative methods. (4)We evaluate the lossy
checkpointing scheme with optimal checkpointing intervals on a high-performance
computing environment with 2,048 cores, using a well-known scientific
computation package PETSc and a state-of-the-art checkpoint/restart toolkit.
Experiments show that our optimized lossy checkpointing scheme can
significantly reduce the fault tolerance overhead for iterative methods by
23%~70% compared with traditional checkpointing and 20%~58% compared with
lossless-compressed checkpointing, in the presence of system failures.Comment: 14 pages, 10 figures, HPDC'1
Update-Efficiency and Local Repairability Limits for Capacity Approaching Codes
Motivated by distributed storage applications, we investigate the degree to
which capacity achieving encodings can be efficiently updated when a single
information bit changes, and the degree to which such encodings can be
efficiently (i.e., locally) repaired when single encoded bit is lost.
Specifically, we first develop conditions under which optimum
error-correction and update-efficiency are possible, and establish that the
number of encoded bits that must change in response to a change in a single
information bit must scale logarithmically in the block-length of the code if
we are to achieve any nontrivial rate with vanishing probability of error over
the binary erasure or binary symmetric channels. Moreover, we show there exist
capacity-achieving codes with this scaling.
With respect to local repairability, we develop tight upper and lower bounds
on the number of remaining encoded bits that are needed to recover a single
lost bit of the encoding. In particular, we show that if the code-rate is
less than the capacity, then for optimal codes, the maximum number
of codeword symbols required to recover one lost symbol must scale as
.
Several variations on---and extensions of---these results are also developed.Comment: Accepted to appear in JSA
Optimization of stochastic lossy transport networks and applications to power grids
Motivated by developments in renewable energy and smart grids, we formulate a
stylized mathematical model of a transport network with stochastic load
fluctuations. Using an affine control rule, we explore the trade-off between
the number of controllable resources in a lossy transport network and the
performance gain they yield in terms of expected power losses. Our results are
explicit and reveal the interaction between the level of flexibility, the
intrinsic load uncertainty and the network structure.Comment: 30 pages, 10 figure
Audiovisual preservation strategies, data models and value-chains
This is a report on preservation strategies, models and value-chains for digital file-based audiovisual content. The report includes: (a)current and emerging value-chains and business-models for audiovisual preservation;(b) a comparison of preservation strategies for audiovisual content including their strengths and weaknesses, and(c) a review of current preservation metadata models, and requirements for extension to support audiovisual files
Performance Comparison of the RPL and LOADng Routing Protocols in a Home Automation Scenario
RPL, the routing protocol proposed by IETF for IPv6/6LoWPAN Low Power and
Lossy Networks has significant complexity. Another protocol called LOADng, a
lightweight variant of AODV, emerges as an alternative solution. In this paper,
we compare the performance of the two protocols in a Home Automation scenario
with heterogenous traffic patterns including a mix of multipoint-to-point and
point-to-multipoint routes in realistic dense non-uniform network topologies.
We use Contiki OS and Cooja simulator to evaluate the behavior of the
ContikiRPL implementation and a basic non-optimized implementation of LOADng.
Unlike previous studies, our results show that RPL provides shorter delays,
less control overhead, and requires less memory than LOADng. Nevertheless,
enhancing LOADng with more efficient flooding and a better route storage
algorithm may improve its performance
A Configurable Transport Layer for CAF
The message-driven nature of actors lays a foundation for developing scalable
and distributed software. While the actor itself has been thoroughly modeled,
the message passing layer lacks a common definition. Properties and guarantees
of message exchange often shift with implementations and contexts. This adds
complexity to the development process, limits portability, and removes
transparency from distributed actor systems.
In this work, we examine actor communication, focusing on the implementation
and runtime costs of reliable and ordered delivery. Both guarantees are often
based on TCP for remote messaging, which mixes network transport with the
semantics of messaging. However, the choice of transport may follow different
constraints and is often governed by deployment. As a first step towards
re-architecting actor-to-actor communication, we decouple the messaging
guarantees from the transport protocol. We validate our approach by redesigning
the network stack of the C++ Actor Framework (CAF) so that it allows to combine
an arbitrary transport protocol with additional functions for remote messaging.
An evaluation quantifies the cost of composability and the impact of individual
layers on the entire stack
Creative Gardens: Towards Digital Commons
date-added: 2015-03-04 03:12:21 +0000 date-modified: 2015-04-01 06:49:53 +0000date-added: 2015-03-04 03:12:21 +0000 date-modified: 2015-04-01 06:49:53 +0000This work was supported by the Arts and Humanities Research Council, CreativeWorks London Hub, grant AH/J005142/1, and the European Regional Development Fund, London Creative and Digital Fusion
- …