11 research outputs found

    An experimental validation of the PRO model for parallel and distributed computation

    Get PDF
    National audienceThe Parallel Resource-Optimal (PRO) computation model was introduced by Gebremedhin et al. [2002] as a framework for the design and analysis of efficient parallel algorithms. The key features of the PRO model that distinguish it from previous parallel computation models are the full integration of resource-optimality into the design process and the use of a {granularity function as a parameter for measuring quality. In this paper we present experimental results on parallel algorithms, designed using the PRO model, for two representative problems: list ranking and sorting. The algorithms are implemented using SSCRAP, our environment for developing coarse-grained algorithms. The experimental performance results observed agree well with analytical predictions using the PRO model. Moreover, by using different platforms to run our experiments, we have been able to provide an integrated view of the modeling of an underlying architecture and the design and implementation of scalable parallel algorithms

    Incrementally Maintaining the Number of l-cliques

    Get PDF
    The main contribution of this paper is an incremental algorithm to update the number of ll-cliques, for l3l \geq 3, in which each node of a graph is contained, after the deletion of an arbitrary node. The initialization cost is O(nωp+q)O(n^{\omega p+q}), where nn is the number of nodes, p=l3p=\lfloor \frac{l}{3} \rfloor, q=l(mod3)q=l \pmod{3}, and ω=ω(1,1,1)\omega=\omega(1,1,1) is the exponent of the multiplication of two nxnn x n matrices. The amortized updating cost is O(nqT(n,p,ϵ))O(n^{q}T(n,p,\epsilon)) for any ϵ[0,1]\epsilon \in [0,1], where T(n,p,ϵ)=min{np1(np(1+ϵ)+np(ω(1,ϵ,1)ϵ)),npω(1,p1p,1)}T(n,p,\epsilon)=\min\{n^{p-1}(n^{p(1+\epsilon)}+n^{p(\omega(1,\epsilon,1)-\epsilon)}),n^{p \omega(1,\frac{p-1}{p},1)}\} and ω(1,r,1)\omega(1,r,1) is the exponent of the multiplication of an nxnrn x n^{r} matrix by an nrxnn^{r} x n matrix. The current best bounds on ω(1,r,1)\omega(1,r,1) imply an O(n2.376p+q)O(n^{2.376p+q}) initialization cost, an O(n2.575p+q1)O(n^{2.575p+q-1}) updating cost for 3l83 \leq l \leq 8, and an O(n2.376p+q0.532)O(n^{2.376p+q-0.532}) updating cost for l9l \geq 9. An interesting application to constraint programming is also considered

    Cutting planes and the elementary closure in fixed dimension

    Get PDF
    The elementary closure PP' of a polyhedron PP is the intersection of PP with all its Gomory-Chvátal cutting planes. PP' is a rational polyhedron provided that PP is rational. The known bounds for the number of inequalities defining PP' are exponential, even in fixed dimension. We show that the number of inequalities needed to describe the elementary closure of a rational polyhedron is polynomially bounded in fixed dimension. If PP is a simplicial cone, we construct a polytope QQ, whose integral elements correspond to cutting planes of PP. The vertices of the integer hull QIQ_I include the facets of PP'. A polynomial upper bound on their number can be obtained by applying a result of Cook et al. Finally, we present a polynomial algorithm in varying dimension, which computes cutting planes for a simplicial cone that correspond to vertices of QIQ_I

    Experiments with iterative improvement algorithms on completely unimodel hypercubes

    No full text

    Performance Implications by the Hierarchical Design of Clusters

    Get PDF
    Colloque avec actes et comité de lecture. internationale.International audienceWe present experimental results for the evaluation of PC clusters that differ on several aspect of their architecture, such as being mono or biprocessors and having a high bandwidth interconnection network or not. These experiments confirm that a good equilibrium between the speed (throughput) of the different components is crucial for a satisfactory performance of such a machine. On the other hand they also show that the latency of the underlying infrastructure is of less importance and can be hidden by applying techniques from coarse grained parallelism

    A deductive model checking approach for hybrid systems

    Get PDF
    In this paper we propose a verification method for hybrid systems that is based on a successive elimination of the various system locations involved. Briefly, with each such elimination we compute a weakest precondition (strongest postcondition) on the predecessor (successor) locations such that the property to be proved cannot be violated. This is done by representing a given verification problem as a second-order predicate logic formula which is to be solved (proved valid) with the help of a second-order quantifier elimination method. In contrast to many ``standard'' model checking approaches the method as described in this paper does not perform a forward or backward reachability analysis. Experiments show that this approach is particularly interesting in cases where a standard reachability analysis would require to travel often through some of the given system locations. In addition, the approach offers possibilities to proceed where ``standard'' reachability analysis approaches do not terminate

    Superposition and chaining for totally ordered divisible abelian groups

    No full text
    We present a calculus for first-order theorem proving in the presence of the axioms of totally ordered divisible abelian groups. The calculus extends previous superposition or chaining calculi for divisible torsion-free abelian groups and dense total orderings without endpoints. As its predecessors, it is refutationally complete and requires neither explicit inferences with the theory axioms nor variable overlaps. It offers thus an efficient way of treating equalities and inequalities between additive terms over, e.g., the rational numbers within a first-order theorem prover

    Fifth Biennial Report : June 1999 - August 2001

    No full text

    Ultimate Parallel List Ranking?

    No full text

    Ultimate parallel list ranking?

    No full text
    Two improved list-ranking algorithms are presented. The ``peeling-off'' algorithm leads to an optimal PRAM algorithm, but was designed with application on a real parallel machine in mind. It is simpler than earlier algorithms, and in a range of problem sizes, where previously several algorithms where required for the best performance, now this single algorithm suffices. If the problem size is much larger than the number of available processors, then the ``sparse-ruling-sets'' algorithm is even better. In previous versions this algorithm had very restricted practical application because of the large number of communication rounds it was performing. This main weakness of this algorithm is overcome by adding two new ideas, each of which reduces the number of communication rounds by a factor of two
    corecore