16 research outputs found

    Near-Optimal Placement of MPI processes on Hierarchical NUMA Architectures

    Get PDF
    International audienceMPI process placement can play a deterministic role concerning the application performance. This is especially true with nowadays architecture (heterogenous, multicore with different level of caches, etc.). In this paper, we will describe a novel algorithm called TreeMatch that maps processes to resources in order to reduce the communication cost of the whole application. We have implemented this algorithm and will discuss its performance using simulation and on the NAS benchmarks

    Towards an efficient process placement policy for MPI applications in multicore environments

    Get PDF
    International audienceThis paper presents a method to efficiently place MPI processes on multicore machines. Since MPI implementations often feature efficient supports for both shared-memory and network communication, an adequate placement policy is a crucial step to improve applications performance. As a case study, we show the results obtained for several NAS computing kernels and explain how the policy influences overall performance. In particular, we found out that a policy merely increasing the intranode communication ratio is not enough and that cache utilization is also an influential factor. A more sophisticated policy (eg. one taking into account the architecture's memory structure) is required to observe performance improvements

    Calculation of ground states of four-dimensional +or- J Ising spin glasses

    Full text link
    Ground states of four-dimensional (d=4) EA Ising spin glasses are calculated for sizes up to 7x7x7x7 using a combination of a genetic algorithm and cluster-exact approximation. The ground-state energy of the infinite system is extrapolated as e_0=-2.095(1). The ground-state stiffness (or domain wall) energy D is calculated. A D~L^{\Theta} behavior with \Theta=0.65(4) is found which confirms that the d=4 model has an equilibrium spin-glass-paramagnet transition for non-zero T_c.Comment: 5 pages, 3 figures, 31 references, revtex; update of reference

    Low-energy excitations in the three-dimensional random-field Ising model

    Get PDF
    The random-field Ising model (RFIM), one of the basic models for quenched disorder, can be studied numerically with the help of efficient ground-state algorithms. In this study, we extend these algorithm by various methods in order to analyze low-energy excitations for the three-dimensional RFIM with Gaussian distributed disorder that appear in the form of clusters of connected spins. We analyze several properties of these clusters. Our results support the validity of the droplet-model description for the RFIM.Comment: 10 pages, 9 figure

    HPPC 2010:5th Workshop on highly parallel processing on a chip

    No full text

    Automatic Memory Optimizations for Improving MPI Derived Datatype Performance

    No full text

    Improving the Performance of Collective Operations in MPICH

    No full text

    Collective Error Detection for MPI Collective Operations

    No full text
    corecore