2,761 research outputs found

    Multiphase complete exchange on a circuit switched hypercube

    Get PDF
    On a distributed memory parallel computer, the complete exchange (all-to-all personalized) communication pattern requires each of n processors to send a different block of data to each of the remaining n - 1 processors. This pattern is at the heart of many important algorithms, most notably the matrix transpose. For a circuit switched hypercube of dimension d(n = 2(sup d)), two algorithms for achieving complete exchange are known. These are (1) the Standard Exchange approach that employs d transmissions of size 2(sup d-1) blocks each and is useful for small block sizes, and (2) the Optimal Circuit Switched algorithm that employs 2(sup d) - 1 transmissions of 1 block each and is best for large block sizes. A unified multiphase algorithm is described that includes these two algorithms as special cases. The complete exchange on a hypercube of dimension d and block size m is achieved by carrying out k partial exchange on subcubes of dimension d(sub i) Sigma(sup k)(sub i=1) d(sub i) = d and effective block size m(sub i) = m2(sup d-di). When k = d and all d(sub i) = 1, this corresponds to algorithm (1) above. For the case of k = 1 and d(sub i) = d, this becomes the circuit switched algorithm (2). Changing the subcube dimensions d, varies the effective block size and permits a compromise between the data permutation and block transmission overhead of (1) and the startup overhead of (2). For a hypercube of dimension d, the number of possible combinations of subcubes is p(d), the number of partitions of the integer d. This is an exponential but very slowly growing function and it is feasible over these partitions to discover the best combination for a given message size. The approach was analyzed for, and implemented on, the Intel iPSC-860 circuit switched hypercube. Measurements show good agreement with predictions and demonstrate that the multiphase approach can substantially improve performance for block sizes in the 0 to 160 byte range. This range, which corresponds to 0 to 40 floating point numbers per processor, is commonly encountered in practical numeric applications. The multiphase technique is applicable to all circuit-switched hypercubes that use the common e-cube routing strategy

    A network flow model for load balancing in circuit-switched multicomputers

    Get PDF
    In multicomputers that utilize circuit switching or wormhole routing, communication overhead depends largely on link contention - the variation due to distance between nodes is negligible. This has a major impact on the load balancing problem. In this case, there are some nodes with excess load (sources) and others with deficit load (sinks) and it is required to find a matching of sources to sinks that avoids contention. The problem is made complex by the hardwired routing on currently available machines: the user can control only which nodes communicate but not how the messages are routed. Network flow models of message flow in the mesh and the hypercube were developed to solve this problem. The crucial property of these models is the correspondence between minimum cost flows and correctly routed messages. To solve a given load balancing problem, a minimum cost flow algorithm is applied to the network. This permits one to determine efficiently a maximum contention free matching of sources to sinks which, in turn, tells one how much of the given imbalance can be eliminated without contention

    Efficient algorithms for a class of partitioning problems

    Get PDF
    The problem of optimally partitioning the modules of chain- or tree-like tasks over chain-structured or host-satellite multiple computer systems is addressed. This important class of problems includes many signal processing and industrial control applications. Prior research has resulted in a succession of faster exact and approximate algorithms for these problems. Polynomial exact and approximate algorithms are described for this class that are better than any of the previously reported algorithms. The approach is based on a preprocessing step that condenses the given chain or tree structured task into a monotonic chain or tree. The partitioning of this monotonic take can then be carried out using fast search techniques

    Pro-poor growth: concepts and measurement with country case studies

    Get PDF
    .Poverty, inequality, pro-poor, Korea, Thailand, Vi

    Effect of torch angle on arc properties and weld pool shape in stationary GTAW

    Get PDF
    In this paper, a three dimensional numerical simulation is performed on a stationary arc to study the effect of torch angle in gas tungsten arc welding (GTAW) of SS304 stainless steel. A comparison has been made to investigate 90o and 70o torch angles and analyze the effect on arc and weld pool shape. Current density, heat flux and gas shear stress are calculated in the arc region and are used as input to the workpiece to determine the weld pool. Buoyancy and Marangoni shear also affect the weld pool shape and are taken into account. The computed and experimental results are observed symmetric for 90o torch angle. For 70o torch angle, current density and hence the heat flux due to electron contribution is found the maximum behind and heat flux due to conduction and convection is found the maximum ahead of the electrode tip in the welding direction. This makes the maximum of total heat flux symmetric along the arc center. Heat flux due to conduction and convection decreases as the torch angle decreases resulting in a shallow weld pool. The nonsymmetric “w” shaped weld pool is developed by the combined effect of the gas shear and Marangoni convection. It is found that for 70o torch angle, the weld pool becomes non-symmetric, shallow and wide ahead of the electrode tip in the welding direction. The numerical weld pool shapes are verified through experiments

    Dielectric Spectroscopy of Chromium Doped and Undoped Semi-Insulating Gallium Arsenide

    Get PDF

    Establishing Property Rights through a Secure System of Land Title Management

    Get PDF
    Land is a critical economic asset in any economy. The need for a timely, accurate, safe, simple, secure, and universally accessible system of registering and recording land transactions and interest in land is the cornerstone of a proper land market. Such land markets are an important mechanism to empower the poor and reduce the costs of doing business.1 Secure property rights provide asset owners the incentive to invest in their property while allowing easier collateralisation. Without secure tradable property rights, land is ‘dead capital’

    Chiral Nonet Mixing in pi pi Scattering

    Get PDF
    Pion pion scattering is studied in a generalized linear sigma model which contains two scalar nonets (one of quark-antiquark type and the other of diquark-antidiquark type) and two corresponding pseudoscalar nonets. An interesting feature concerns the mixing of the four isosinglet scalar mesons which yield poles in the scattering amplitude. Some realism is introduced by enforcing exact unitarity via the K-matrix method. It is shown that a reasonable agreement with experimental data is obtained up to about 1 GeV. The poles in the unitarized scattering amplitude are studied in some detail. The lowest pole clearly represents the sigma meson (or f0(600)) with a mass and decay width around 500 MeV. The second pole invites comparison with the f0(980) which has a mass around 1 GeV and decay width around 100 MeV. The third and fourth poles, resemble some of the isosinglet state in the complicated 1-2 GeV region. Some comparison is made to the situation in the usual SU(3) linear sigma model with a single scalar nonet
    corecore