2 research outputs found

    Program-Level Control of Network Delay for Parallel Asynchronous Iterative Applications

    Software distributed shared memory (DSM) platforms on networks of workstations tolerate large network latencies by employing one of several weak memory consistency models. Fully asynchronous parallel iterative algorithms offer an additional degree of freedom to tolerate network latency: they behave correctly when supplied outdated shared data. However, these algorithms can flood the network with messages in the presence of large delays. We propose a method of controlling asynchronous iterative methods wherein the reader of a shared datum imposes an upper bound on its age via use of a blocking GlobalRead primitive. This reduces the overall number of iterations executed by the reader, thus controlling the amount of shared updates generated. Experiments for a fully asynchronous linear equation solver running on a network of 10 IBM RS/6000 workstations show that the proposed GlobalRead primitive provides significant performance improvement.

    Program-Level Control of Network Delay for Parallel Asynchronous Iterative Applications

    Software distributed shared memory (DSM) platforms on networks of workstations tolerate large network latencies by employing one of several weak memory consistency models. Fully asynchronous parallel iterative algorithms offer an additional degree of freedom to tolerate network latency: they behave correctly when supplied outdated shared data. However, these algorithms can flood the network with messages in the presence of large delays. We propose a method of controlling asynchronous iterative methods wherein the reader of a shared datum imposes an upper bound on its age via use of a blocking Global Read primitive. This reduces the overall number of iterations executed by the reader, thus controlling the amount of shared updates generated. Experiments for a fully asynchronous linear equation solver running on a network of 10 IBM RS/6000 workstations show that the proposed Global Read primitive provides significant performance improvement.
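The idea of a reader bounding the age of a shared datum through a blocking read can be illustrated with a small sketch. This is not the paper's implementation or interface: the class `SharedDatum`, the method names, the `timeout` parameter, and the choice to measure age in iterations (reader iteration minus the iteration that produced the cached copy) are all assumptions made for illustration, and threads within one process stand in for workstations on a network.

```python
import threading


class SharedDatum:
    """Hypothetical local copy of a shared value, tagged with the
    iteration number of the writer that produced it."""

    def __init__(self, value, iteration=0):
        self._value = value
        self._iteration = iteration
        self._cond = threading.Condition()

    def write(self, value, iteration):
        """Install an update; updates older than the current copy are ignored."""
        with self._cond:
            if iteration > self._iteration:
                self._value = value
                self._iteration = iteration
                self._cond.notify_all()  # wake any blocked readers

    def read(self):
        """Fully asynchronous read: return the current copy, however old."""
        with self._cond:
            return self._value, self._iteration

    def global_read(self, reader_iteration, max_age, timeout=None):
        """Block until the copy is at most max_age iterations behind the reader.

        Assumption: age = reader_iteration - iteration of the cached copy.
        """
        needed = reader_iteration - max_age
        with self._cond:
            fresh = self._cond.wait_for(
                lambda: self._iteration >= needed, timeout
            )
            if not fresh:
                raise TimeoutError("no sufficiently recent update arrived")
            return self._value, self._iteration
```

In this sketch a reader that calls `global_read` instead of `read` stalls while its copy is too old, so it cannot run ahead and produce further updates of its own during that time, which is the throttling effect the abstract describes.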