45 research outputs found

    Implementing Fine/Medium Grained TLP Support in a Many-Core Architecture

    Get PDF
    Abstract. We believe that future many-core architectures should support a simple and scalable way to execute many threads that are generated by parallel programs. A good candidate to implement an efficient and scalable execution of threads is the DTA (Decoupled Threaded Architecture), which is designed to exploit fine/medium grained Thread Level Parallelism (TLP) by using a hardware scheduling unit and relying on existing simple cores. In this paper, we present an initial implementation of DTA concept in a many-core architecture where it interacts with other architectural components designed from scratch in order to address the problem of scalability. We present initial results that show the scalability of the solution that were obtained using a many-core simulator written in SARCSim (a variant of UNISIM) with DTA support

    An MPI Tool to Measure Application Sensitivity to Variation in Communication Parameters

    No full text

    Wisp

    No full text

    An Efficient In-Network Event Detection Algorithm for Wireless Sensor Nodes

    No full text

    U-Net

    No full text

    A case for NOW (Networks of Workstations)

    No full text

    Optimization of Parallel Algorithms on Cluster of SMP’s

    No full text

    Design Middleware Platforms for Ubiquitous Smart Service on City Gas Environment in Korea

    No full text
    corecore