The President as International Leader
In this thesis, we address issues associated with programming modern heterogeneous systems, focusing on a special kind of heterogeneous system that includes multicore CPUs and one or more GPUs, called GPU-based systems. We consider the skeleton programming approach to achieve high-level abstraction for efficient and portable programming of these GPU-based systems and present our work on the SkePU library, a skeleton library for these systems. We extend the existing SkePU library with a two-dimensional (2D) data type and skeleton operations and implement several new applications using the newly added skeletons. Furthermore, we consider the algorithmic choice present in SkePU and implement support to specify and automatically optimize the algorithmic choice for a skeleton call on a given platform. To show how to achieve performance, we provide a case study on an optimized GPU-based skeleton implementation for 2D stencil computations and introduce two metrics to maximize resource utilization on a GPU. By devising a mechanism to automatically calculate these two metrics, performance can be retained while porting an application from one GPU architecture to another. Another contribution of this thesis is the implementation of runtime support for the SkePU skeleton library, achieved with the help of the StarPU runtime system. This implementation provides support for dynamic scheduling and load balancing for SkePU skeleton programs. Furthermore, a capability for hybrid execution, that is, parallel execution on all available CPUs and GPUs in a system, even for a single skeleton invocation, is developed. SkePU initially supported only data-parallel skeletons. The first task-parallel skeleton (farm) in SkePU is implemented with support for performance-aware scheduling and hierarchical parallel execution by enabling all data-parallel skeletons to be usable as tasks inside the farm construct.
Experimental evaluations are carried out and presented for the algorithmic selection, performance portability, dynamic scheduling, and hybrid execution aspects of our work.
Wounded cells drive rapid epidermal repair in the early Drosophila
Epithelial tissues are protective barriers that display a remarkable ability to repair wounds. Wound repair is often associated with an accumulation of actin and nonmuscle myosin II around the wound, forming a purse string. The role of actomyosin networks in generating mechanical force during wound repair is not well understood. Here we investigate the mechanisms of force generation during wound repair in the epidermis of early and late Drosophila embryos. We find that wound closure is faster in early embryos, where, in addition to a purse string around the wound, actomyosin networks at the medial cortex of the wounded cells contribute to rapid wound repair. Laser ablation demonstrates that both medial and purse-string actomyosin networks generate contractile force. Quantitative analysis of protein localization dynamics during wound closure indicates that the rapid contraction of medial actomyosin structures during wound repair in early embryos involves disassembly of the actomyosin network. By contrast, actomyosin purse strings in late embryos contract more slowly in a mechanism that involves network condensation. We propose that the combined action of two force-generating structures (a medial actomyosin network and an actomyosin purse string) contributes to the increased efficiency of wound repair in the early embryo.