1 research outputs found

    Limiting the memory footprint when dynamically scheduling DAGs on shared-memory platforms

    Get PDF
    Scientific workflows are frequently modeled as Directed Acyclic Graphs (DAGs) oftasks, which represent computational modules and their dependences in the form of data pro-duced by a task and used by another one. This formulation allows the use of runtime systemswhich dynamically allocate tasks onto the resources of increasingly complex computing plat-forms. However, for some workflows, such a dynamic schedule may run out of memory byprocessing too many tasks simultaneously. This paper focuses on the problem of transform-ing such a DAG to prevent memory shortage, and concentrates on shared memory platforms.We first propose a simple model of DAGs which is expressive enough to emulate complex mem-ory behaviors. We then exhibit a polynomial-time algorithm that computes the maximum peakmemory of a DAG, that is, the maximum memory needed by any parallel schedule. We considerthe problem of reducing this maximum peak memory to make it smaller than a given bound.Our solution consists in adding new fictitious edges, while trying to minimize the critical pathof the graph. After proving that this problem is NP-complete, we provide an ILP solution as wellas several heuristic strategies that are thoroughly compared by simulation on synthetic DAGsmodeling actual computational workflows. We show that on most instances we are able to de-crease the maximum peak memory at the cost of a small increase in the critical path, thus withlittle impact on the quality of the final parallel schedule
    corecore