97 research outputs found

    Distilling Structure in Scientific Workflows

    In this work, we have conducted a series of experiments to better understand the structure of scientific workflows. In particular, we have investigated techniques to understand why scientific workflows may or may not have a series-parallel structure.

    Distilling structure in Taverna scientific workflows: a refactoring approach

    BACKGROUND: Scientific workflow management systems are increasingly used to specify and manage bioinformatics experiments. Their programming model appeals to bioinformaticians, who can use them to easily specify complex data processing pipelines. Such a model is underpinned by a graph structure, where nodes represent bioinformatics tasks and links represent the dataflow. The complexity of such graph structures is increasing over time, with possible impacts on the reuse of scientific workflows. In this work, we propose effective methods for workflow design, with a focus on the Taverna model. We argue that one of the contributing factors for the difficulties in reuse is the presence of "anti-patterns", a term broadly used in program design to indicate the use of idiomatic forms that lead to over-complicated design. The main contribution of this work is a method for automatically detecting such anti-patterns and replacing them with different patterns that reduce the workflow's overall structural complexity. Rewriting workflows in this way will be beneficial both in terms of user experience (easier design and maintenance) and in terms of operational efficiency (easier to manage, and sometimes easier to exploit the latent parallelism amongst the tasks). RESULTS: We have conducted a thorough study of the workflow structures available in Taverna, with the aim of identifying workflow fragments whose structure could be made simpler without altering the workflow semantics. We provide four contributions. Firstly, we identify a set of anti-patterns that contribute to the structural workflow complexity. Secondly, we design a series of refactoring transformations to replace each anti-pattern by a new semantically-equivalent pattern with less redundancy and simplified structure. Thirdly, we introduce a distilling algorithm that takes in a workflow and produces a distilled semantically-equivalent workflow. Lastly, we provide an implementation of our refactoring approach that we evaluate on both the public Taverna workflows and on a private collection of workflows from the BioVel project. CONCLUSION: We have designed and implemented an approach to improving workflow structure by rewriting workflows while preserving their semantics. Future work includes considering our refactoring approach during the phase of workflow design and proposing guidelines for designing distilled workflows.

    Building Ontologies in DAML + OIL

    In this article we describe an approach to representing and building ontologies advocated by the Bioinformatics and Medical Informatics groups at the University of Manchester. The hand-crafting of ontologies offers an easy and rapid avenue to delivering ontologies. Experience has shown that such approaches are unsustainable. Description logic approaches have been shown to offer computational support for building sound, complete and logically consistent ontologies. A new knowledge representation language, DAML + OIL, offers a new standard that is able to support many styles of ontology, from hand-crafted to full logic-based descriptions with reasoning support. We describe this language, the OilEd editing tool, reasoning support and a strategy for the language’s use. We finish with a current example, in the Gene Ontology Next Generation (GONG) project, that uses DAML + OIL as the basis for moving the Gene Ontology from its current hand-crafted form to one that uses logical descriptions of a concept’s properties to deliver a more complete version of the ontology.

    Taverna, reloaded

    The Taverna workflow management system is an open source project with a history of widespread adoption within multiple experimental science communities, and a long-term ambition of effectively supporting the evolving need of those communities for complex, data-intensive, service-based experimental pipelines. This short paper describes how the recently overhauled technical architecture of Taverna addresses issues of efficiency, scalability, and extensibility, and presents performance results based on a collection of synthetic workflows, as well as a concrete case study involving a production workflow in the area of cancer research.

    Health related quality of life trajectories and predictors following coronary artery bypass surgery

    BACKGROUND: Many studies have demonstrated that health related quality of life (HRQoL) improves, on average, after coronary artery bypass graft surgery (CABGS). However, this average improvement may not be realized for all patients, and it is possible that there are two or more distinctive groups with different, possibly non-linear, trajectories of change over time. Furthermore, little is known about the predictors that are associated with these possible HRQoL trajectories after CABGS. METHODS: 182 patients listed for elective CABGS at The Royal Melbourne Hospital completed a postal battery of questionnaires which included the Short-Form-36 (SF-36), Profile of Mood States (POMS) and the Everyday Functioning Questionnaire (EFQ). These data were collected on average a month before surgery, and at two months and six months after surgery. Socio-demographic and medical characteristics prior to surgery, as well as surgical and post-surgical complications and symptoms were also assessed. Growth curve and growth mixture modelling were used to identify trajectories of HRQoL. RESULTS: For both the physical component summary scale (PCS) and the mental component summary scale (MCS) of the SF-36, two groups of patients with distinct trajectories of HRQoL following surgery could be identified (improvers and non-improvers). A series of logistic regression analyses identified different predictors of group membership for PCS and MCS trajectories. For the PCS the most significant predictors of non-improver membership were lower scores on POMS vigor-activity and higher New York Heart Association dyspnoea class; for the MCS the most significant predictors of non-improver membership were higher scores on POMS depression-dejection and manual occupation. CONCLUSION: It is incorrect to assume that HRQoL will improve in a linear fashion for all patients following CABGS. Nor was there support for a single response trajectory. It is important to identify characteristics of each patient, and those post-operative symptoms that could be possible targets for intervention to improve HRQoL outcomes.

    DistillFlow: removing redundancy in scientific workflows

    Scientific workflow management systems are increasingly used by scientists to specify complex data processing pipelines. Workflows are represented using a graph structure, where nodes represent tasks and links represent the dataflow. However, the complexity of workflow structures is increasing over time, reducing the rate of scientific workflow reuse. Here, we introduce DistillFlow, a tool based on effective methods for workflow design, with a focus on the Taverna model. DistillFlow is able to detect "anti-patterns" in the structure of workflows (idiomatic forms that lead to over-complicated design) and replace them with different patterns to reduce the workflow's overall structural complexity. Rewriting workflows in this way is beneficial both in terms of user experience and workflow maintenance.

    Landscape Analysis for the Specimen Data Refinery

    This report reviews the current state of the art in applied approaches to automated tools, services and workflows for extracting information from images of natural history specimens and their labels. We consider the potential for repurposing existing tools, including workflow management systems, and areas where more development is required. This paper was written as part of the SYNTHESYS+ project for software development teams and informatics teams working on new software-based approaches to improve mass digitisation of natural history specimens.

    Proteinase 3 promotes formation of multinucleated giant cells and granuloma-like structures in patients with granulomatosis with polyangiitis

    OBJECTIVES: Granulomatosis with polyangiitis (GPA) and microscopic polyangiitis (MPA) are autoimmune vasculitides associated with antineutrophil cytoplasm antibodies that target proteinase 3 (PR3) or myeloperoxidase (MPO) found within neutrophils and monocytes. Granulomas are exclusively found in GPA and form around multinucleated giant cells (MGCs), at sites of microabscesses, containing apoptotic and necrotic neutrophils. Since patients with GPA have augmented neutrophil PR3 expression, and PR3-expressing apoptotic cells frustrate macrophage phagocytosis and cellular clearance, we investigated the role of PR3 in stimulating giant cell and granuloma formation. METHODS: We stimulated purified monocytes and whole peripheral blood mononuclear cells (PBMCs) from patients with GPA, patients with MPA or healthy controls with PR3 or MPO and visualised MGC and granuloma-like structure formation using light, confocal and electron microscopy, as well as measuring the cell cytokine production. We investigated the expression of PR3 binding partners on monocytes and tested the impact of their inhibition. Finally, we injected zebrafish with PR3 and characterised granuloma formation in a novel animal model. RESULTS: In vitro, PR3 promoted monocyte-derived MGC formation using cells from patients with GPA but not from patients with MPA, and this was dependent on soluble interleukin 6 (IL-6), as well as monocyte MAC-1 and protease-activated receptor-2, found to be overexpressed in the cells of patients with GPA. PBMCs stimulated by PR3 formed granuloma-like structures with central MGC surrounded by T cells. This effect of PR3 was confirmed in vivo using zebrafish and was inhibited by niclosamide, an IL-6-STAT3 pathway inhibitor. CONCLUSIONS: These data provide a mechanistic basis for granuloma formation in GPA and a rationale for novel therapeutic approaches.