85 research outputs found

    Open Science in Software Engineering

    Full text link
    Open science describes the movement of making any research artefact available to the public and includes, but is not limited to, open access, open data, and open source. While open science is becoming generally accepted as a norm in other scientific disciplines, in software engineering, we are still struggling in adapting open science to the particularities of our discipline, rendering progress in our scientific community cumbersome. In this chapter, we reflect upon the essentials in open science for software engineering including what open science is, why we should engage in it, and how we should do it. We particularly draw from our experiences made as conference chairs implementing open science initiatives and as researchers actively engaging in open science to critically discuss challenges and pitfalls, and to address more advanced topics such as how and under which conditions to share preprints, what infrastructure and licence model to cover, or how do it within the limitations of different reviewing models, such as double-blind reviewing. Our hope is to help establishing a common ground and to contribute to make open science a norm also in software engineering.Comment: Camera-Ready Version of a Chapter published in the book on Contemporary Empirical Methods in Software Engineering; fixed layout issue with side-note

    Signatures of arithmetic simplicity in metabolic network architecture

    Get PDF
    Metabolic networks perform some of the most fundamental functions in living cells, including energy transduction and building block biosynthesis. While these are the best characterized networks in living systems, understanding their evolutionary history and complex wiring constitutes one of the most fascinating open questions in biology, intimately related to the enigma of life's origin itself. Is the evolution of metabolism subject to general principles, beyond the unpredictable accumulation of multiple historical accidents? Here we search for such principles by applying to an artificial chemical universe some of the methodologies developed for the study of genome scale models of cellular metabolism. In particular, we use metabolic flux constraint-based models to exhaustively search for artificial chemistry pathways that can optimally perform an array of elementary metabolic functions. Despite the simplicity of the model employed, we find that the ensuing pathways display a surprisingly rich set of properties, including the existence of autocatalytic cycles and hierarchical modules, the appearance of universally preferable metabolites and reactions, and a logarithmic trend of pathway length as a function of input/output molecule size. Some of these properties can be derived analytically, borrowing methods previously used in cryptography. In addition, by mapping biochemical networks onto a simplified carbon atom reaction backbone, we find that several of the properties predicted by the artificial chemistry model hold for real metabolic networks. These findings suggest that optimality principles and arithmetic simplicity might lie beneath some aspects of biochemical complexity

    A retrospective population-based study of childhood hospital admissions with record linkage to a birth defects registry

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Using population-based linked records of births, deaths, birth defects and hospital admissions for children born 1980–1999 enables profiles of hospital morbidity to be created for each child.</p> <p>Methods</p> <p>This is an analysis of a state-based registry of birth defects linked to population-based hospital admission data. Transfers and readmissions within one day could be taken into account and treated as one episode of care for the purposes of analyses (N = 485,446 children; 742,845 non-birth admissions).</p> <p>Results</p> <p>Children born in Western Australia from 1980–1999 with a major birth defect comprised 4.6% of live births but 12.0% of non-birth hospital admissions from 1980–2000. On average, the children with a major birth defect remained in hospital longer than the children in the comparison group for the same diagnosis. The mean and median lengths of stay (LOS) for admissions before the age of 5 years have decreased for all children since 1980. However, the mean number of admissions per child admitted has remained constant at around 3.8 admissions for children with a major birth defect and 2.2 admissions for all other children.</p> <p>Conclusion</p> <p>To gain a true picture of the burden of hospital-based morbidity in childhood, admission records need to be linked for each child. We have been able to do this at a population level using birth defect cases ascertained by a birth defects registry. Our results showed a greater mean LOS and mean number of admissions per child admitted than previous studies. The results suggest there may be an opportunity for the children with a major birth defect to be monitored and seen earlier in the primary care setting for common childhood illnesses to avoid hospitalisation or reduce the LOS.</p

    The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome

    Get PDF
    Although a variety of possible functions have been proposed for inverted repeat sequences (IRs), it is not known which of them might occur in vivo. We investigate this question by assessing the distributions and properties of IRs in the Saccharomyces cerevisiae (SC) genome. Using the IRFinder algorithm we detect 100,514 IRs having copy length greater than 6 bp and spacer length less than 77 bp. To assess statistical significance we also determine the IR distributions in two types of randomization of the S. cerevisiae genome. We find that the S. cerevisiae genome is significantly enriched in IRs relative to random. The S. cerevisiae IRs are significantly longer and contain fewer imperfections than those from the randomized genomes, suggesting that processes to lengthen and/or correct errors in IRs may be operative in vivo. The S. cerevisiae IRs are highly clustered in intergenic regions, while their occurrence in coding sequences is consistent with random. Clustering is stronger in the 3′ flanks of genes than in their 5′ flanks. However, the S. cerevisiae genome is not enriched in those IRs that would extrude cruciforms, suggesting that this is not a common event. Various explanations for these results are considered
    corecore