13 research outputs found
Surface and Temporal Biosignatures
Recent discoveries of potentially habitable exoplanets have ignited the
prospect of spectroscopic investigations of exoplanet surfaces and atmospheres
for signs of life. This chapter provides an overview of potential surface and
temporal exoplanet biosignatures, reviewing Earth analogues and proposed
applications based on observations and models. The vegetation red-edge (VRE)
remains the most well-studied surface biosignature. Extensions of the VRE,
spectral "edges" produced in part by photosynthetic or nonphotosynthetic
pigments, may likewise present potential evidence of life. Polarization
signatures have the capacity to discriminate between biotic and abiotic "edge"
features in the face of false positives from band-gap generating material.
Temporal biosignatures -- modulations in measurable quantities such as gas
abundances (e.g., CO2), surface features, or emission of light (e.g.,
fluorescence, bioluminescence) that can be directly linked to the actions of a
biosphere -- are in general less well studied than surface or gaseous
biosignatures. However, remote observations of Earth's biosphere nonetheless
provide proofs of concept for these techniques and are reviewed here. Surface
and temporal biosignatures provide complementary information to gaseous
biosignatures, and while likely more challenging to observe, would contribute
information inaccessible from study of the time-averaged atmospheric
composition alone.Comment: 26 pages, 9 figures, review to appear in Handbook of Exoplanets.
Fixed figure conversion error
Perspectives on automated composition of workflows in the life sciences
Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future
systemPipeR: NGS workflow and report generation environment
BACKGROUND: Next-generation sequencing (NGS) has revolutionized how research is carried out in many areas of biology and medicine. However, the analysis of NGS data remains a major obstacle to the efficient utilization of the technology, as it requires complex multi-step processing of big data demanding considerable computational expertise from users. While substantial effort has been invested on the development of software dedicated to the individual analysis steps of NGS experiments, insufficient resources are currently available for integrating the individual software components within the widely used R/Bioconductor environment into automated workflows capable of running the analysis of most types of NGS applications from start-to-finish in a time-efficient and reproducible manner. RESULTS: To address this need, we have developed the R/Bioconductor package systemPipeR. It is an extensible environment for both building and running end-to-end analysis workflows with automated report generation for a wide range of NGS applications. Its unique features include a uniform workflow interface across different NGS applications, automated report generation, and support for running both R and command-line software on local computers and computer clusters. A flexible sample annotation infrastructure efficiently handles complex sample sets and experimental designs. To simplify the analysis of widely used NGS applications, the package provides pre-configured workflows and reporting templates for RNA-Seq, ChIP-Seq, VAR-Seq and Ribo-Seq. Additional workflow templates will be provided in the future. CONCLUSIONS: systemPipeR accelerates the extraction of reproducible analysis results from NGS experiments. By combining the capabilities of many R/Bioconductor and command-line tools, it makes efficient use of existing software resources without limiting the user to a set of predefined methods or environments. systemPipeR is freely available for all common operating systems from Bioconductor (http://bioconductor.org/packages/devel/systemPipeR). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1241-0) contains supplementary material, which is available to authorized users