15 research outputs found

    A Factor Graph Approach to Automated GO Annotation

    Get PDF
    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum.Fil: Spetale, Flavio Ezequiel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas. Universidad Nacional de Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas; ArgentinaFil: Krsticevic, Flavia Jorgelina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas. Universidad Nacional de Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas; ArgentinaFil: Roda, Fernando. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas. Universidad Nacional de Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas; ArgentinaFil: Bulacio, Pilar Estela. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas. Universidad Nacional de Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y de Sistemas; Argentin

    The Mouse Limb Anatomy Atlas: An interactive 3D tool for studying embryonic limb patterning

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The developing mouse limb is widely used as a model system for studying tissue patterning. Despite this, few references are available that can be used for the correct identification of developing limb structures, such as muscles and tendons. Existing textual references consist of two-dimensional (2D) illustrations of the adult rat or mouse limb that can be difficult to apply when attempting to describe the complex three-dimensional (3D) relationship between tissues.</p> <p>Results</p> <p>To improve the resources available in the mouse model, we have generated a free, web-based, interactive reference of limb muscle, tendon, and skeletal structures at embryonic day (E) 14.5 <url>http://www.nimr.mrc.ac.uk/3dlimb/</url>. The Atlas was generated using mouse forelimb and hindlimb specimens stained using immunohistochemistry to detect muscle and tendon. Limbs were scanned using Optical Projection Tomography (OPT), reconstructed to make 3D models and annotated using computer-assisted segmentation tools in Amira 3D Visualisation software. The annotated dataset is visualised using Java, JAtlasView software. Users click on the names of structures and view their shape, position and relationship with other structures within the 3D model and also in 2D virtual sections.</p> <p>Conclusion</p> <p>The Mouse Limb Anatomy Atlas provides a novel and valuable tool for researchers studying limb development and can be applied to a range of research areas, including the identification of abnormal limb patterning in transgenic lines and studies of models of congenital limb abnormalities. By using the Atlas for "virtual" dissection, this resource offers an alternative to animal dissection. The techniques we have developed and employed are also applicable to many other model systems and anatomical structures.</p

    The Mouse Limb Anatomy Atlas: An interactive 3D tool for studying embryonic limb patterning

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The developing mouse limb is widely used as a model system for studying tissue patterning. Despite this, few references are available that can be used for the correct identification of developing limb structures, such as muscles and tendons. Existing textual references consist of two-dimensional (2D) illustrations of the adult rat or mouse limb that can be difficult to apply when attempting to describe the complex three-dimensional (3D) relationship between tissues.</p> <p>Results</p> <p>To improve the resources available in the mouse model, we have generated a free, web-based, interactive reference of limb muscle, tendon, and skeletal structures at embryonic day (E) 14.5 <url>http://www.nimr.mrc.ac.uk/3dlimb/</url>. The Atlas was generated using mouse forelimb and hindlimb specimens stained using immunohistochemistry to detect muscle and tendon. Limbs were scanned using Optical Projection Tomography (OPT), reconstructed to make 3D models and annotated using computer-assisted segmentation tools in Amira 3D Visualisation software. The annotated dataset is visualised using Java, JAtlasView software. Users click on the names of structures and view their shape, position and relationship with other structures within the 3D model and also in 2D virtual sections.</p> <p>Conclusion</p> <p>The Mouse Limb Anatomy Atlas provides a novel and valuable tool for researchers studying limb development and can be applied to a range of research areas, including the identification of abnormal limb patterning in transgenic lines and studies of models of congenital limb abnormalities. By using the Atlas for "virtual" dissection, this resource offers an alternative to animal dissection. The techniques we have developed and employed are also applicable to many other model systems and anatomical structures.</p

    Anatomical ontologies: names and places in biology

    Get PDF
    Ontology has long been the preserve of philosophers and logicians. Recently, ideas from this field have been picked up by computer scientists as a basis for encoding knowledge and with the hope of achieving interoperability and intelligent system behavior. In bioinformatics, ontologies might allow hitherto impossible query and data-mining activities. We review the use of anatomy ontologies to represent space in biological organisms, specifically mouse and human

    aGEM: an integrative system for analyzing spatial-temporal gene-expression information

    Get PDF
    Motivation: The work presented here describes the ‘anatomical Gene-Expression Mapping (aGEM)’ Platform, a development conceived to integrate phenotypic information with the spatial and temporal distributions of genes expressed in the mouse. The aGEM Platform has been built by extending the Distributed Annotation System (DAS) protocol, which was originally designed to share genome annotations over the WWW. DAS is a client-server system in which a single client integrates information from multiple distributed servers

    More than a decade of developmental gene expression atlases: where are we now?

    Get PDF
    To unravel regulatory networks of genes functioning during embryonic development, information on in situ gene expression is required. Enormous amounts of such data are available in literature, where each paper reports on a limited number of genes and developmental stages. The best way to make these data accessible is via spatio-temporal gene expression atlases. Eleven atlases, describing developing vertebrates and covering at least 100 genes, were reviewed. This review focuses on: (i) the used anatomical framework, (ii) the handling of input data and (iii) the retrieval of information. Our aim is to provide insights into both the possibilities of the atlases, as well as to describe what more than a decade of developmental gene expression atlases can teach us about the requirements of the design of the ‘ideal atlas’. This review shows that most ingredients needed to develop the ideal atlas are already applied to some extent in at least one of the discussed atlases. A review of these atlases shows that the ideal atlas should be based on a spatial framework, i.e. a series of 3D reference models, which is anatomically annotated using an ontology with sufficient resolution, both for relations as well as for anatomical terms

    The FaceBase Consortium: A comprehensive program to facilitate craniofacial research

    Get PDF
    The FaceBase Consortium consists of ten interlinked research and technology projects whose goal is to generate craniofacial research data and technology for use by the research community through a central data management and integrated bioinformatics hub. Funded by the National Institute of Dental and Craniofacial Research (NIDCR) and currently focused on studying the development of the middle region of the face, the Consortium will produce comprehensive datasets of global gene expression patterns, regulatory elements and sequencing; will generate anatomical and molecular atlases; will provide human normative facial data and other phenotypes; conduct follow up studies of a completed genome-wide association study; generate independent data on the genetics of craniofacial development, build repositories of animal models and of human samples and data for community access and analysis; and will develop software tools and animal models for analyzing and functionally testing and integrating these data. The FaceBase website (http://www.facebase.org) will serve as a web home for these efforts, providing interactive tools for exploring these datasets, together with discussion forums and other services to support and foster collaboration within the craniofacial research community

    Celsius: a community resource for Affymetrix microarray data

    Get PDF
    Celsius is a new system that serves as a warehouse by aggregating Affymetrix files and associated metadata, and containing the largest publicly available source of Affymetrix microarray data

    Providing visualisation support for the analysis of anatomy ontology data

    Get PDF
    BACKGROUND: Improvements in technology have been accompanied by the generation of large amounts of complex data. This same technology must be harnessed effectively if the knowledge stored within the data is to be retrieved. Storing data in ontologies aids its management; ontologies serve as controlled vocabularies that promote data exchange and re-use, improving analysis. The Edinburgh Mouse Atlas Project stores the developmental stages of the mouse embryo in anatomy ontologies. This project is looking at the use of visual data overviews for intuitive analysis of the ontology data. RESULTS: A prototype has been developed that visualises the ontologies using directed acyclic graphs in two dimensions, with the ability to study detail in regions of interest in isolation or within the context of the overview. This is followed by the development of a technique that layers individual anatomy ontologies in three-dimensional space, so that relationships across multiple data sets may be mapped using physical links drawn along the third axis. CONCLUSION: Usability evaluations of the applications confirmed advantages in visual analysis of complex data. This project will look next at data input from multiple sources, and continue to develop the techniques presented to provide intuitive identification of relationships that span multiple ontologies
    corecore