27 research outputs found

    The CODATA-RDA Data Steward School

    Get PDF
    Given the expected increase in demand for Data Stewards and Data Stewardship skills it is clear that there is a need to develop training, education and CPD (continuous professional development) in this area. In this paper a brief introduction is provided to the origin of definitions of Data Stewardship. Also it notes the present tendency towards equivalence between Data Stewardship skills and FAIR principles. It then focuses on one specific training event – the pilot Data Stewardship strand of the CODATA-RDA Research Data Science schools that by the time of the IDCC meeting will have been held in Trieste in August 2019. The paper will discuss the overall curriculum for the pilot school, how it matches with the FAIR4S framework, and plans for getting feedback from the students. Finally, the paper discuss future plans for the school, in particular how to deepen the integration between the Data Stewardship strand with the Early Career Researcher strand. [This paper is a conference pre-print presented at IDCC 2020 after lightweight peer review.

    The BioMart interface to the eMouseAtlas gene expression database EMAGE

    Get PDF
    Here, we describe the BioMart interface to the eMouseAtlas gene expression database EMAGE. EMAGE is a spatiotemporal database of in situ gene expression patterns in the developing mouse embryo. BioMart provides a generic web query interface and programmable access using web services. The BioMart interface extends access to EMAGE via a powerful method of structuring complex queries and one with which users may already be familiar with from other BioMart implementations. The interface is structured into several data sets providing the user with comprehensive query access to the EMAGE data. The federated nature of BioMart allows scope for integration and cross querying of EMAGE with other similar BioMarts

    Integrated analysis of Wnt signalling system component gene expression

    Get PDF
    Wnt signalling controls patterning and differentiation across many tissues and organs of the developing embryo through temporally and spatially restricted expression of multi-gene families encoding ligands, receptors, pathway modulators and intracellular components. Here, we report an integrated analysis of key genes in the 3D space of the mouse embryo across multiple stages of development. We applied a method for 3D/3D image transformation to map all gene expression patterns to a single reference embryo for each stage, providing both visual analysis and volumetric mapping allowing computational methods to interrogate the combined expression patterns. We identify territories where multiple Wnt and Fzd genes are co-expressed and cross-compare all patterns, including all seven Wnt paralogous gene pairs. The comprehensive analysis revealed regions in the embryo where no Wnt or Fzd gene expression is detected, and where single Wnt genes are uniquely expressed. This work provides insight into a previously unappreciated level of organisation of expression patterns, as well as presenting a resource that can be utilised further by the research community for whole-system analysis

    EMAGE: a spatial database of gene expression patterns during mouse embryo development

    Get PDF
    EMAGE () is a freely available, curated database of gene expression patterns generated by in situ techniques in the developing mouse embryo. It is unique in that it contains standardized spatial representations of the sites of gene expression for each gene, denoted against a set of virtual reference embryo models. As such, the data can be interrogated in a novel and abstract manner by using space to define a query. Accompanying the spatial representations of gene expression patterns are text descriptions of the sites of expression, which also allows searching of the data by more conventional text-based methods

    EMAGE: a spatial database of gene expression patterns during mouse embryo development

    Get PDF
    EMAGE () is a freely available, curated database of gene expression patterns generated by in situ techniques in the developing mouse embryo. It is unique in that it contains standardized spatial representations of the sites of gene expression for each gene, denoted against a set of virtual reference embryo models. As such, the data can be interrogated in a novel and abstract manner by using space to define a query. Accompanying the spatial representations of gene expression patterns are text descriptions of the sites of expression, which also allows searching of the data by more conventional text-based methods

    eMouseAtlas, EMAGE, and the spatial dimension of the transcriptome

    Get PDF
    Abstract eMouseAtlas (www.emouseatlas.org) is a com-prehensive online resource to visualise mouse development and investigate gene expression in the mouse embryo. We have recently deployed a completely redesigned Mouse Anatomy Atlas website (www.emouseatlas.org/emap/ema) that allows users to view 3D embryo reconstructions, delineated anatomy, and high-resolution histological sec-tions. A new feature of the website is the IIP3D web tool that allows a user to view arbitrary sections of 3D embryo reconstructions using a web browser. This feature provides interactive access to very high-volume 3D images via a tiled pan-and-zoom style interface and circumvents the need to download large image files for visualisation. eMouseAtla

    EMAGE mouse embryo spatial gene expression database: 2010 update

    Get PDF
    EMAGE (http://www.emouseatlas.org/emage) is a freely available online database of in situ gene expression patterns in the developing mouse embryo. Gene expression domains from raw images are extracted and integrated spatially into a set of standard 3D virtual mouse embryos at different stages of development, which allows data interrogation by spatial methods. An anatomy ontology is also used to describe sites of expression, which allows data to be queried using text-based methods. Here, we describe recent enhancements to EMAGE including: the release of a completely re-designed website, which offers integration of many different search functions in HTML web pages, improved user feedback and the ability to find similar expression patterns at the click of a button; back-end refactoring from an object oriented to relational architecture, allowing associated SQL access; and the provision of further access by standard formatted URLs and a Java API. We have also increased data coverage by sourcing from a greater selection of journals and developed automated methods for spatial data annotation that are being applied to spatially incorporate the genome-wide (∼19 000 gene) ‘EURExpress’ dataset into EMAGE

    EMAGE—Edinburgh Mouse Atlas of Gene Expression: 2008 update

    Get PDF
    EMAGE (http://genex.hgu.mrc.ac.uk/Emage/database) is a database of in situ gene expression patterns in the developing mouse embryo. Domains of expression from raw data images are spatially integrated into a set of standard 3D virtual mouse embryos at different stages of development, allowing data interrogation by spatial methods. Sites of expression are also described using an anatomy ontology and data can be queried using text-based methods. Here we describe recent enhancements to EMAGE which include advances in spatial search methods including: a refined local spatial similarity search algorithm, a method to allow global spatial comparison of patterns in EMAGE and subsequent hierarchical-clustering, and spatial searches across multiple stages of development. In addition, we have extended data access by the introduction of web services and new HTML-based search interfaces, which allow access to data that has not yet been spatially annotated. We have also started incorporating full 3D images of gene expression that have been generated using optical projection tomography (OPT)

    LAMA: automated image analysis for the developmental phenotyping of mouse embryos

    Get PDF
    Advanced 3D imaging modalities, such as micro-computed tomography (micro-CT), have been incorporated into the high-throughput embryo pipeline of the International Mouse Phenotyping Consortium (IMPC). This project generates large volumes of raw data that cannot be immediately exploited without significant resources of personnel and expertise. Thus, rapid automated annotation is crucial to ensure that 3D imaging data can be integrated with other multi-dimensional phenotyping data. We present an automated computational mouse embryo phenotyping pipeline that harnesses the large amount of wild-type control data available in the IMPC embryo pipeline in order to address issues of low mutant sample number as well as incomplete penetrance and variable expressivity. We also investigate the effect of developmental substage on automated phenotyping results. Designed primarily for developmental biologists, our software performs image pre-processing, registration, statistical analysis and segmentation of embryo images. We also present a novel anatomical E14.5 embryo atlas average and, using it with LAMA, show that we can uncover known and novel dysmorphology from two IMPC knockout lines
    corecore