47 research outputs found

    The Drosophila phenotype ontology

    Get PDF
    BACKGROUND: Phenotype ontologies are queryable classifications of phenotypes. They provide a widely-used means for annotating phenotypes in a form that is human-readable, programatically accessible and that can be used to group annotations in biologically meaningful ways. Accurate manual annotation requires clear textual definitions for terms. Accurate grouping and fruitful programatic usage require high-quality formal definitions that can be used to automate classification. The Drosophila phenotype ontology (DPO) has been used to annotate over 159,000 phenotypes in FlyBase to date, but until recently lacked textual or formal definitions. RESULTS: We have composed textual definitions for all DPO terms and formal definitions for 77% of them. Formal definitions reference terms from a range of widely-used ontologies including the Phenotype and Trait Ontology (PATO), the Gene Ontology (GO) and the Cell Ontology (CL). We also describe a generally applicable system, devised for the DPO, for recording and reasoning about the timing of death in populations. As a result of the new formalisations, 85% of classifications in the DPO are now inferred rather than asserted, with much of this classification leveraging the structure of the GO. This work has significantly improved the accuracy and completeness of classification and made further development of the DPO more sustainable. CONCLUSIONS: The DPO provides a set of well-defined terms for annotating Drosophila phenotypes and for grouping and querying the resulting annotation sets in biologically meaningful ways. Such queries have already resulted in successful function predictions from phenotype annotation. Moreover, such formalisations make extended queries possible, including cross-species queries via the external ontologies used in formal definitions. The DPO is openly available under an open source license in both OBO and OWL formats. There is good potential for it to be used more broadly by the Drosophila community, which may ultimately result in its extension to cover a broader range of phenotypes

    FlyBase at 25: looking to the future.

    Get PDF
    Since 1992, FlyBase (flybase.org) has been an essential online resource for the Drosophila research community. Concentrating on the most extensively studied species, Drosophila melanogaster, FlyBase includes information on genes (molecular and genetic), transgenic constructs, phenotypes, genetic and physical interactions, and reagents such as stocks and cDNAs. Access to data is provided through a number of tools, reports, and bulk-data downloads. Looking to the future, FlyBase is expanding its focus to serve a broader scientific community. In this update, we describe new features, datasets, reagent collections, and data presentations that address this goal, including enhanced orthology data, Human Disease Model Reports, protein domain search and visualization, concise gene summaries, a portal for external resources, video tutorials and the FlyBase Community Advisory Group

    The function of miR-143, miR-145 and the MiR-143 host gene in cardiovascular development and disease

    Get PDF
    Noncoding RNAs (long noncoding RNAs and small RNAs) are emerging as critical modulators of phenotypic changes associated with physiological and pathological contexts in a variety of cardiovascular diseases (CVDs). Although it has been well established that hereditable genetic alterations and exposure to risk factors are crucial in the development of CVDs, other critical regulators of cell function impact on disease processes. Here we discuss noncoding RNAs have only recently been identified as key players involved in the progression of disease. In particular, we discuss micro RNA (miR)-143/145 since they represent one of the most characterised microRNA clusters regulating smooth muscle cell (SMC) differentiation and phenotypic switch in response to vascular injury and remodelling. MiR143HG is a well conserved long noncoding RNA (lncRNA), which is the host gene for miR-143/145 and recently implicated in cardiac specification during heart development. Although the lncRNA-miRNA interactions have not been completely characterised, their crosstalk is now beginning to emerge and likely requires further research focus. In this review we give an overview of the biology of the genomic axis that is miR-143/145 and MiR143HG, focusing on their important functional role(s) in the cardiovascular system

    Multiple Peptidoglycan Modification Networks Modulate Helicobacter pylori's Cell Shape, Motility, and Colonization Potential

    Get PDF
    Helical cell shape of the gastric pathogen Helicobacter pylori has been suggested to promote virulence through viscosity-dependent enhancement of swimming velocity. However, H. pylori csd1 mutants, which are curved but lack helical twist, show normal velocity in viscous polymer solutions and the reason for their deficiency in stomach colonization has remained unclear. Characterization of new rod shaped mutants identified Csd4, a DL-carboxypeptidase of peptidoglycan (PG) tripeptide monomers and Csd5, a putative scaffolding protein. Morphological and biochemical studies indicated Csd4 tripeptide cleavage and Csd1 crosslinking relaxation modify the PG sacculus through independent networks that coordinately generate helical shape. csd4 mutants show attenuation of stomach colonization, but no change in proinflammatory cytokine induction, despite four-fold higher levels of Nod1-agonist tripeptides in the PG sacculus. Motility analysis of similarly shaped mutants bearing distinct alterations in PG modifications revealed deficits associated with shape, but only in gel-like media and not viscous solutions. As gastric mucus displays viscoelastic gel-like properties, our results suggest enhanced penetration of the mucus barrier underlies the fitness advantage conferred by H. pylori's characteristic shape

    Genetic mechanisms of critical illness in COVID-19.

    Get PDF
    Host-mediated lung inflammation is present1, and drives mortality2, in the critical illness caused by coronavirus disease 2019 (COVID-19). Host genetic variants associated with critical illness may identify mechanistic targets for therapeutic development3. Here we report the results of the GenOMICC (Genetics Of Mortality In Critical Care) genome-wide association study in 2,244 critically ill patients with COVID-19 from 208 UK intensive care units. We have identified and replicated the following new genome-wide significant associations: on chromosome 12q24.13 (rs10735079, P = 1.65 × 10-8) in a gene cluster that encodes antiviral restriction enzyme activators (OAS1, OAS2 and OAS3); on chromosome 19p13.2 (rs74956615, P = 2.3 × 10-8) near the gene that encodes tyrosine kinase 2 (TYK2); on chromosome 19p13.3 (rs2109069, P = 3.98 ×  10-12) within the gene that encodes dipeptidyl peptidase 9 (DPP9); and on chromosome 21q22.1 (rs2236757, P = 4.99 × 10-8) in the interferon receptor gene IFNAR2. We identified potential targets for repurposing of licensed medications: using Mendelian randomization, we found evidence that low expression of IFNAR2, or high expression of TYK2, are associated with life-threatening disease; and transcriptome-wide association in lung tissue revealed that high expression of the monocyte-macrophage chemotactic receptor CCR2 is associated with severe COVID-19. Our results identify robust genetic signals relating to key host antiviral defence mechanisms and mediators of inflammatory organ damage in COVID-19. Both mechanisms may be amenable to targeted treatment with existing drugs. However, large-scale randomized clinical trials will be essential before any change to clinical practice

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Mouse genomic variation and its effect on phenotypes and gene regulation

    Get PDF
    We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism
    corecore