74 research outputs found

    Clinical application of tumour-in-normal contamination assessment from whole genome sequencing

    The unexpected contamination of normal samples with tumour cells reduces variant detection sensitivity, compromising downstream results in canonical tumour-normal analyses. Leveraging whole-genome sequencing data available at Genomics England, we develop a tool for normal sample contamination assessment, which we validate in silico and against minimal residual disease testing. From a systematic review of 771 patients with haematological malignancies and sarcomas, we find contamination across a range of cancer clinical indications and DNA sources, with the highest prevalence in saliva samples from acute myeloid leukaemia patients and in sorted CD3+ T-cells from myeloproliferative neoplasms. Further exploration reveals 108 hotspot mutations in genes associated with haematological cancers at risk of being subtracted by standard variant calling pipelines. Our work highlights the importance of contamination assessment for accurate somatic variant detection in research and clinical settings, especially with large-scale sequencing projects being used to deliver accurate data from which to make clinical decisions for patient care.

    Complete mitochondrial genomes and nuclear ribosomal RNA operons of two species of Diplostomum (Platyhelminthes: Trematoda): a molecular resource for taxonomy and molecular epidemiology of important fish pathogens

    © 2015 Brabec et al. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. The attached file is the published version of the article.

    Ensembl Genomes 2022: an expanding genome resource for non-vertebrates

    Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate genomes and analysis complementing vertebrate resources developed by the Ensembl project (https://www.ensembl.org). The two resources collectively present genome annotation through a consistent set of interfaces spanning the tree of life presenting genome sequence, annotation, variation, transcriptomic data and comparative analysis. Here we present our largest increase in plant, metazoan and fungal genomes since the project’s inception creating one of the world’s most comprehensive genomic resources and describe our efforts to reduce genome redundancy in our Bacteria portal. We also detail our new efforts in gene annotation, our emerging support for pangenome analysis and efforts to accelerate data dissemination through the Ensembl Rapid Release resource. We also present our new AlphaFold visualisation. Finally, we present details of our future plans including updates on our integration with Ensembl, and how we plan to improve our support for the microbial research community. Software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license). Data updates are synchronised with Ensembl’s release cycle

    Diversification of Schistosoma japonicum in Mainland China Revealed by Mitochondrial DNA

    Despite the existing threat of schistosomiasis in some rural areas along the Yangtze River, the genetic diversity of Schistosoma japonicum has not been investigated across its wide geographical distribution in China, and such information may provide insight into the disease epidemiology and the development of its control measures. In this study, adult parasites were obtained by infecting mice with cercariae from snails of the genus Oncomelania collected from a wide range of localities in currently endemic areas of schistosomiasis in the middle and lower (ML) reaches of the Yangtze River, and in Sichuan and Yunnan provinces in the upper reaches of the river in southwest (SW) China, and were sequenced individually for mitochondrial genes. In general, a relatively high degree of genetic variation was observed in populations in the ML reaches in terms of nucleotide and haplotype diversity, but a low level was observed in populations in the SW. A significant difference in genetic diversity, as revealed by F-statistics, and the absence of shared haplotypes were observed between populations in the ML reaches and those in the SW, indicating the effect of geographical separation/isolation upon the schistosomes and probably the parasite-snail system in China.

    Identification of 4 novel human ocular coloboma genes ANK3, BMPR1B, PDGFRA, and CDH4 through evolutionary conserved vertebrate gene analysis

    Purpose: Ocular coloboma arises from genetic or environmental perturbations that inhibit optic fissure (OF) fusion during early eye development. Despite high genetic heterogeneity, 70% to 85% of patients remain molecularly undiagnosed. In this study, we have identified new potential causative genes using cross-species comparative meta-analysis. Methods: Evolutionarily conserved differentially expressed genes were identified through in silico analysis, with in situ hybridization, gene knockdown, and rescue performed to confirm spatiotemporal gene expression and phenotype. Interrogation of the 100,000 Genomes Project for putative pathogenic variants was performed. Results: Nine conserved differentially expressed genes between zebrafish and mouse were identified. Expression of zebrafish ank3a, bmpr1ba/b, cdh4, and pdgfaa was localized to the OF, periocular mesenchyme cells, or ciliary marginal zone, regions traversed by the OF. Knockdown of ank3, bmpr1b, and pdgfaa revealed a coloboma and/or microphthalmia phenotype. Novel pathogenic variants in ANK3, BMPR1B, PDGFRA, and CDH4 were identified in 8 unrelated coloboma families. We showed BMPR1B rescued the knockdown phenotype but variant messenger RNAs failed, providing evidence of pathogenicity. Conclusion: We show the utility of cross-species meta-analysis to identify several novel coloboma disease-causing genes. There is a potential to increase the diagnostic yield for new and unsolved patients while adding to our understanding of the genetic basis of OF morphogenesis

    Clinical, genetic, epidemiologic, evolutionary, and functional delineation of TSPEAR-related autosomal recessive ectodermal dysplasia 14

    TSPEAR variants cause autosomal recessive ectodermal dysplasia (ARED) 14. The function of TSPEAR is unknown. The clinical features, the mutation spectrum, and the underlying mechanisms of ARED14 are poorly understood. Combining data from new and previously published individuals established that ARED14 is primarily characterized by dental anomalies such as conical tooth cusps and hypodontia, like those seen in individuals with WNT10A-related odontoonychodermal dysplasia. AlphaFold-predicted structure-based analysis showed that most of the pathogenic TSPEAR missense variants likely destabilize the β-propeller of the protein. Analysis of 100,000 Genomes Project (100KGP) data revealed multiple founder TSPEAR variants across different populations. Mutational and recombination clock analyses demonstrated that non-Finnish European founder variants likely originated around the end of the last ice age, a period of major climatic transition. Analysis of gnomAD data showed that the non-Finnish European population TSPEAR gene-carrier rate is ∼1/140, making it one of the commonest AREDs. Phylogenetic and AlphaFold structural analyses showed that TSPEAR is an ortholog of Drosophila Closca, an extracellular matrix-dependent signaling regulator. We, therefore, hypothesized that TSPEAR could have a role in the enamel knot, a structure that coordinates patterning of developing tooth cusps. Analysis of mouse single-cell RNA sequencing (scRNA-seq) data revealed highly restricted expression of Tspear in clusters representing enamel knots. A tspeara−/−;tspearb−/− double-knockout zebrafish model recapitulated the clinical features of ARED14 and fin regeneration abnormalities of wnt10a knockout fish, thus suggesting interaction between tspear and wnt10a. In summary, we provide insights into the role of TSPEAR in ectodermal development and the evolutionary history, epidemiology, mechanisms, and consequences of its loss-of-function variants.

    DYNC2H1 hypomorphic or retina-predominant variants cause nonsyndromic retinal degeneration

    Purpose: Determining the role of DYNC2H1 variants in nonsyndromic inherited retinal disease (IRD). Methods: Genome and exome sequencing were performed for five unrelated cases of IRD with no identified variant. In vitro assays were developed to validate the variants identified (fibroblast assay, induced pluripotent stem cell [iPSC] derived retinal organoids, and a dynein motility assay). Results: Four novel DYNC2H1 variants (V1, g.103327020_103327021dup; V2, g.103055779A>T; V3, g.103112272C>G; V4, g.103070104A>C) and one previously reported variant (V5, g.103339363T>G) were identified. In proband 1 (V1/V2), V1 was predicted to introduce a premature termination codon (PTC), whereas V2 disrupted the exon 41 splice donor site causing incomplete skipping of exon 41. V1 and V2 impaired dynein-2 motility in vitro and perturbed IFT88 distribution within cilia. V3, homozygous in probands 2–4, is predicted to cause a PTC in a retina-predominant transcript. Analysis of retinal organoids showed that this new transcript expression increased with organoid differentiation. V4, a novel missense variant, was in trans with V5, previously associated with Jeune asphyxiating thoracic dystrophy (JATD). Conclusion: The DYNC2H1 variants discussed herein were either hypomorphic or affecting a retina-predominant transcript and caused nonsyndromic IRD. Dynein variants, specifically DYNC2H1 variants, are reported as a cause of nonsyndromic IRD.

    Assessing the digenic model in rare disorders using population sequencing data

    An important fraction of patients with rare disorders remain without a clear genetic diagnosis, even after whole-exome or whole-genome sequencing, making it difficult to provide adequate treatment and genetic counseling. The analysis of genomic data in rare disorders mostly considers the presence of single gene variants in coding regions that follow a specific monogenic mode of inheritance. Digenic inheritance, with variants in two functionally related genes in the same individual, is a plausible alternative that might explain the genetic basis of the disease in some cases. Under this model, digenic disease combinations should be absent or underrepresented in healthy individuals. We develop a framework to evaluate the significance of digenic combinations and test its statistical power in different scenarios. We suggest that this approach will be relevant with the advent of new sequencing efforts including hundreds of thousands of samples.
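
    The expectation that digenic disease combinations are underrepresented in healthy individuals can be illustrated with a simple one-sided binomial test. The sketch below is not the authors' framework. It assumes that carrier status for the two genes is independent in the control population, and it ignores linkage, population structure and variant-level filtering. The function names and the example counts are hypothetical.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p), summed exactly (intended for small k)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def depletion_p_value(n_controls, carriers_a, carriers_b, observed_pairs):
    """One-sided p-value for seeing this few co-carriers of genes A and B.

    Assumption: carrier status for the two genes is independent, so the
    chance that one control carries both is the product of the two
    carrier frequencies.
    """
    p_joint = (carriers_a / n_controls) * (carriers_b / n_controls)
    return binom_cdf(observed_pairs, n_controls, p_joint)


# Hypothetical counts: 10,000 controls, 200 carriers in gene A, 300 in gene B.
# Under independence about 6 co-carriers are expected.
p_none = depletion_p_value(10_000, 200, 300, 0)
p_six = depletion_p_value(10_000, 200, 300, 6)
print(f"no co-carriers: p = {p_none:.4f}; six co-carriers: p = {p_six:.2f}")
```

    In this toy setting, observing no co-carriers is unlikely under independence, while observing the expected number is unremarkable. With rarer variants the expected count falls below one, and a cohort of this size has little power to detect depletion, which is why very large sequencing cohorts matter for this kind of test.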

    Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction.

    Whipworms are common soil-transmitted helminths that cause debilitating chronic infections in man. These nematodes are only distantly related to Caenorhabditis elegans and have evolved to occupy an unusual niche, tunneling through epithelial cells of the large intestine. We report here the whole-genome sequences of the human-infective Trichuris trichiura and the mouse laboratory model Trichuris muris. On the basis of whole-transcriptome analyses, we identify many genes that are expressed in a sex- or life stage-specific manner and characterize the transcriptional landscape of a morphological region with unique biological adaptations, namely, bacillary band and stichosome, found only in whipworms and related parasites. Using RNA sequencing data from whipworm-infected mice, we describe the regulated T helper 1 (TH1)-like immune response of the chronically infected cecum in unprecedented detail. In silico screening identified numerous new potential drug targets against trichuriasis. Together, these genomes and associated functional data elucidate key aspects of the molecular host-parasite interactions that define chronic whipworm infection