2,905 research outputs found

    A unified compendium of prokaryotic and viral genomes from over 300 anaerobic digestion microbiomes

    Get PDF
    BackgroundThe anaerobic digestion process degrades organic matter into simpler compounds and occurs in strictly anaerobic and microaerophilic environments. The process is carried out by a diverse community of microorganisms where each species has a unique role and it has relevant biotechnological applications since it is used for biogas production. Some aspects of the microbiome, including its interaction with phages, remains still unclear: a better comprehension of the community composition and role of each species is crucial for a cured understanding of the carbon cycle in anaerobic systems and improving biogas production.ResultsThe primary objective of this study was to expand our understanding on the anaerobic digestion microbiome by jointly analyzing its prokaryotic and viral components. By integrating 192 additional datasets into a previous metagenomic database, the binning process generated 11,831 metagenome-assembled genomes from 314 metagenome samples published between 2014 and 2022, belonging to 4,568 non-redundant species based on ANI calculation and quality verification. CRISPR analysis on these genomes identified 76 archaeal genomes with active phage interactions. Moreover, single-nucleotide variants further pointed to archaea as the most critical members of the community. Among the MAGs, two methanogenic archaea, Methanothrix sp. 43zhSC_152 and Methanoculleus sp. 52maCN_3230, had the highest number of SNVs, with the latter having almost double the density of most other MAGs.ConclusionsThis study offers a more comprehensive understanding of microbial community structures that thrive at different temperatures. The findings revealed that the fraction of archaeal species characterized at the genome level and reported in public databases is higher than that of bacteria, although still quite limited. The identification of shared spacers between phages and microbes implies a history of phage-bacterial interactions, and specifically lysogenic infections. A significant number of SNVs were identified, primarily comprising synonymous and nonsynonymous variants. Together, the findings indicate that methanogenic archaea are subject to intense selective pressure and suggest that genomic variants play a critical role in the anaerobic digestion process. Overall, this study provides a more balanced and diverse representation of the anaerobic digestion microbiota in terms of geographic location, temperature range and feedstock utilization

    Exploring missing heritability in neurodevelopmental disorders:Learning from regulatory elements

    Get PDF
    In this thesis, I aimed to solve part of the missing heritability in neurodevelopmental disorders, using computational approaches. Next to the investigations of a novel epilepsy syndrome and investigations aiming to elucidate the regulation of the gene involved, I investigated and prioritized genomic sequences that have implications in gene regulation during the developmental stages of human brain, with the goal to create an atlas of high confidence non-coding regulatory elements that future studies can assess for genetic variants in genetically unexplained individuals suffering from neurodevelopmental disorders that are of suspected genetic origin

    Genetic and epigenetic characterization of cell-free DNA:In patients with solid tumors

    Get PDF

    Towards personalized medicine for metastatic urothelial cancer

    Get PDF

    ClinPrior: an algorithm for diagnosis and novel gene discovery by network-based prioritization

    Full text link
    BackgroundWhole-exome sequencing (WES) and whole-genome sequencing (WGS) have become indispensable tools to solve rare Mendelian genetic conditions. Nevertheless, there is still an urgent need for sensitive, fast algorithms to maximise WES/WGS diagnostic yield in rare disease patients. Most tools devoted to this aim take advantage of patient phenotype information for prioritization of genomic data, although are often limited by incomplete gene-phenotype knowledge stored in biomedical databases and a lack of proper benchmarking on real-world patient cohorts.MethodsWe developed ClinPrior, a novel method for the analysis of WES/WGS data that ranks candidate causal variants based on the patient's standardized phenotypic features (in Human Phenotype Ontology (HPO) terms). The algorithm propagates the data through an interactome network-based prioritization approach. This algorithm was thoroughly benchmarked using a synthetic patient cohort and was subsequently tested on a heterogeneous prospective, real-world series of 135 families affected by hereditary spastic paraplegia (HSP) and/or cerebellar ataxia (CA).ResultsClinPrior successfully identified causative variants achieving a final positive diagnostic yield of 70% in our real-world cohort. This includes 10 novel candidate genes not previously associated with disease, 7 of which were functionally validated within this project. We used the knowledge generated by ClinPrior to create a specific interactome for HSP/CA disorders thus enabling future diagnoses as well as the discovery of novel disease genes.ConclusionsClinPrior is an algorithm that uses standardized phenotype information and interactome data to improve clinical genomic diagnosis. It helps in identifying atypical cases and efficiently predicts novel disease-causing genes. This leads to increasing diagnostic yield, shortening of the diagnostic Odysseys and advancing our understanding of human illnesses

    Genetic and epigenetic characterization of cell-free DNA:In patients with solid tumors

    Get PDF

    Towards personalized medicine for metastatic urothelial cancer

    Get PDF

    Predicting the impact of rare variants on RNA splicing in CAGI6

    Get PDF
    Variants which disrupt splicing are a frequent cause of rare disease that have been under-ascertained clinically. Accurate and efficient methods to predict a variant’s impact on splicing are needed to interpret the growing number of variants of unknown significance (VUS) identified by exome and genome sequencing. Here, we present the results of the CAGI6 Splicing VUS challenge, which invited predictions of the splicing impact of 56 variants ascertained clinically and functionally validated to determine splicing impact. The performance of 12 prediction methods, along with SpliceAI and CADD, was compared on the 56 functionally validated variants. The maximum accuracy achieved was 82% from two different approaches, one weighting SpliceAI scores by minor allele frequency, and one applying the recently published Splicing Prediction Pipeline (SPiP). SPiP performed optimally in terms of sensitivity, while an ensemble method combining multiple prediction tools and information from databases exceeded all others for specificity. Several challenge methods equalled or exceeded the performance of SpliceAI, with ultimate choice of prediction method likely to depend on experimental or clinical aims. One quarter of the variants were incorrectly predicted by at least 50% of the methods, highlighting the need for further improvements to splicing prediction methods for successful clinical application

    Uncovering the clinical relevance of unclassified variants in DNA repair genes: a focus on BRCA negative Tunisian cancer families

    Get PDF
    Introduction: Recent advances in sequencing technologies have significantly increased our capability to acquire large amounts of genetic data. However, the clinical relevance of the generated data continues to be challenging particularly with the identification of Variants of Uncertain Significance (VUSs) whose pathogenicity remains unclear. In the current report, we aim to evaluate the clinical relevance and the pathogenicity of VUSs in DNA repair genes among Tunisian breast cancer families.Methods: A total of 67 unsolved breast cancer cases have been investigated. The pathogenicity of VUSs identified within 26 DNA repair genes was assessed using different in silico prediction tools including SIFT, PolyPhen2, Align-GVGD and VarSEAK. Effects on the 3D structure were evaluated using the stability predictor DynaMut and molecular dynamics simulation with NAMD. Family segregation analysis was also performed.Results: Among a total of 37 VUSs identified, 11 variants are likely deleterious affecting ATM, BLM, CHEK2, ERCC3, FANCC, FANCG, MSH2, PMS2 and RAD50 genes. The BLM variant, c.3254dupT, is novel and seems to be associated with increased risk of breast, endometrial and colon cancer. Moreover, c.6115G>A in ATM and c.592+3A>T in CHEK2 were of keen interest identified in families with multiple breast cancer cases and their familial cosegregation with disease has been also confirmed. In addition, functional in silico analyses revealed that the ATM variant may lead to protein immobilization and rigidification thus decreasing its activity. We have also shown that FANCC and FANCG variants may lead to protein destabilization and alteration of the structure compactness which may affect FANCC and FANCG protein activity.Conclusion: Our findings revealed that VUSs in DNA repair genes might be associated with increased cancer risk and highlight the need for variant reclassification for better disease management. This will help to improve the genetic diagnosis and therapeutic strategies of cancer patients not only in Tunisia but also in neighboring countries

    Exploring missing heritability in neurodevelopmental disorders:Learning from regulatory elements

    Get PDF
    corecore