83 research outputs found

    Entropy of complex relevant components of Boolean networks

    Full text link
    Boolean network models of strongly connected modules are capable of capturing the high regulatory complexity of many biological gene regulatory circuits. We study numerically the previously introduced basin entropy, a parameter for the dynamical uncertainty or information storage capacity of a network as well as the average transient time in random relevant components as a function of their connectivity. We also demonstrate that basin entropy can be estimated from time-series data and is therefore also applicable to non-deterministic networks models.Comment: 8 pages, 6 figure

    The allele distribution in next-generation sequencing data sets is accurately described as the result of a stochastic branching process

    Get PDF
    With the availability of next-generation sequencing (NGS) technology, it is expected that sequence variants may be called on a genomic scale. Here, we demonstrate that a deeper understanding of the distribution of the variant call frequencies at heterozygous loci in NGS data sets is a prerequisite for sensitive variant detection. We model the crucial steps in an NGS protocol as a stochastic branching process and derive a mathematical framework for the expected distribution of alleles at heterozygous loci before measurement that is sequencing. We confirm our theoretical results by analyzing technical replicates of human exome data and demonstrate that the variance of allele frequencies at heterozygous loci is higher than expected by a simple binomial distribution. Due to this high variance, mutation callers relying on binomial distributed priors are less sensitive for heterozygous variants that deviate strongly from the expected mean frequency. Our results also indicate that error rates can be reduced to a greater degree by technical replicates than by increasing sequencing depth

    AI-based multi-PRS models outperform classical single-PRS models

    Get PDF
    Polygenic risk scores (PRS) calculate the risk for a specific disease based on the weighted sum of associated alleles from different genetic loci in the germline estimated by regression models. Recent advances in genetics made it possible to create polygenic predictors of complex human traits, including risks for many important complex diseases, such as cancer, diabetes, or cardiovascular diseases, typically influenced by many genetic variants, each of which has a negligible effect on overall risk. In the current study, we analyzed whether adding additional PRS from other diseases to the prediction models and replacing the regressions with machine learning models can improve overall predictive performance. Results showed that multi-PRS models outperform single-PRS models significantly on different diseases. Moreover, replacing regression models with machine learning models, i.e., deep learning, can also improve overall accuracy

    Extending the allelic spectrum at noncoding risk loci of orofacial clefting

    Get PDF
    Genome-wide association studies (GWAS) have generated unprecedented insights into the genetic etiology of orofacial clefting (OFC). The moderate effect sizes of associated noncoding risk variants and limited access to disease-relevant tissue represent considerable challenges for biological interpretation of genetic findings. As rare variants with stronger effect sizes are likely to also contribute to OFC, an alternative approach to delineate pathogenic mechanisms is to identify private mutations and/or an increased burden of rare variants in associated regions. This report describes a framework for targeted resequencing at selected noncoding risk loci contributing to nonsyndromic cleft lip with/without cleft palate (nsCL/P), the most frequent OFC subtype. Based on GWAS data, we selected three risk loci and identified candidate regulatory regions (CRRs) through the integration of credible SNP information, epigenetic data from relevant cells/tissues, and conservation scores. The CRRs (total 57 kb) were resequenced in a multiethnic study population (1061 patients; 1591 controls), using single-molecule molecular inversion probe technology. Combining evidence from in silico variant annotation, pedigree- and burden analyses, we identified 16 likely deleterious rare variants that represent new candidates for functional studies in nsCL/P. Our framework is scalable and represents a promising approach to the investigation of additional congenital malformations with multifactorial etiology

    Genome sequencing in families with congenital limb malformations

    Get PDF
    The extensive clinical and genetic heterogeneity of congenital limb malformation calls for comprehensive genome-wide analysis of genetic variation. Genome sequencing (GS) has the potential to identify all genetic variants. Here we aim to determine the diagnostic potential of GS as a comprehensive one-test-for-all strategy in a cohort of undiagnosed patients with congenital limb malformations. We collected 69 cases (64 trios, 1 duo, 5 singletons) with congenital limb malformations with no molecular diagnosis after standard clinical genetic testing and performed genome sequencing. We also developed a framework to identify potential noncoding pathogenic variants. We identified likely pathogenic/disease-associated variants in 12 cases (17.4%) including four in known disease genes, and one repeat expansion in HOXD13. In three unrelated cases with ectrodactyly, we identified likely pathogenic variants in UBA2, establishing it as a novel disease gene. In addition, we found two complex structural variants (3%). We also identified likely causative variants in three novel high confidence candidate genes. We were not able to identify any noncoding variants. GS is a powerful strategy to identify all types of genomic variants associated with congenital limb malformation, including repeat expansions and complex structural variants missed by standard diagnostic approaches. In this cohort, no causative noncoding SNVs could be identified. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00439-021-02295-y

    Identification of de novo variants in nonsyndromic cleft lip with/without cleft palate patients with low polygenic risk scores

    Get PDF
    [Background]: Nonsyndromic cleft lip with/without cleft palate (nsCL/P) is a congenital malformation of multifactorial etiology. Research has identified >40 genome-wide significant risk loci, which explain less than 40% of nsCL/P heritability. Studies show that some of the hidden heritability is explained by rare penetrant variants. [Methods]: To identify new candidate genes, we searched for highly penetrant de novo variants (DNVs) in 50 nsCL/P patient/parent-trios with a low polygenic risk for the phenotype (discovery). We prioritized DNV-carrying candidate genes from the discovery for resequencing in independent cohorts of 1010 nsCL/P patients of diverse ethnicities and 1574 population-matched controls (replication). Segregation analyses and rare variant association in the replication cohort, in combination with additional data (genome-wide association data, expression, protein–protein-interactions), were used for final prioritization. [Conclusion]: In the discovery step, 60 DNVs were identified in 60 genes, including a variant in the established nsCL/P risk gene CDH1. Re-sequencing of 32 prioritized genes led to the identification of 373 rare, likely pathogenic variants. Finally, MDN1 and PAXIP1 were prioritized as top candidates. Our findings demonstrate that DNV detection, including polygenic risk score analysis, is a powerful tool for identifying nsCL/P candidate genes, which can also be applied to other multifactorial congenital malformations.The present study was supported by the German Research Foundation (DFG)-Grants BE 3828/8-1, LU 1944/2-1, MA 2546/5-1, and LU1944/3-1

    Interpreting the role of de novo protein-coding mutations in neuropsychiatric disease

    Get PDF
    Pedigree, linkage and association studies are consistent with heritable variation for complex disease due to the segregation of genetic factors in families and in the population. In contrast, de novo mutations make only minor contributions to heritability estimates for complex traits. Nonetheless, some de novo variants are known to be important in disease etiology. The identification of risk-conferring de novo variants will contribute to the discovery of etiologically relevant genes and pathways and may help in genetic counseling. There is considerable interest in the role of such mutations in complex neuropsychiatric disease, largely driven by new genotyping and sequencing technologies. An important role for large de novo copy number variations has been established. Recently, whole-exome sequencing has been used to extend the investigation of de novo variation to point mutations in protein-coding regions. Here, we consider several challenges for the interpretation of such mutations in the context of their role in neuropsychiatric disease

    CUX1-related neurodevelopmental disorder: deep insights into phenotype-genotype spectrum and underlying pathology

    Get PDF
    Heterozygous, pathogenic CUX1 variants are associated with global developmental delay or intellectual disability. This study delineates the clinical presentation in an extended cohort and investigates the molecular mechanism underlying the disorder in a Cux1+/− mouse model. Through international collaboration, we assembled the phenotypic and molecular information for 34 individuals (23 unpublished individuals). We analyze brain CUX1 expression and susceptibility to epilepsy in Cux1+/− mice. We describe 34 individuals, from which 30 were unrelated, with 26 different null and four missense variants. The leading symptoms were mild to moderate delayed speech and motor development and borderline to moderate intellectual disability. Additional symptoms were muscular hypotonia, seizures, joint laxity, and abnormalities of the forehead. In Cux1+/− mice, we found delayed growth, histologically normal brains, and increased susceptibility to seizures. In Cux1+/− brains, the expression of Cux1 transcripts was half of WT animals. Expression of CUX1 proteins was reduced, although in early postnatal animals significantly more than in adults. In summary, disease-causing CUX1 variants result in a non-syndromic phenotype of developmental delay and intellectual disability. In some individuals, this phenotype ameliorates with age, resulting in a clinical catch-up and normal IQ in adulthood. The post-transcriptional balance of CUX1 expression in the heterozygous brain at late developmental stages appears important for this favorable clinical course.CAG was supported by the Eunice Kennedy Shriver National Institute Of Child Health & Human Development of the National Institutes of Health under Award Number P50 HD103525. This work was funded by PID2020-112831GB-I00 AEI /10.13039/501100011033 (MN). SS was supported by a grant from the NIH/NINDS (K23NS119666). SWS is supported by the Hospital for Sick Children Foundation, Autism Speaks, and the University of Toronto McLaughlin Center. EM-G was supported by a grant from MICIU FPU18/06240. EVS. was supported by a grant from the NIH (EY025718). CRF was supported by the fund to support clinical research careers in the Region of Southern Denmark (Region Syddanmarks pulje for kliniske forskerkarriereforløb).Peer reviewe
    corecore