79 research outputs found

    Entropy of complex relevant components of Boolean networks

    Full text link
    Boolean network models of strongly connected modules are capable of capturing the high regulatory complexity of many biological gene regulatory circuits. We study numerically the previously introduced basin entropy, a parameter for the dynamical uncertainty or information storage capacity of a network as well as the average transient time in random relevant components as a function of their connectivity. We also demonstrate that basin entropy can be estimated from time-series data and is therefore also applicable to non-deterministic networks models.Comment: 8 pages, 6 figure

    The allele distribution in next-generation sequencing data sets is accurately described as the result of a stochastic branching process

    Get PDF
    With the availability of next-generation sequencing (NGS) technology, it is expected that sequence variants may be called on a genomic scale. Here, we demonstrate that a deeper understanding of the distribution of the variant call frequencies at heterozygous loci in NGS data sets is a prerequisite for sensitive variant detection. We model the crucial steps in an NGS protocol as a stochastic branching process and derive a mathematical framework for the expected distribution of alleles at heterozygous loci before measurement that is sequencing. We confirm our theoretical results by analyzing technical replicates of human exome data and demonstrate that the variance of allele frequencies at heterozygous loci is higher than expected by a simple binomial distribution. Due to this high variance, mutation callers relying on binomial distributed priors are less sensitive for heterozygous variants that deviate strongly from the expected mean frequency. Our results also indicate that error rates can be reduced to a greater degree by technical replicates than by increasing sequencing depth

    AI-based multi-PRS models outperform classical single-PRS models

    Get PDF
    Polygenic risk scores (PRS) calculate the risk for a specific disease based on the weighted sum of associated alleles from different genetic loci in the germline estimated by regression models. Recent advances in genetics made it possible to create polygenic predictors of complex human traits, including risks for many important complex diseases, such as cancer, diabetes, or cardiovascular diseases, typically influenced by many genetic variants, each of which has a negligible effect on overall risk. In the current study, we analyzed whether adding additional PRS from other diseases to the prediction models and replacing the regressions with machine learning models can improve overall predictive performance. Results showed that multi-PRS models outperform single-PRS models significantly on different diseases. Moreover, replacing regression models with machine learning models, i.e., deep learning, can also improve overall accuracy

    Extending the allelic spectrum at noncoding risk loci of orofacial clefting

    Get PDF
    Genome-wide association studies (GWAS) have generated unprecedented insights into the genetic etiology of orofacial clefting (OFC). The moderate effect sizes of associated noncoding risk variants and limited access to disease-relevant tissue represent considerable challenges for biological interpretation of genetic findings. As rare variants with stronger effect sizes are likely to also contribute to OFC, an alternative approach to delineate pathogenic mechanisms is to identify private mutations and/or an increased burden of rare variants in associated regions. This report describes a framework for targeted resequencing at selected noncoding risk loci contributing to nonsyndromic cleft lip with/without cleft palate (nsCL/P), the most frequent OFC subtype. Based on GWAS data, we selected three risk loci and identified candidate regulatory regions (CRRs) through the integration of credible SNP information, epigenetic data from relevant cells/tissues, and conservation scores. The CRRs (total 57 kb) were resequenced in a multiethnic study population (1061 patients; 1591 controls), using single-molecule molecular inversion probe technology. Combining evidence from in silico variant annotation, pedigree- and burden analyses, we identified 16 likely deleterious rare variants that represent new candidates for functional studies in nsCL/P. Our framework is scalable and represents a promising approach to the investigation of additional congenital malformations with multifactorial etiology

    Genome sequencing in families with congenital limb malformations

    Get PDF
    The extensive clinical and genetic heterogeneity of congenital limb malformation calls for comprehensive genome-wide analysis of genetic variation. Genome sequencing (GS) has the potential to identify all genetic variants. Here we aim to determine the diagnostic potential of GS as a comprehensive one-test-for-all strategy in a cohort of undiagnosed patients with congenital limb malformations. We collected 69 cases (64 trios, 1 duo, 5 singletons) with congenital limb malformations with no molecular diagnosis after standard clinical genetic testing and performed genome sequencing. We also developed a framework to identify potential noncoding pathogenic variants. We identified likely pathogenic/disease-associated variants in 12 cases (17.4%) including four in known disease genes, and one repeat expansion in HOXD13. In three unrelated cases with ectrodactyly, we identified likely pathogenic variants in UBA2, establishing it as a novel disease gene. In addition, we found two complex structural variants (3%). We also identified likely causative variants in three novel high confidence candidate genes. We were not able to identify any noncoding variants. GS is a powerful strategy to identify all types of genomic variants associated with congenital limb malformation, including repeat expansions and complex structural variants missed by standard diagnostic approaches. In this cohort, no causative noncoding SNVs could be identified. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00439-021-02295-y

    Interpreting the role of de novo protein-coding mutations in neuropsychiatric disease

    Get PDF
    Pedigree, linkage and association studies are consistent with heritable variation for complex disease due to the segregation of genetic factors in families and in the population. In contrast, de novo mutations make only minor contributions to heritability estimates for complex traits. Nonetheless, some de novo variants are known to be important in disease etiology. The identification of risk-conferring de novo variants will contribute to the discovery of etiologically relevant genes and pathways and may help in genetic counseling. There is considerable interest in the role of such mutations in complex neuropsychiatric disease, largely driven by new genotyping and sequencing technologies. An important role for large de novo copy number variations has been established. Recently, whole-exome sequencing has been used to extend the investigation of de novo variation to point mutations in protein-coding regions. Here, we consider several challenges for the interpretation of such mutations in the context of their role in neuropsychiatric disease

    De novo variants of CSNK2B cause a new intellectual disability-craniodigital syndrome by disrupting the canonical Wnt signaling pathway

    Get PDF
    CSNK2B encodes for casein kinase II subunit beta (CK2b), the regulatory subunit of casein kinase II (CK2), which is known to mediate diverse cellular pathways. Variants in this gene have been recently identified as a cause of Poirier-Bienvenu neurodevelopmental syndrome (POBINDS), but functional evidence is sparse. Here, we report five unrelated individuals: two of them manifesting POBINDS, while three are identified to segregate a new intellectual disability-craniodigital syndrome (IDCS), distinct from POBINDS. The three IDCS individuals carried two different de novo missense variants affecting the same codon of CSNK2B. Both variants, NP_001311.3; p.Asp32His and NP_001311.3; p.Asp32Asn, lead to an upregulation of CSNK2B expression at transcript and protein level, along with global dysregulation of canonical Wnt signaling. We found impaired interaction of the two key players DVL3 and b-catenin with mutated CK2b. The variants compromise the kinase activity of CK2 as evident by a marked reduction of phosphorylated b-catenin and consequent absence of active b-catenin inside nuclei of the patient-derived lymphoblastoid cell lines (LCLs). In line with these findings, whole-transcriptome profiling of patient-derived LCLs harboring the NP_001311.3; p.Asp32His variant confirmed a marked difference in expression of genes involved in the Wnt signaling pathway. In addition, whole-phosphoproteome analysis of the LCLs of the same subject showed absence of phosphorylation for 313 putative CK2 substrates, enriched in the regulation of nuclear b-catenin and transcription of the target genes. Our findings suggest that discrete variants in CSNK2B cause dominant-negative perturbation of the canonical Wnt signaling pathway, leading to a new craniodigital syndrome distinguishable from POBINDS

    Phenotypic and Genome-Wide Analysis of an Antibiotic-Resistant Small Colony Variant (SCV) of Pseudomonas aeruginosa

    Get PDF
    Small colony variants (SCVs) are slow-growing bacteria, which often show increased resistance to antibiotics and cause latent or recurrent infections. It is therefore important to understand the mechanisms at the basis of this phenotypic switch.One SCV (termed PAO-SCV) was isolated, showing high resistance to gentamicin and to the cephalosporine cefotaxime. PAO-SCV was prone to reversion as evidenced by emergence of large colonies with a frequency of 10(-5) on media without antibiotics while it was stably maintained in presence of gentamicin. PAO-SCV showed a delayed growth, defective motility, and strongly reduced levels of the quorum sensing Pseudomonas quinolone signal (PQS). Whole genome expression analysis further suggested a multi-layered antibiotic resistance mechanism, including simultaneous over-expression of two drug efflux pumps (MexAB-OprM, MexXY-OprM), the LPS modification operon arnBCADTEF, and the PhoP-PhoQ two-component system. Conversely, the genes for the synthesis of PQS were strongly down-regulated in PAO-SCV. Finally, genomic analysis revealed the presence of mutations in phoP and phoQ genes as well as in the mexZ gene encoding a repressor of the mexXY and mexAB-oprM genes. Only one mutation occurred only in REV, at nucleotide 1020 of the tufA gene, a paralog of tufB, both encoding the elongation factor Tu, causing a change of the rarely used aspartic acid codon GAU to the more common GAC, possibly causing an increase of tufA mRNA translation. High expression of phoP and phoQ was confirmed for the SCV variant while the revertant showed expression levels reduced to wild-type levels.By combining data coming from phenotypic, gene expression and proteome analysis, we could demonstrate that resistance to aminoglycosides in one SCV mutant is multifactorial including overexpression of efflux mechanisms, LPS modification and is accompanied by a drastic down-regulation of the Pseudomonas quinolone signal quorum sensing system
    corecore