72 research outputs found

    Conserved DNA sequence features underlie pervasive RNA polymerase pausing

    Get PDF
    Pausing of transcribing RNA polymerase is regulated and creates opportunities to control gene expression. Research in metazoans has so far mainly focused on RNA polymerase II (Pol II) promoter-proximal pausing leaving the pervasive nature of pausing and its regulatory potential in mammalian cells unclear. Here, we developed a pause detecting algorithm (PDA) for nucleotide-resolution occupancy data and a new native elongating transcript sequencing approach, termed nested NET-seq, that strongly reduces artifactual peaks commonly misinterpreted as pausing sites. Leveraging PDA and nested NET-seq reveal widespread genome-wide Pol II pausing at single-nucleotide resolution in human cells. Notably, the majority of Pol II pauses occur outside of promoter-proximal gene regions primarily along the gene-body of transcribed genes. Sequence analysis combined with machine learning modeling reveals DNA sequence properties underlying widespread transcriptional pausing including a new pause motif. Interestingly, key sequence determinants of RNA polymerase pausing are conserved between human cells and bacteria. These studies indicate pervasive sequence-induced transcriptional pausing in human cells and the knowledge of exact pause locations implies potential functional roles in gene expression

    Risk patterns and correlated brain activities. Multidimensional statistical analysis of FMRI data in economic decision making study

    No full text
    Decision making usually involves uncertainty and risk. Understanding which parts of the human brain are activated during decisions under risk and which neural processes underly (risky) investment decisions are important goals in neuroeconomics. Here, we analyze functional magnetic resonance imaging (fMRI) data on 17 subjects who were exposed to an investment decision task from Mohr, Biele, Krugel, Li, and Heekeren (in NeuroImage 49, 2556–2563, 2010b). We obtain a time series of three-dimensional images of the blood-oxygen-level dependent (BOLD) fMRI signals. We apply a panel version of the dynamic semiparametric factor model (DSFM) presented in Park, Mammen, Wolfgang, and Borak (in Journal of the American Statistical Association 104(485), 284–298, 2009) and identify task-related activations in space and dynamics in time. With the panel DSFM (PDSFM) we can capture the dynamic behavior of the specific brain regions common for all subjects and represent the high-dimensional time-series data in easily interpretable low-dimensional dynamic factors without large loss of variability. Further, we classify the risk attitudes of all subjects based on the estimated low-dimensional time series. Our classification analysis successfully confirms the estimated risk attitudes derived directly from subjects’ decision behavior

    Assessing genome-wide dynamic changes in enhancer activity during early mESC differentiation by FAIRE-STARR-seq

    Get PDF
    Embryonic stem cells (ESCs) can differentiate into any given cell type and therefore represent a versatile model to study the link between gene regulation and differentiation. To quantitatively assess the dynamics of enhancer activity during the early stages of murine ESC differentiation, we analyzed accessible genomic regions using STARR-seq, a massively parallel reporter assay. This resulted in a genome-wide quantitative map of active mESC enhancers, in pluripotency and during the early stages of differentiation. We find that only a minority of accessible regions is active and that such regions are enriched near promoters, characterized by specific chromatin marks, enriched for distinct sequence motifs, and modeling shows that active regions can be predicted from sequence alone. Regions that change their activity upon retinoic acid-induced differentiation are more prevalent at distal intergenic regions when compared to constitutively active enhancers. Further, analysis of differentially active enhancers verified the contribution of individual TF motifs toward activity and inducibility as well as their role in regulating endogenous genes. Notably, the activity of retinoic acid receptor alpha (RARα) occupied regions can either increase or decrease upon the addition of its ligand, retinoic acid, with the direction of the change correlating with spacing and orientation of the RARα consensus motif and the co-occurrence of additional sequence motifs. Together, our genome-wide enhancer activity map elucidates features associated with enhancer activity levels, identifies regulatory regions disregarded by computational prediction tools, and provides a resource for future studies into regulatory elements in mESCs

    Ten millennia of hepatitis B virus evolution

    Get PDF
    Hepatitis B virus (HBV) has been infecting humans for millennia and remains a global health problem, but its past diversity and dispersal routes are largely unknown. We generated HBV genomic data from 137 Eurasians and Native Americans dated between ~10,500 and ~400 years ago. We date the most recent common ancestor of all HBV lineages to between ~20,000 and 12,000 years ago, with the virus present in European and South American hunter-gatherers during the early Holocene. After the European Neolithic transition, Mesolithic HBV strains were replaced by a lineage likely disseminated by early farmers that prevailed throughout western Eurasia for ~4000 years, declining around the end of the 2nd millennium BCE. The only remnant of this prehistoric HBV diversity is the rare genotype G, which appears to have reemerged during the HIV pandemic

    Unsuccessful therapy with adefovir and entecavir-tenofovir in a patient with chronic hepatitis B infection with previous resistance to lamivudine: a fourteen-year evolution of hepatitis B virus mutations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Complex mutants can be selected under sequential selective pressure by HBV therapy. To determine hepatitis B virus genomic evolution during antiviral therapy we characterized the HBV quasi-species in a patient who did no respond to therapy following lamivudine breakthrough for a period of 14 years.</p> <p>Case Presentation</p> <p>The polymerase and precore/core genes were amplified and sequenced at determined intervals in a period of 14 years. HBV viral load and HBeAg/Anti-HBe serological profiles as well as amino transferase levels were also measured. A mixture of lamivudine-resistant genotype A2 HBV strains harboring the rtM204V mutation coexisted in the patient following viral breakthrough to lamivudine. The L180M+M204V dominant mutant displayed strong lamivudine-resistance. As therapy was changed to adefovir, then to entecavir, and finally to entecavir-tenofovir the viral load showed fluctuations but lamivudine-resistant strains continued to be selected, with minor contributions to the HBV quasi-species composition of additional resistance-associated mutations. At the end of the 14-year follow up period, high viral loads were predominant, with viral strains harboring the lamivudine-resistance signature rtL180M+M204V. The precore/core frame A1762T and G1764A double mutation was detected before treatment and remaining in this condition during the entire follow-up. Specific entecavir and tenofovir primary resistance-associated mutations were not detected at any time. Plasma concentrations of tenofovir indicated adequate metabolism of the drug.</p> <p>Conclusions</p> <p>We report the selection of HBV mutants carrying well-defined primary resistance mutations that escaped lamivudine in a fourteen-year follow-up period. With the exception of tenofovir resistance mutations, subsequent unselected primary resistance mutations were detected as minor populations into the HBV quasispecies composition during adefovir or entecavir monotherapies. Although tenofovir is considered an appropriate therapeutic alternative for the treatment of entecavir-unresponsive patients, its use was not effective in the case reported here.</p

    X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

    Get PDF
    X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases

    Exome-wide Rare Variant Analysis Identifies TUBA4A Mutations Associated with Familial ALS

    Get PDF
    Exome sequencing is an effective strategy for identifying human disease genes. However, this methodology is difficult in late-onset diseases where limited availability of DNA from informative family members prohibits comprehensive segregation analysis. To overcome this limitation, we performed an exome-wide rare variant burden analysis of 363 index cases with familial ALS (FALS). The results revealed an excess of patient variants within TUBA4A, the gene encoding the Tubulin, Alpha 4A protein. Analysis of a further 272 FALS cases and 5,510 internal controls confirmed the overrepresentation as statistically significant and replicable. Functional analyses revealed that TUBA4A mutants destabilize the microtubule network, diminishing its repolymerization capability. These results further emphasize the role of cytoskeletal defects in ALS and demonstrate the power of gene-based rare variant analyses in situations where causal genes cannot be identified through traditional segregation analysis

    Ten millennia of hepatitis B virus evolution

    Get PDF
    Hepatitis B virus (HBV) has been infecting humans for millennia and remains a global health problem, but its past diversity and dispersal routes are largely unknown. We generated HBV genomic data from 137 Eurasians and Native Americans dated between similar to 10,500 and similar to 400 years ago. We date the most recent common ancestor of all HBV lineages to between similar to 20,000 and 12,000 years ago, with the virus present in European and South American hunter-gatherers during the early Holocene. After the European Neolithic transition, Mesolithic HBV strains were replaced by a lineage likely disseminated by early farmers that prevailed throughout western Eurasia for similar to 4000 years, declining around the end of the 2nd millennium BCE. The only remnant of this prehistoric HBV diversity is the rare genotype G, which appears to have reemerged during the HIV pandemic.Molecular Technology and Informatics for Personalised Medicine and Healt

    Ten millennia of hepatitis B virus evolution

    Get PDF
    Hepatitis B virus (HBV) has been infecting humans for millennia and remains a global health problem, but its past diversity and dispersal routes are largely unknown. We generated HBV genomic data from 137 Eurasians and Native Americans dated between ~10,500 and ~400 years ago. We date the most recent common ancestor of all HBV lineages to between ~20,000 and 12,000 years ago, with the virus present in European and South American hunter-gatherers during the early Holocene. After the European Neolithic transition, Mesolithic HBV strains were replaced by a lineage likely disseminated by early farmers that prevailed throughout western Eurasia for ~4000 years, declining around the end of the 2nd millennium BCE. The only remnant of this prehistoric HBV diversity is the rare genotype G, which appears to have reemerged during the HIV pandemic
    corecore