20 research outputs found

    Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

    Get PDF
    Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism

    Regulation of the Gene Encoding Transaction Elongation-Factor-3 During Growth and Morphogenesis in Candida Albicans

    No full text
    The level of the TEF3 mRNA, which encodes the fungal-specific translation elongation factor 3 (EF-3), was measured during the yeast-to-hyphal transition in Candida albicans. In contrast to a previous report, TEF3 mRNA levels were shown to change during dilution into fresh medium, increasing only transiently when dimorphism was induced by either (i) an increase in growth temperature (from 25 degrees C to 37 degrees C) combined with the addition of 10% (v/v) bovine calf serum to the medium, or (ii) an increase in growth temperature (from 25 degrees C to 37 degrees C) combined with an increase in the ph of the medium (from pH 4.5 to 6.5). TEF3 mRNA levels also increased in control cultures under conditions where germ tubes were not formed. but they remained elevated in contrast to cultures undergoing morphological changes. TEF3 mRNA levels were not significantly affected by heat-shock, but were tightly regulated during batch growth of the yeast form, reaching maximal levels in exponential phase. Therefore, the changes in TEF3 expression that accompany the dimorphic transition in C. albicans appear to reflect the underlying physiological changes that occur during morphogenesis and are not a response to morphogenesis per se. For this reason TEF3 mRNA measurement cannot be used as a loading control in Northern analyses of dimorphic gene regulation. Comparison of TEF3 mRNA levels with the abundance of the EF-3 polypeptide indicated that the synthesis of this essential translation factor might be subject to post-transcriptional regulation

    Expression of seven members of the gene family encoding secretory aspartyl proteinases in Candida albicans

    No full text
    The opportunistic fungal pathogen Candida albicans produces secretory aspartyl proteinases, which are believed to be virulence factors in infection. We have studied the in vitro expression of seven known members of the SAP gene family in a range of strains and serotypes by Northern analysis. SAP1 and SAP3 were regulated during phenotypic switching between the white and opaque forms of the organism. The SAP2 mRNA, which was the dominant transcript in the yeast form, was found to be autoinduced by peptide products of Sap2 activity and to be repressed by amino acids. The expression of the closely related SAP4-SAP6 genes was observed only at neutral pH during serum-induced yeast to hyphal transition. No SAP7 mRNA was detected under any of the conditions or in any of the strains tested. Our data suggest that the various members of the SAP gene family may have distinct roles in the colonization and invasion of the host
    corecore