43 research outputs found

    Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation.

    Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

    The Dyad Symmetry Element of Epstein-Barr Virus Is a Dominant but Dispensable Replication Origin

    OriP, the latent origin of Epstein-Barr virus (EBV), consists of two essential elements: the dyad symmetry (DS) and the family of repeats (FR). The function of these elements has been predominantly analyzed in plasmids transfected into transformed cells. Here, we examined the molecular functions of DS in its native genomic context and at an ectopic position in the mini-EBV episome. Mini-EBV plasmids contain 41% of the EBV genome including all information required for the proliferation of human B cells. Both FR and DS function independently of their genomic context. We show that DS is the most active origin of replication present in the mini-EBV genome regardless of its location, and it is characterized by the binding of the origin recognition complex (ORC) allowing subsequent replication initiation. Surprisingly, the integrity of oriP is not required for the formation of the pre-replicative complex (pre-RC) at or near DS. In addition, we show that initiation events occurring at sites other than the DS are also limited to once per cell cycle and that they are ORC-dependent. The deletion of DS increases initiation from alternative origins, which are normally used very infrequently in the mini-EBV genome. The sequence-independent distribution of ORC-binding, pre-RC-assembly, and initiation patterns indicates that a large number of silent origins are present in the mini-EBV genome. We conclude that, in mini-EBV genomes lacking the DS element, the absence of a strong ORC binding site results in an increase of ORC binding at dispersed sites.

    Transcription Initiation Activity Sets Replication Origin Efficiency in Mammalian Cells

    Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS), suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation, unveiling a new hallmark in ORI efficiency regulation in mammalian cells.

    Nuclear Scaffold Attachment Sites within ENCODE Regions Associate with Actively Transcribed Genes

    The human genome must be packaged and organized in a functional manner for the regulation of DNA replication and transcription. The nuclear scaffold/matrix, consisting of structural and functional nuclear proteins, remains after extraction of nuclei and anchors loops of DNA. In the search for cis-elements functioning as chromatin domain boundaries, we identified 453 nuclear scaffold attachment sites purified by lithium-3,5-iodosalicylate extraction of HeLa nuclei across 30 Mb of the human genome studied by the ENCODE pilot project. The scaffold attachment sites mapped predominantly near expressed genes and localized near transcription start sites and the ends of genes but not to boundary elements. In addition, these regions were enriched for RNA polymerase II and transcription factor binding sites and were located in early replicating regions of the genome. We believe these sites correspond to genome interactions mediated by transcription factors and transcriptional machinery immobilized on a nuclear substructure.

    Replication Fork Polarity Gradients Revealed by Megabase-Sized U-Shaped Replication Timing Domains in Human Cell Lines

    In higher eukaryotes, replication program specification in different cell types remains to be fully understood. We show for seven human cell lines that about half of the genome is divided into domains that display a characteristic U-shaped replication timing profile with early initiation zones at borders and late replication at centers. Significant overlap is observed between U-domains of different cell lines and also with germline replication domains exhibiting an N-shaped nucleotide compositional skew. From the demonstration that the average fork polarity is directly reflected by both the compositional skew and the derivative of the replication timing profile, we argue that the fact that this derivative displays an N-shape in U-domains sustains the existence of large-scale gradients of replication fork polarity in somatic and germline cells. Analysis of chromatin interaction (Hi-C) and chromatin marker data reveals that U-domains correspond to high-order chromatin structural units. We discuss possible models for replication origin activation within U/N-domains. The compartmentalization of the genome into replication U/N-domains provides new insights into the organization of the replication program in the human genome.

    Cdc45 Limits Replicon Usage from a Low Density of preRCs in Mammalian Cells

    Little is known about mammalian preRC stoichiometry, the number of preRCs on chromosomes, and how this relates to replicon size and usage. We show here that, on average, each 100 kb of the mammalian genome contains a preRC composed of approximately one ORC hexamer, 4–5 MCM hexamers, and 2 Cdc6. Relative to these subunits, ∼0.35 total molecules of the pre-Initiation Complex factor Cdc45 are present. Thus, based on ORC availability, somatic cells contain ∼70,000 preRCs of this average total stoichiometry, although subunits may not be juxtaposed with each other. Except for ORC, the chromatin-bound complement of preRC subunits is even lower. Cdc45 is present at very low levels relative to the preRC subunits, but is highly stable, and the same limited number of stable Cdc45 molecules are present from the beginning of S-phase to its completion. Efforts to artificially increase Cdc45 levels through ectopic expression block cell growth. However, microinjection of excess purified Cdc45 into S-phase nuclei activates additional replication foci by three-fold, indicating that Cdc45 functions to activate dormant preRCs and is rate-limiting for somatic replicon usage. Paradoxically, although Cdc45 colocalizes in vivo with some MCM sites and is rate-limiting for DNA replication to occur, neither Cdc45 nor MCMs colocalize with active replication sites. Embryonic metazoan chromatin consists of small replicons that are used efficiently via an excess of preRC subunits. In contrast, somatic mammalian cells contain a low density of preRCs, each containing only a few MCMs that compete for limiting amounts of Cdc45. This provides a molecular explanation for why, relative to embryonic replicon dynamics, somatic replicons are, on average, larger and origin efficiency tends to be lower. The stable, continuous, and rate-limiting nature of Cdc45 suggests that Cdc45 contributes to the staggering of replicon usage throughout S-phase, and that replicon activation requires reutilization of existing Cdc45 during S-phase.

    Evidence for Sequential and Increasing Activation of Replication Origins along Replication Timing Gradients in the Human Genome

    Genome-wide replication timing studies have suggested that mammalian chromosomes consist of megabase-scale domains of coordinated origin firing separated by large originless transition regions. Here, we report a quantitative genome-wide analysis of DNA replication kinetics in several human cell types that contradicts this view. DNA combing in HeLa cells sorted into four temporal compartments of S phase shows that replication origins are spaced at 40 kb intervals and fire as small clusters whose synchrony increases during S phase and that replication fork velocity (mean 0.7 kb/min, maximum 2.0 kb/min) remains constant and narrowly distributed through S phase. However, multi-scale analysis of a genome-wide replication timing profile shows a broad distribution of replication timing gradients with practically no regions larger than 100 kb replicating at less than 2 kb/min. Therefore, HeLa cells lack large regions of unidirectional fork progression. Temporal transition regions are replicated by sequential activation of origins at a rate that increases during S phase and replication timing gradients are set by the delay and the spacing between successive origin firings rather than by the velocity of single forks. Activation of internal origins in a specific temporal transition region is directly demonstrated by DNA combing of the IGH locus in HeLa cells. Analysis of published origin maps in HeLa cells and published replication timing and DNA combing data in several other cell types corroborates these findings, with the interesting exception of embryonic stem cells where regions of unidirectional fork progression seem more abundant. These results can be explained if origins fire independently of each other but under the control of long-range chromatin structure, or if replication forks progressing from early origins stimulate initiation in nearby unreplicated DNA. These findings shed new light on the replication timing program of mammalian genomes and provide a general model for their replication kinetics.