Determination of the in vivo structural DNA loop organization in the genomic region of the rat albumin locus by means of a topological approach
Nuclear DNA of metazoans is organized in supercoiled loops anchored to a proteinaceous substructure known as the nuclear matrix (NM). DNA is anchored to the NM by non-coding sequences known as matrix attachment regions (MARs). There are no consensus sequences for identifying MARs, and not all potential MARs are actually bound to the NM as loop attachment regions (LARs). Fundamental processes of nuclear physiology occur at macromolecular complexes organized on the NM; thus, the topological organization of DNA loops must be important. Here, we describe a general method for determining the structural DNA loop organization in any large genomic region with a known sequence. The method exploits the topological properties of loop DNA attached to the NM and elementary topological principles: points on a deformable string (DNA) can be positionally mapped relative to a position-reference invariant (NM), and from such mapping the configuration of the string in the third dimension can be deduced. Therefore, it is possible to determine the specific DNA loop configuration without previous characterization of the LARs involved. We determined, in rat hepatocytes and B-lymphocytes, the DNA loop organization of a genomic region that contains four members of the albumin gene family.
Expanded encyclopaedias of DNA elements in the human and mouse genomes
All data are available on the ENCODE data portal: www.encodeproject.org. All code is available on GitHub from the links provided in the methods section. Code related to the Registry of cCREs can be found at https://github.com/weng-lab/ENCODE-cCREs. Code related to SCREEN can be found at https://github.com/weng-lab/SCREEN. © The Author(s) 2020. The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE¹ and Roadmap Epigenomics² data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9% and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource.
Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes. This work was supported by grants from the NIH under U01HG007019, U01HG007033, U01HG007036, U01HG007037, U41HG006992, U41HG006993, U41HG006994, U41HG006995, U41HG006996, U41HG006997, U41HG006998, U41HG006999, U41HG007000, U41HG007001, U41HG007002, U41HG007003, U54HG006991, U54HG006997, U54HG006998, U54HG007004, U54HG007005, U54HG007010 and UM1HG009442.
Stage-Matching of Human, Marmoset, Mouse, and Pig Embryos to Enhance Organ Development Through Interspecies Chimerism
Currently, there is a significant shortage of transplantable organs for patients in need. Interspecies chimerism and blastocyst complementation are alternatives for generating transplantable human organs in host animals such as pigs to meet this shortage. While successful interspecies chimerism and organ generation have been observed between evolutionarily close species such as rat and mouse, barriers still exist for more distant species pairs such as human–mouse, marmoset–mouse, human–pig, and others. One of the proposed barriers to chimerism is the difference in developmental stage between the donor cells and the host embryo at the time the cells are introduced. Hence, there is a logical effort to stage-match the donor cells with the host embryos to enhance interspecies chimerism. In this study, we used an in silico approach to stage-match the early developing embryos of four species (human, marmoset, mouse, and pig) based on transcriptome similarities. We used an unsupervised clustering algorithm to simultaneously stage-match all four species, as well as Spearman’s correlation analyses to stage-match pairs of donor–host species. From our stage-matching analyses, we found that the four stages that best matched each other are the human blastocyst (E6/E7), the gastrulating mouse embryo (E6–E6.75), the marmoset late inner cell mass, and the pig late blastocyst. We further demonstrated that human pluripotent stem cells best matched the mouse post-implantation stages. We also performed ontology analysis of the genes upregulated and commonly expressed between donor–host species pairs at their best-matched stages. The stage-matching results predicted by this study will inform in vivo and in vitro interspecies chimerism and blastocyst complementation studies and can be used to match donor cells with host embryos between multiple species pairs to enhance chimerism for organogenesis.
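The pairwise stage-matching idea described in this abstract can be illustrated with a minimal sketch: compute Spearman's rank correlation between a donor expression profile and each host stage's profile, then pick the host stage with the highest correlation. This is not the authors' pipeline. The function names, the stage labels (`stage_A`, `stage_B`) and the expression values are hypothetical toy inputs, and real analyses would work on thousands of orthologous genes after normalization.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def best_matching_stage(donor_profile, host_stages):
    """Score every host stage against the donor and return the best one."""
    scores = {
        stage: spearman(donor_profile, profile)
        for stage, profile in host_stages.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical toy data: expression of the same four genes in each sample.
donor = [5.0, 1.0, 3.0, 8.0]
host = {
    "stage_A": [4.0, 2.0, 3.0, 9.0],  # same gene ordering as the donor
    "stage_B": [9.0, 3.0, 2.0, 1.0],  # largely reversed ordering
}
best, scores = best_matching_stage(donor, host)
print(best, scores)
```

Because Spearman's correlation depends only on rank order, it is insensitive to monotonic differences in scale between species or sequencing platforms, which is one plausible reason to prefer it over Pearson's correlation for cross-species comparisons.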
Allele-specific control of replication timing and genome organization during development
DNA replication occurs in a defined temporal order known as the replication-timing (RT) program. RT is regulated during development in discrete chromosomal units, coordinated with transcriptional activity and 3D genome organization. Here, we derived distinct cell types from F1 hybrid musculus × castaneus mouse crosses and exploited the high single-nucleotide polymorphism (SNP) density to characterize allelic differences in RT (Repli-seq), genome organization (Hi-C and promoter-capture Hi-C), gene expression (total nuclear RNA-seq), and chromatin accessibility (ATAC-seq). We also present HARP, a new computational tool for sorting SNPs in phased genomes to efficiently measure allele-specific genome-wide data. Analysis of six different hybrid mESC clones with different genomes (C57BL/6, 129/sv, and CAST/Ei), parental configurations, and gender revealed significant RT asynchrony between alleles across ∼12% of the autosomal genome, linked to subspecies genomes but not to parental origin, growth conditions, or gender. RT asynchrony in mESCs correlated strongly with changes in Hi-C compartments between alleles but less strongly with SNP density, gene expression, imprinting, or chromatin accessibility. We then tracked mESC RT asynchronous regions during development by analyzing differentiated cell types, including extraembryonic endoderm stem (XEN) cells, four male and female primary mouse embryonic fibroblasts (MEFs), and neural precursor cells (NPCs) differentiated in vitro from mESCs with opposite parental configurations. We found that RT asynchrony and allelic discordance in Hi-C compartments seen in mESCs were largely lost in all differentiated cell types, accompanied by novel sites of allelic asynchrony at a considerably smaller proportion of the genome, suggesting that genome organization of homologs converges to similar folding patterns during cell fate commitment.
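The general principle behind allele-specific measurement in an F1 hybrid can be sketched as follows: a read is assigned to one parental genome when the bases it carries at known SNP positions agree with that genome. This sketch is not HARP and makes no claim about how HARP works. The function `assign_read`, the labels `"A"` and `"B"`, the coordinates and the sequences are all hypothetical, and the sketch assumes gap-free alignments with 0-based positions.

```python
def assign_read(read_start, read_seq, snps):
    """Assign a read to parental genome "A" or "B" by voting over SNPs.

    snps maps a genomic position to a (base_in_A, base_in_B) tuple.
    Reads that overlap no SNP, or whose votes are tied, are "unassigned".
    """
    votes_a = votes_b = 0
    for offset, base in enumerate(read_seq):
        position = read_start + offset
        if position in snps:
            base_a, base_b = snps[position]
            if base == base_a:
                votes_a += 1
            elif base == base_b:
                votes_b += 1
            # A base matching neither allele is ignored (likely an error).
    if votes_a > votes_b:
        return "A"
    if votes_b > votes_a:
        return "B"
    return "unassigned"


# Hypothetical phased SNPs: position -> (base in genome A, base in genome B).
snps = {102: ("A", "G"), 105: ("C", "T")}

# Hypothetical reads as (alignment start, sequence) pairs.
reads = [
    (100, "TTATTC"),  # carries the A-genome base at both SNPs
    (100, "TTGTTT"),  # carries the B-genome base at both SNPs
    (103, "TT"),      # overlaps no SNP
    (100, "TTATTT"),  # one vote each way
]
assignments = [assign_read(start, seq, snps) for start, seq in reads]
print(assignments)
```

Counting the reads assigned to each genome within a window then gives an allele-specific signal for that window. Only reads that overlap an informative SNP contribute, which is why a high SNP density between the parental strains matters for this kind of analysis.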