504 research outputs found

    Breakpoint regions and homologous synteny blocks in chromosomes have different evolutionary histories

    Get PDF
    The persistence of large blocks of homologous synteny and a high frequency of breakpoint reuse are distinctive features of mammalian chromosomes that are not well understood in evolutionary terms. To gain a better understanding of the evolutionary forces that affect genome architecture, synteny relationships among 10 amniotes (human, chimp, macaque, rat, mouse, pig, cattle, dog, opossum, and chicken) were compared at <1 human-Mbp resolution. Homologous synteny blocks (HSBs; N = 2233) and chromosome evolutionary breakpoint regions (EBRs; N = 1064) were identified from pairwise comparisons of all genomes. Analysis of the size distribution of HSBs shared in all 10 species' chromosomes (msHSBs) identified three (>20 Mbp) that are larger than expected by chance. Gene network analysis of msHSBs >3 human-Mbp and EBRs <1 Mbp demonstrated that msHSBs are significantly enriched for genes involved in development of the central nervous and other organ systems, whereas EBRs are enriched for genes associated with adaptive functions. In addition, we found EBRs are significantly enriched for structural variations (segmental duplications, copy number variants, and indels), retrotransposed and zinc finger genes, and single nucleotide polymorphisms. These results demonstrate that chromosome breakage in evolution is nonrandom and that HSBs and EBRs are evolving in distinctly different ways. We suggest that natural selection acts on the genome to maintain combinations of genes and their regulatory elements that are essential to fundamental processes of amniote development and biological organization. Furthermore, EBRs may be used extensively to generate new genetic variation and novel combinations of genes and regulatory elements that contribute to adaptive phenotypes

    CGAT: a comparative genome analysis tool for visualizing alignments in the analysis of complex evolutionary changes between closely related genomes

    Get PDF
    BACKGROUND: The recent accumulation of closely related genomic sequences provides a valuable resource for the elucidation of the evolutionary histories of various organisms. However, although numerous alignment calculation and visualization tools have been developed to date, the analysis of complex genomic changes, such as large insertions, deletions, inversions, translocations and duplications, still presents certain difficulties. RESULTS: We have developed a comparative genome analysis tool, named CGAT, which allows detailed comparisons of closely related bacteria-sized genomes mainly through visualizing middle-to-large-scale changes to infer underlying mechanisms. CGAT displays precomputed pairwise genome alignments on both dotplot and alignment viewers with scrolling and zooming functions, and allows users to move along the pre-identified orthologous alignments. Users can place several types of information on this alignment, such as the presence of tandem repeats or interspersed repetitive sequences and changes in G+C contents or codon usage bias, thereby facilitating the interpretation of the observed genomic changes. In addition to displaying precomputed alignments, the viewer can dynamically calculate the alignments between specified regions; this feature is especially useful for examining the alignment boundaries, as these boundaries are often obscure and can vary between programs. Besides the alignment browser functionalities, CGAT also contains an alignment data construction module, which contains various procedures that are commonly used for pre- and post-processing for large-scale alignment calculation, such as the split-and-merge protocol for calculating long alignments, chaining adjacent alignments, and ortholog identification. Indeed, CGAT provides a general framework for the calculation of genome-scale alignments using various existing programs as alignment engines, which allows users to compare the outputs of different alignment programs. Earlier versions of this program have been used successfully in our research to infer the evolutionary history of apparently complex genome changes between closely related eubacteria and archaea. CONCLUSION: CGAT is a practical tool for analyzing complex genomic changes between closely related genomes using existing alignment programs and other sequence analysis tools combined with extensive manual inspection

    The evolution of chromatin folding in mammals: a role for CTCF

    Get PDF
    The organisation of the DNA inside eukaryotic cell nuclei is not random. Rather than being intermingled with one another, chromosomes are compacted in a hierarchical manner which is conserved throughout eukaryote evolution. Topological domains have recently emerged as key architectural building blocks of chromosomes in complex genomes. This thesis aimed to explore the evolutionary dynamics of chromatin architecture in order to shed light on its functional relevance and on how architectural proteins can shape it. Comparative Hi-C in liver cells from four mammalian species was used to characterise conservation and divergence of chromosomal structure within distantly related genomes. Results show that the modular organisation of chromosomes is robustly conserved in syntenic regions, and that domain structure is maintained during chromosomal rearrangements. This conservation is compatible with the evolution of the binding landscape of the architectural protein CTCF. Specifically, conserved CTCF sites are more often co-localised with cohesin, enriched at strong topological domain borders and bind to DNA motifs that are under strong selection. Interestingly, CTCF binding sites which are divergent between species are strongly correlated with divergence of internal domain structure. This divergence is likely driven by CTCF binding sequence changes, demonstrating how genome evolution can be linked directly with a continuous flux of local chromosome conformation changes. Finally, the architectural activity of CTCF is cohesin- dependent and the manner in which individual CTCF/cohesin sites choose interacting partners is dictated by the orientation of the CTCF binding motif

    Analysis of the P. lividus sea urchin genome highlights contrasting trends of genomic and regulatory evolution in deuterostomes

    Get PDF
    Sea urchins are emblematic models in developmental biology and display several characteristics that set them apart from other deuterostomes. To uncover the genomic cues that may underlie these specificities, we generated a chromosome-scale genome assembly for the sea urchin Paracentrotus lividus and an extensive gene expression and epigenetic profiles of its embryonic development. We found that, unlike vertebrates, sea urchins retained ancestral chromosomal linkages but underwent very fast intrachromosomal gene order mixing. We identified a burst of gene duplication in the echinoid lineage and showed that some of these expanded genes have been recruited in novel structures (water vascular system, Aristotle's lantern, and skeletogenic micromere lineage). Finally, we identified gene-regulatory modules conserved between sea urchins and chordates. Our results suggest that gene-regulatory networks controlling development can be conserved despite extensive gene order rearrangement

    Analysis of the P. lividus sea urchin genome highlights contrasting trends of genomic and regulatory evolution in deuterostomes

    Full text link
    Sea urchins are emblematic models in developmental biology and display several characteristics that set them apart from other deuterostomes. To uncover the genomic cues that may underlie these specificities, we generated a chromosome-scale genome assembly for the sea urchin Paracentrotus lividus and an extensive gene expression and epigenetic profiles of its embryonic development. We found that, unlike vertebrates, sea urchins retained ancestral chromosomal linkages but underwent very fast intrachromosomal gene order mixing. We identified a burst of gene duplication in the echinoid lineage and showed that some of these expanded genes have been recruited in novel structures (water vascular system, Aristotle's lantern, and skeletogenic micromere lineage). Finally, we identified gene-regulatory modules conserved between sea urchins and chordates. Our results suggest that gene-regulatory networks controlling development can be conserved despite extensive gene order rearrangement

    Highly syntenic regions in the genomes of soybean, Medicago truncatula, and Arabidopsis thaliana

    Get PDF
    BACKGROUND: Recent genome sequencing enables mega-base scale comparisons between related genomes. Comparisons between animals, plants, fungi, and bacteria demonstrate extensive synteny tempered by rearrangements. Within the legume plant family, glimpses of synteny have also been observed. Characterizing syntenic relationships in legumes is important in transferring knowledge from model legumes to crops that are important sources of protein, fixed nitrogen, and health-promoting compounds. RESULTS: We have uncovered two large soybean regions exhibiting synteny with M. truncatula and with a network of segmentally duplicated regions in Arabidopsis. In all, syntenic regions comprise over 500 predicted genes spanning 3 Mb. Up to 75% of soybean genes are colinear with M. truncatula, including one region in which 33 of 35 soybean predicted genes with database support are colinear to M. truncatula. In some regions, 60% of soybean genes share colinearity with a network of A. thaliana duplications. One region is especially interesting because this 500 kbp segment of soybean is syntenic to two paralogous regions in M. truncatula on different chromosomes. Phylogenetic analysis of individual genes within these regions demonstrates that one is orthologous to the soybean region, with which it also shows substantially denser synteny and significantly lower levels of synonymous nucleotide substitutions. The other M. truncatula region is inferred to be paralogous, presumably resulting from a duplication event preceding speciation. CONCLUSION: The presence of well-defined M. truncatula segments showing orthologous and paralogous relationships with soybean allows us to explore the evolution of contiguous genomic regions in the context of ancient genome duplication and speciation events

    Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters.</p> <p>Results</p> <p>In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters <it>C. virginica, C. gigas</it>, and <it>C. hongkongensis</it>. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes.</p> <p>Conclusions</p> <p>Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.</p

    The genome diversity and karyotype evolution of mammals

    Get PDF
    The past decade has witnessed an explosion of genome sequencing and mapping in evolutionary diverse species. While full genome sequencing of mammals is rapidly progressing, the ability to assemble and align orthologous whole chromosome regions from more than a few species is still not possible. The intense focus on building of comparative maps for companion (dog and cat), laboratory (mice and rat) and agricultural (cattle, pig, and horse) animals has traditionally been used as a means to understand the underlying basis of disease-related or economically important phenotypes. However, these maps also provide an unprecedented opportunity to use multispecies analysis as a tool for inferring karyotype evolution. Comparative chromosome painting and related techniques are now considered to be the most powerful approaches in comparative genome studies. Homologies can be identified with high accuracy using molecularly defined DNA probes for fluorescence in situ hybridization (FISH) on chromosomes of different species. Chromosome painting data are now available for members of nearly all mammalian orders. In most orders, there are species with rates of chromosome evolution that can be considered as 'default' rates. The number of rearrangements that have become fixed in evolutionary history seems comparatively low, bearing in mind the 180 million years of the mammalian radiation. Comparative chromosome maps record the history of karyotype changes that have occurred during evolution. The aim of this review is to provide an overview of these recent advances in our endeavor to decipher the karyotype evolution of mammals by integrating the published results together with some of our latest unpublished results

    Lymphocyte evolution and ontogeny in non-eutherian mammals

    Get PDF
    Based on their different reproductive modes and evolutionary history, extant mammals are divided into three lineages: eutherians, marsupials and monotremes. Marsupials are the closest relatives to eutherians. They are born immune-incompetent. The characteristic that they develop their immune system postnatally makes them unique models to study newborn immune development and maternal immunology. As monotremes comprises the basal lineage of mammals, the study of monotreme immunology will provide unique insights regarding the origin of the mammalian immune system. Unfortunately, compared to our in-depth knowledge of eutherian immunology, especially humans and mice, knowledge of marsupial and monotreme immunology is scarce. The goal of the first part of this research was to investigate the development of immune-competence in a model marsupial, the gray short-tailed opossum, Monodelphis domestica. This research specifically looked at the ontogeny of B cells with emphasis on diversity of immunoglobulin genes during postnatal development. It takes advantage of genomic resources available for the opossum and a captive colony at the UNM Biology Department. To achieve these goals, the content and genomic organization of Ig heavy chain and light chain loci was determined. Opossum Ig heavy chain are of low germ-line diversity while light chains have high germ-line diversity. This suggests that opossums rely more on light chains than heavy chains for repertoire diversity (Wang et al. 2009). Using the detailed genomic information available of Ig loci, the timing of B cell ontogeny and Ig repertoire diversity was then determined. Opossum newborns start heavy chain VDJ recombination within the first 24 hours postpartum. The expression of the surrogate L chains occurs at day 6 postnatally. The subsequent rearrangement of the Ig\u03bb and Ig\u03ba L chain genes occur at days 7 and 8 postnatal, respectively. The diversity of early B cell H chains is limited and reduced in N region additions, as has been seen in fetal humans and mice, but lacks bias in the V, D and J segments used. Different from H chains, L chains develop much more diverse VJ recombinations and high IgL repertoire diversity when first expressed. Collectively the results demonstrate that B cell development is entirely postnatal in the opossum. The earliest time-point that an opossum has mature B cells is at the starting the second week of life (Wang et al. in preparation). These results are consistent with earlier work demonstrating that most marsupial species, including opossums are unable to generate an antibody response until the second week. A second goal of my research was the characterization of a novel T cell receptor in the duckbill platypus Ornithorhynchus anatinus. Although different from the previous goal, it nonetheless uses a non-eutherian model to address broader questions regarding immunoglobulin and T cell receptor gene evolution. TCRμ is a new T cell receptor that was first identified in marsupials and does not exist in eutherians (Parra et al. 2007). Homology searches of the platypus genome with opossum TCRμ sequence have identified a homologue of this unconventional TCRμ in platypus. Platypus TCRμ is expressed in a double V domains structure and these resemble Ig V more than conventional TCR V domains. Different from opossum TCRμ, platypus TCRμ requires two rounds of somatic recombination to assemble both V domains. The identification of TCRμ in platypus indicates that TCRμ as an ancient T cell receptor has been lost in eutherians

    Systematic Identification of Balanced Transposition Polymorphisms in Saccharomyces cerevisiae

    Get PDF
    High-throughput techniques for detecting DNA polymorphisms generally do not identify changes in which the genomic position of a sequence, but not its copy number, varies among individuals. To explore such balanced structural polymorphisms, we used array-based Comparative Genomic Hybridization (aCGH) to conduct a genome-wide screen for single-copy genomic segments that occupy different genomic positions in the standard laboratory strain of Saccharomyces cerevisiae (S90) and a polymorphic wild isolate (Y101) through analysis of six tetrads from a cross of these two strains. Paired-end high-throughput sequencing of Y101 validated four of the predicted rearrangements. The transposed segments contained one to four annotated genes each, yet crosses between S90 and Y101 yielded mostly viable tetrads. The longest segment comprised 13.5 kb near the telomere of chromosome XV in the S288C reference strain and Southern blotting confirmed its predicted location on chromosome IX in Y101. Interestingly, inter-locus crossover events between copies of this segment occurred at a detectable rate. The presence of low-copy repetitive sequences at the junctions of this segment suggests that it may have arisen through ectopic recombination. Our methodology and findings provide a starting point for exploring the origins, phenotypic consequences, and evolutionary fate of this largely unexplored form of genomic polymorphism
    corecore