452 research outputs found

    Pervasive Phylogenomic Incongruence Underlies Evolutionary Relationships in Eyebrights (Euphrasia, Orobanchaceae)

    Get PDF
    Disentangling the phylogenetic relationships of taxonomically complex plant groups is often mired by challenges associated with recent speciation, hybridization, complex mating systems, and polyploidy. Here, we perform a phylogenomic analysis of eyebrights (Euphrasia), a group renowned for taxonomic complexity, with the aim of documenting the extent of phylogenetic discordance at both deep and at shallow phylogenetic scales. We generate whole-genome sequencing data and integrate this with prior genomic data to perform a comprehensive analysis of nuclear genomic, nuclear ribosomal (nrDNA), and complete plastid genomes from 57 individuals representing 36 Euphrasia species. The species tree analysis of 3,454 conserved nuclear scaffolds (46 Mb) reveals that at shallow phylogenetic scales postglacial colonization of North Western Europe occurred in multiple waves from discrete source populations, with most species not being monophyletic, and instead combining genomic variants from across clades. At a deeper phylogenetic scale, the Euphrasia phylogeny is structured by geography and ploidy, and partially by taxonomy. Comparative analyses show Southern Hemisphere tetraploids include a distinct subgenome indicative of independent polyploidy events from Northern Hemisphere taxa. In contrast to the nuclear genome analyses, the plastid genome phylogeny reveals limited geographic structure, while the nrDNA phylogeny is informative of some geographic and taxonomic affinities but more thorough phylogenetic inference is impeded by the retention of ancestral polymorphisms in the polyploids. Overall our results reveal extensive phylogenetic discordance at both deeper and shallower nodes, with broad-scale geographic structure of genomic variation but a lack of definitive taxonomic signal. This suggests that Euphrasia species either have polytopic origins or are maintained by narrow genomic regions in the face of extensive homogenizing gene flow. Moreover, these results suggest genome skimming will not be an effective extended barcode to identify species in groups such as Euphrasia, or many other postglacial species groups

    The Selaginella Genome Identifies Changes in Gene Content Associated With the Evolution of Vascular Plants

    Get PDF
    Vascular plants appeared ~410 million years ago, then diverged into several lineages of which only two survive: the euphyllophytes (ferns and seed plants) and the lycophytes. We report here the genome sequence of the lycophyte Selaginella moellendorffii (Selaginella), the first nonseed vascular plant genome reported. By comparing gene content in evolutionarily diverse taxa, we found that the transition from a gametophyte- to a sporophyte-dominated life cycle required far fewer new genes than the transition from a nonseed vascular to a flowering plant, whereas secondary metabolic genes expanded extensively and in parallel in the lycophyte and angiosperm lineages. Selaginella differs in posttranscriptional gene regulation, including small RNA regulation of repetitive elements, an absence of the trans-acting small interfering RNA pathway, and extensive RNA editing of organellar genes

    Comparison of next generation sequencing technologies for transcriptome characterization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the <it>Arabidopsis </it>genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and <it>de novo </it>assemblies for the basal eudicot California poppy (<it>Eschscholzia californica</it>) and the magnoliid avocado (<it>Persea americana</it>) using a variety of methods for cDNA synthesis.</p> <p>Results</p> <p>The <it>Arabidopsis </it>reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The <it>Arabidopsis </it>data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc <url>http://fgp.huck.psu.edu/NG_Sims/ngsim.pl</url>, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics.</p> <p>Conclusion</p> <p>NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms.</p

    Methods for Obtaining and Analyzing Whole Chloroplast Genome Sequences

    Get PDF
    During the past decade there has been a rapid increase in our understanding of plastid genome organization and evolution due to the availability of many new completely sequenced genomes. Currently there are 43 complete genomes published and ongoing projects are likely to increase this sampling to nearly 200 genomes during the next five years. Several groups of researchers including ours have been developing new techniques for gathering and analyzing entire plastid genome sequences and details of these developments are summarized in this chapter. The most important recent developments that enhance our ability to generate whole chloroplast genome sequences involve the generation of pure fractions of chloroplast genomes by whole genome amplification using rolling circular amplification, cloning genomes into Fosmid or BAC vectors, and the development of an organellar annotation program (DOGMA). In addition to providing details of these methods, we provide an overview of methods for analyzing complete plastid genome sequences for repeats and gene content, as well as approaches for using gene order and sequence data for phylogeny reconstruction. This explosive increase in the number of sequenced plastid genomes and improved computational tools will provide many insights into the evolution of these genomes and much new data for assessing relationships at deep nodes in plants and other photosynthetic organisms

    Analysis of 81 Genes From 64 Plastid Genomes Resolves Relationships in Angiosperms and Identifies Genome-Scale Evolutionary Patterns

    Get PDF
    Angiosperms are the largest and most successful clade of land plants with \u3e250,000 species distributed in nearly every terrestrial habitat. Many phylogenetic studies have been based on DNA sequences of one to several genes, but, despite decades of intensive efforts, relationships among early diverging lineages and several of the major clades remain either incompletely resolved or weakly supported. We performed phylogenetic analyses of 81 plastid genes in 64 sequenced genomes, including 13 new genomes, to estimate relationships among the major angiosperm clades, and the resulting trees are used to examine the evolution of gene and intron content. Phylogenetic trees from multiple methods, including model-based approaches, provide strong support for the position of Amborella as the earliest diverging lineage of flowering plants, followed by Nymphaeales and Austrobaileyales. The plastid genome trees also provide strong support for a sister relationship between eudicots and monocots, and this group is sister to a clade that includes Chloranthales and magnoliids. Resolution of relationships among the major clades of angiosperms provides the necessary framework for addressing numerous evolutionary questions regarding the rapid diversification of angiosperms. Gene and intron content are highly conserved among the early diverging angiosperms and basal eudicots, but 62 independent gene and intron losses are limited to the more derived monocot and eudicot clades. Moreover, a lineage-specific correlation was detected between rates of nucleotide substitutions, indels, and genomic rearrangements

    Deficiency in origin licensing proteins impairs cilia formation: implications for the aetiology of meier-gorlin syndrome

    Get PDF
    Mutations in ORC1, ORC4, ORC6, CDT1, and CDC6, which encode proteins required for DNA replication origin licensing, cause Meier-Gorlin syndrome (MGS), a disorder conferring microcephaly, primordial dwarfism, underdeveloped ears, and skeletal abnormalities. Mutations in ATR, which also functions during replication, can cause Seckel syndrome, a clinically related disorder. These findings suggest that impaired DNA replication could underlie the developmental defects characteristic of these disorders. Here, we show that although origin licensing capacity is impaired in all patient cells with mutations in origin licensing component proteins, this does not correlate with the rate of progression through S phase. Thus, the replicative capacity in MGS patient cells does not correlate with clinical manifestation. However, ORC1-deficient cells from MGS patients and siRNA-mediated depletion of origin licensing proteins also have impaired centrosome and centriole copy number. As a novel and unexpected finding, we show that they also display a striking defect in the rate of formation of primary cilia. We demonstrate that this impacts sonic hedgehog signalling in ORC1-deficient primary fibroblasts. Additionally, reduced growth factor-dependent signaling via primary cilia affects the kinetics of cell cycle progression following cell cycle exit and re-entry, highlighting an unexpected mechanism whereby origin licensing components can influence cell cycle progression. Finally, using a cell-based model, we show that defects in cilia function impair chondroinduction. Our findings raise the possibility that a reduced efficiency in forming cilia could contribute to the clinical features of MGS, particularly the bone development abnormalities, and could provide a new dimension for considering developmental impacts of licensing deficiency

    Application of qRT-PCR and RNA-Seq analysis for the identification of housekeeping genes useful for normalization of gene expression values during Striga hermonthica development.

    Get PDF
    Abstract Striga is a root parasitic weed that attacks many of the staple crops in Africa, India and Southeast Asia, inflicting tremendous losses in yield and for which there are few effective control measures. Studies of parasitic plant virulence and host resistance will be greatly facilitated by the recent emergence of genomic resources that include extensive transcriptome sequence datasets spanning all life stages of S. hermonthica. Functional characterization of Striga genes will require detailed analyses of gene expression patterns. Quantitative real-time PCR is a powerful tool for quantifying gene expression, but correct normalization of expression levels requires identification of control genes that have stable expression across tissues and life stages. Since no S. hermonthica housekeeping genes have been established for this purpose, we evaluated the suitability of six candidate housekeeping genes across key life stages of S. hermonthica from seed conditioning to flower initiation using qRT-PCR and high-throughput cDNA sequencing. Based on gene expression analysis by qRT-PCR and RNA-Seq across heterogeneous Striga life stages, we determined that using the combination of three genes, UBQ1, PP2A and TUB1 provides the best normalization for gene expression throughout the parasitic life cycle. The housekeeping genes characterized here provide robust standards that will facilitate powerful descriptions of parasite gene expression patterns

    DNA replication and the GINS complex: localization on extended chromatin fibers

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The GINS complex is thought to be essential for the processes of initiation and elongation of DNA replication. This complex contains four subunits, one of which (Psf1) is proposed to bind to both chromatin and DNA replication-associated proteins. To date there have been no microscopic analyses to evaluate the chromatin distribution of this complex. Here, we show the organization of GINS complexes on extended chromatin fibers in relation to sites of DNA replication and replication-associated proteins.</p> <p>Results</p> <p>Using immunofluorescence microscopy we were able to visualize ORC1, ORC2, PCNA, and GINS complex proteins Psf1 and Psf2 bound to extended chromatin fibers. We were also able to detect these proteins concurrently with the visualization of tracks of recently replicated DNA where EdU, a thymidine analog, was incorporated. This allowed us to assess the chromatin association of proteins of interest in relation to the process of DNA replication. ORC and GINS proteins were found on chromatin fibers before replication could be detected. These proteins were also associated with newly replicated DNA in bead-like structures. Additionally, GINS proteins co-localized with PCNA at sites of active replication.</p> <p>Conclusion</p> <p>In agreement with its proposed role in the initiation of DNA replication, GINS proteins associated with chromatin near sites of ORC binding that were devoid of EdU (absence of DNA replication). The association of GINS proteins with PCNA was consistent with a role in the process of elongation. Additionally, the large size of our chromatin fibers (up to approximately 7 Mb) allowed for a more expansive analysis of the distance between active replicons than previously reported.</p
    • …
    corecore