    High-Throughput Sequencing of Three Lemnoideae (Duckweeds) Chloroplast Genomes from Total DNA

    BACKGROUND: Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaptation of aquatic plants because these plants float on water and their major surface is exposed continuously to sunlight. The subfamily Lemnoideae is such a group of aquatic species and, owing to its photosynthetic capacity, includes some of the fastest-growing plants on earth. METHODS: We sequenced the chloroplast genomes of species from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana, by high-throughput sequencing of genomic DNA on the SOLiD platform. Unfractionated total DNA contains plastid DNA in high copy number, so that sequences from the nucleus and mitochondria can easily be filtered out computationally. The remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor, and gaps, selected by PCR, were sequenced on the ABI3730xl platform. CONCLUSIONS: This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. More than 1,000-fold coverage of the chloroplast genome was reached from total DNA on the SOLiD platform, in a single spot on a quadrant slide and without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution rates and abundant deletions and insertions occurred in non-coding regions of these genomes, indicating greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Notably, there was no bias of transitions over transversions in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power

    Adventures in the Enormous: A 1.8 Million Clone BAC Library for the 21.7 Gb Genome of Loblolly Pine

    Loblolly pine (LP; Pinus taeda L.) is the most economically important tree in the U.S. and a cornerstone species in southeastern forests. However, genomics research on LP and other conifers has lagged behind studies on flowering plants due, in part, to the large size of conifer genomes. As a means to accelerate conifer genome research, we constructed a BAC library for the LP genotype 7-56. The LP BAC library consists of 1,824,768 individually-archived clones making it the largest single BAC library constructed to date, has a mean insert size of 96 kb, and affords 7.6X coverage of the 21.7 Gb LP genome. To demonstrate the efficacy of the library in gene isolation, we screened macroarrays with overgos designed from a pine EST anchored on LP chromosome 10. A positive BAC was sequenced and found to contain the expected full-length target gene, several gene-like regions, and both known and novel repeats. Macroarray analysis using the retrotransposon IFG-7 (the most abundant repeat in the sequenced BAC) as a probe indicates that IFG-7 is found in roughly 210,557 copies and constitutes about 5.8% or 1.26 Gb of LP nuclear DNA; this DNA quantity is eight times the Arabidopsis genome. In addition to its use in genome characterization and gene isolation as demonstrated herein, the BAC library should hasten whole genome sequencing of LP via next-generation sequencing strategies/technologies and facilitate improvement of trees through molecular breeding and genetic engineering. The library and associated products are distributed by the Clemson University Genomics Institute (www.genome.clemson.edu)
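
    The IFG-7 figures in the abstract above can be cross-checked with a few lines of arithmetic. This is only a sketch: the variable names are illustrative, 1 Gb is taken as 10^9 bp, and the per-copy length it derives is merely implied by the reported numbers, not a value stated in the abstract.

```python
# Figures as reported in the loblolly pine abstract.
GENOME_SIZE_GB = 21.7     # LP nuclear genome size, in Gb
IFG7_FRACTION = 0.058     # IFG-7 share of nuclear DNA (about 5.8%)
IFG7_COPIES = 210_557     # estimated IFG-7 copy number

# Amount of nuclear DNA attributed to IFG-7, in Gb.
ifg7_gb = GENOME_SIZE_GB * IFG7_FRACTION

# Assumption: 1 Gb = 1e9 bp. The result is an implied average length
# per copy, derived here for illustration only.
implied_bp_per_copy = ifg7_gb * 1e9 / IFG7_COPIES

print(f"IFG-7 DNA: {ifg7_gb:.2f} Gb")
print(f"Implied mean length per copy: {implied_bp_per_copy:.0f} bp")
```

    The product agrees with the roughly 1.26 Gb quoted in the abstract once rounded to two decimals.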

    Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    BACKGROUND: Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and complement these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. RESULTS: A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. CONCLUSIONS: The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first step in the development of a community resource for further study of plant-insect co-evolution, anti-herbivore defense, floral developmental genetics, reproductive biology, chemical evolution, population genetics, and comparative genomics using milkweeds, and A. syriaca in particular, as ecological and evolutionary models

    The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.)

    BACKGROUND: Date palm (Phoenix dactylifera L.), a member of the Arecaceae family, is one of the three major economically important woody palms--the two other palms being oil palm and coconut tree--and its fruit is a staple food among Middle Eastern and North African nations, as well as in many other tropical and subtropical regions. Here we report a complete sequence of the date palm chloroplast (cp) genome based on pyrosequencing. METHODOLOGY/PRINCIPAL FINDINGS: After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes--atpF, trnA-UGC, and rrn23. CONCLUSIONS: Unlike most monocots, date palm has a typical cp genome similar to that of tobacco--with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts
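
    The region lengths reported in the date palm abstract above can be checked against the stated genome length. The sketch below assumes that 27,276 bp is the length of each inverted repeat rather than of the pair combined; the function name is illustrative and not taken from the paper.

```python
# Region lengths (bp) as reported in the date palm abstract.
LSC = 86_198             # large single-copy region
SSC = 17_712             # small single-copy region
IR = 27_276              # assumed: length of ONE inverted repeat copy
REPORTED_TOTAL = 158_462 # genome length stated in the abstract


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Length of a circular genome made of LSC, SSC and two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total, total == REPORTED_TOTAL)
```

    Under that assumption the four regions sum exactly to the reported 158,462 bp, which supports reading the IR figure as a per-copy length.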

    Contrasting patterns of the 5S and 45S rDNA evolutions in the Byblis liniflora complex (Byblidaceae)

    To clarify the evolutionary dynamics of ribosomal RNA genes (rDNAs) in the Byblis liniflora complex (Byblidaceae), we investigated the 5S and 45S rDNA genes through (1) chromosomal physical mapping by fluorescence in situ hybridization (FISH) and (2) phylogenetic analyses using the nontranscribed spacer of 5S rDNA (5S-NTS) and the internal transcribed spacer of 45S rDNA (ITS). In addition, we performed phylogenetic analyses based on rbcL and trnK intron. The complex was divided into 2 clades: B. aquatica–B. filifolia and B. guehoi–B. liniflora–B. rorida. Although members of the complex had conservative symmetric karyotypes, they were clearly differentiated on chromosomal rDNA distribution patterns. The sequence data indicated that ITS was almost homogeneous in all taxa in which two or four 45S rDNA arrays were frequently found at distal regions of chromosomes in the somatic karyotype. ITS homogenization could have been prompted by relatively distal 45S rDNA positions. In contrast, 2–12 5S rDNA arrays were mapped onto proximal/interstitial regions of chromosomes, and some paralogous 5S-NTS were found in the genomes harboring 4 or more arrays. 5S-NTS sequence type-specific FISH analysis showed sequence heterogeneity within and between some 5S rDNA arrays. Interlocus homogenization may have been hampered by their proximal location on chromosomes. Chromosomal location may have affected the contrasting evolutionary dynamics of rDNAs in the B. liniflora complex

    Five Nuclear Loci Resolve the Polyploid History of Switchgrass (Panicum virgatum L.) and Relatives

    Polyploidy poses challenges for phylogenetic reconstruction because of the need to identify and distinguish between homoeologous loci. This can be addressed by use of low copy nuclear markers. Panicum s.s. is a genus of about 100 species in the grass tribe Paniceae, subfamily Panicoideae, and is divided into five sections. Many of the species are known to be polyploids. The most well-known of the Panicum polyploids are switchgrass (Panicum virgatum) and common or Proso millet (P. miliaceum). Switchgrass is in section Virgata, along with P. tricholaenoides, P. amarum, and P. amarulum, whereas P. miliaceum is in sect. Panicum. We have generated sequence data from five low copy nuclear loci and two chloroplast loci and have clarified the origin of P. virgatum. We find that all members of sects. Virgata and Urvilleana are the result of diversification after a single allopolyploidy event. The closest diploid relatives of switchgrass are in sect. Rudgeana, native to Central and South America. Within sections Virgata and Urvilleana, P. tricholaenoides is sister to the remaining species. Panicum racemosum and P. urvilleanum form a clade, which may be sister to P. chloroleucum. Panicum amarum, P. amarulum, and the lowland and upland ecotypes of P. virgatum together form a clade, within which relationships are complex. Hexaploid and octoploid plants are likely allopolyploids, with P. amarum and P. amarulum sharing genomes with P. virgatum. Octoploid P. virgatum plants are formed via hybridization between disparate tetraploids. We show that polyploidy precedes diversification in a complex set of polyploids; our data thus suggest that polyploidy could provide the raw material for diversification. In addition, we show two rounds of allopolyploidization in the ancestry of switchgrass, and identify additional species that may be part of its broader gene pool. This may be relevant for development of the crop for biofuels

    CenH3 evolution in diploids and polyploids of three angiosperm genera

    BACKGROUND: Centromeric DNA sequences alone are neither necessary nor sufficient for centromere specification. The centromere-specific histone, CenH3, evolves rapidly in many species, perhaps as a coevolutionary response to rapidly evolving centromeric DNA. To gain insight into CenH3 evolution, we characterized patterns of nucleotide and protein diversity among diploids and allopolyploids within three diverse angiosperm genera, Brassica, Oryza, and Gossypium (cotton), with a focus on evidence for diversifying selection in the various domains of the CenH3 gene. In addition, we compare expression profiles and alternative splicing patterns for CenH3 in representatives of each genus. RESULTS: All three genera retain both duplicated CenH3 copies, while Brassica and Gossypium exhibit pronounced homoeologous expression level bias. Comparisons among genera reveal shared and unique aspects of CenH3 evolution, variable levels of diversifying selection in different CenH3 domains, and that alternative splicing contributes significantly to CenH3 diversity. CONCLUSIONS: Since the N terminus is subject to diversifying selection but the DNA binding domains do not appear to be, rapidly evolving centromere sequences are unlikely to be the primary driver of CenH3 sequence diversification. At present, the functional significance of the diversity generated by both conventional protein evolution in the N terminal domain and alternative splicing remains unexplained. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-014-0383-3) contains supplementary material, which is available to authorized users