172 research outputs found

    Transcriptome sequencing reveals genome-wide variation in molecular evolutionary rate among ferns

    Get PDF
    Orthogroup data file. Zipped folder containing fasta-formatted reads identified by ProteinOrtho, used for all downstream orthogroup determination and analysis, along with readme document and relevant project-specific scripts (also available online via Dryad: http://dx.doi.org/10.5061/dryad.rg22j ). (ZIP 12021 kb

    Evolutionary Analysis of the LAFL Genes Involved in the Land Plant Seed Maturation Program

    Get PDF
    Seeds are one of the most significant innovations in the land plant lineage, critical to the diversification and adaptation of plants to terrestrial environments. From perspective of seed evo-devo, the most crucial developmental stage in this innovation is seed maturation, which includes accumulation of storage reserves, acquisition of desiccation tolerance, and induction of dormancy. Based on previous studies of seed development in the model plant Arabidopsis thaliana, seed maturation is mainly controlled by the LAFL regulatory network, which includes LEAFY COTYLEDON1 (LEC1) and LEC1-LIKE (L1L) of the NF-YB gene family, and ABSCISIC ACID INSENSITIVE3 (ABI3), FUSCA3 (FUS3), and LEC2 (LEAFY COTYLEDON2) of the B3-AFL gene family. In the present study, molecular evolution of these LAFL genes was analyzed, using representative species from across the major plant lineages. Additionally, to elucidate the molecular mechanisms of the seed maturation program, co-expression pattern analyses of LAFL genes were conducted across vascular plants. The results show that the origin of AFL gene family dates back to a common ancestor of bryophytes and vascular plants, while LEC1-type genes are only found in vascular plants. LAFL genes of vascular plants likely specify their co-expression in two different developmental phrases, spore and seed maturation, respectively, and expression patterns vary slightly across the major vascular plants lineages. All the information presented in this study will provide insights into the origin and diversification of seed plants.National Natural Science Foundation of China (NSFC) [91231105]SCI(E)ARTICLE

    rbcL and matK Earn Two Thumbs Up as the Core DNA Barcode for Ferns

    Get PDF
    BACKGROUND: DNA barcoding will revolutionize our understanding of fern ecology, most especially because the accurate identification of the independent but cryptic gametophyte phase of the fern's life history--an endeavor previously impossible--will finally be feasible. In this study, we assess the discriminatory power of the core plant DNA barcode (rbcL and matK), as well as alternatively proposed fern barcodes (trnH-psbA and trnL-F), across all major fern lineages. We also present plastid barcode data for two genera in the hyperdiverse polypod clade--Deparia (Woodsiaceae) and the Cheilanthes marginata group (currently being segregated as a new genus of Pteridaceae)--to further evaluate the resolving power of these loci. PRINCIPAL FINDINGS: Our results clearly demonstrate the value of matK data, previously unavailable in ferns because of difficulties in amplification due to a major rearrangement of the plastid genome. With its high sequence variation, matK complements rbcL to provide a two-locus barcode with strong resolving power. With sequence variation comparable to matK, trnL-F appears to be a suitable alternative barcode region in ferns, and perhaps should be added to the core barcode region if universal primer development for matK fails. In contrast, trnH-psbA shows dramatically reduced sequence variation for the majority of ferns. This is likely due to the translocation of this segment of the plastid genome into the inverted repeat regions, which are known to have a highly constrained substitution rate. CONCLUSIONS: Our study provides the first endorsement of the two-locus barcode (rbcL+matK) in ferns, and favors trnL-F over trnH-psbA as a potential back-up locus. Future work should focus on gathering more fern matK sequence data to facilitate universal primer development

    A Basic ddRADseq Two‐Enzyme Protocol Performs Well with Herbarium and Silica‐Dried Tissues across Four Genera

    Get PDF
    PREMISE: The ability to sequence genome-scale data from herbarium specimens would allow for the economical development of data sets with broad taxonomic and geographic sampling that would otherwise not be possible. Here, we evaluate the utility of a basic double-digest restriction site–associated DNA sequencing (ddRADseq) protocol using DNAs from four genera extracted from both silica-dried and herbarium tissue. METHODS: DNAs from Draba, Boechera, Solidago, and Ilex were processed with a ddRADseq protocol. The effects of DNA degradation, taxon, and specimen age were assessed. RESULTS: Although taxon, preservation method, and specimen age affected data recovery, large phylogenetically informative data sets were obtained from the majority of samples. DISCUSSION: These results suggest that herbarium samples can be incorporated into ddRADseq project designs, and that specimen age can be used as a rapid on-site guide for sample choice. The detailed protocol we provide will allow users to pursue herbariumbased ddRADseq projects that minimize the expenses associated with fieldwork and sample evaluation

    A Basic ddRADseq Two‐Enzyme Protocol Performs Well with Herbarium and Silica‐Dried Tissues across Four Genera

    Get PDF
    PREMISE: The ability to sequence genome-scale data from herbarium specimens would allow for the economical development of data sets with broad taxonomic and geographic sampling that would otherwise not be possible. Here, we evaluate the utility of a basic double-digest restriction site–associated DNA sequencing (ddRADseq) protocol using DNAs from four genera extracted from both silica-dried and herbarium tissue. METHODS: DNAs from Draba, Boechera, Solidago, and Ilex were processed with a ddRADseq protocol. The effects of DNA degradation, taxon, and specimen age were assessed. RESULTS: Although taxon, preservation method, and specimen age affected data recovery, large phylogenetically informative data sets were obtained from the majority of samples. DISCUSSION: These results suggest that herbarium samples can be incorporated into ddRADseq project designs, and that specimen age can be used as a rapid on-site guide for sample choice. The detailed protocol we provide will allow users to pursue herbariumbased ddRADseq projects that minimize the expenses associated with fieldwork and sample evaluation

    Target Sequence Capture of Nuclear-Encoded Genes for Phylogenetic Analysis in Ferns

    Get PDF
    Premise of the Study Until recently, most phylogenetic studies of ferns were based on chloroplast genes. Evolutionary inferences based on these data can be incomplete because the characters are from a single linkage group and are uniparentally inherited. These limitations are particularly acute in studies of hybridization, which is prevalent in ferns; fern hybrids are common and ferns are able to hybridize across highly diverged lineages, up to 60 million years since divergence in one documented case. However, it not yet clear what effect such hybridization has on fern evolution, in part due to a paucity of available biparentally inherited (nuclear‐encoded) markers. Methods We designed oligonucleotide baits to capture 25 targeted, low‐copy nuclear markers from a sample of 24 species spanning extant fern diversity. Results Most loci were successfully sequenced from most accessions. Although the baits were designed from exon (transcript) data, we successfully captured intron sequences that should be useful for more focused phylogenetic studies. We present phylogenetic analyses of the new target sequence capture data and integrate these into a previous transcript‐based data set. Discussion We make our bait sequences available to the community as a resource for further studies of fern phylogeny

    An Exploration into Fern Genome Space

    Get PDF
    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-­‐scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-­‐density coverage (~0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, rDNA, and simple repeats) and protein-­‐coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants

    Data access for the 1,000 Plants (1KP) project

    Get PDF
    © 2014 Matasci et al.; licensee BioMed Central Ltd. The 1,000 plants (1KP) project is an international multi-disciplinary consortium that has generated transcriptome data from over 1,000 plant species, with exemplars for all of the major lineages across the Viridiplantae (green plants) clade. Here, we describe how to access the data used in a phylogenomics analysis of the first 85 species, and how to visualize our gene and species trees. Users can develop computational pipelines to analyse these data, in conjunction with data of their own that they can upload. Computationally estimated protein-protein interactions and biochemical pathways can be visualized at another site. Finally, we comment on our future plans and how they fit within this scalable system for the dissemination, visualization, and analysis of large multi-species data sets
    corecore