57 research outputs found

    Uncovering the evolutionary origin of plant molecular processes: comparison of Coleochaete (Coleochaetales) and Spirogyra (Zygnematales) transcriptomes

    Background: The large and diverse land plant lineage is nested within a clade of freshwater green algae, the charophytes. Collection of genome-scale data for land plants and other organisms over the past decade has invigorated the field of evolutionary biology. One of the core questions in the field asks: how did a colonization event by a green alga over 450 mya lead to one of the most successful lineages on the tree of life? This question can best be answered using the comparative method, the first step of which is to gather genome-scale data across lineages closely related to land plants. Before sequencing an entire genome it is useful to first gather transcriptome data: it is less expensive, it targets the protein-coding regions of the genome, and it provides support for gene models for future genome sequencing. We built Expressed Sequence Tag (EST) libraries for two charophyte species, Coleochaete orbicularis (Coleochaetales) and Spirogyra pratensis (Zygnematales). We used both Sanger sequencing and next-generation 454 sequencing to cover as much of the transcriptome as possible. Results: Our sequencing effort for Spirogyra pratensis yielded 9,984 5' Sanger reads plus 598,460 GS FLX Standard 454 sequences; Coleochaete orbicularis yielded 4,992 5' Sanger reads plus 673,811 GS FLX Titanium 454 sequences. After clustering, S. pratensis yielded 12,000 unique transcripts, or unigenes, and C. orbicularis yielded 19,000. Both transcriptomes were very plant-like, i.e., most of the transcripts were more similar to streptophytes (land plants + charophyte green algae) than to other green algae in the sister group, the chlorophytes. BLAST searches with several land plant genes hypothesized to be important in early land plant evolution returned high-quality hits in both transcriptomes, revealing putative orthologs ripe for follow-up studies. Conclusions: Two main conclusions were drawn from this study. The first illustrates the utility of next-generation sequencing for transcriptome studies: larger-scale data collection at a lower cost enabled us to cover a considerable portion of the transcriptome for both species. The second is that the charophyte green algal transcriptomes are remarkably plant-like, which gives them the unique capacity to be major players in future evolutionary genomic studies addressing the origin of land plants. https://doi.org/10.1186/1471-2229-10-9

    Conserved and diversified gene families of monovalent cation/H+ antiporters from algae to flowering plants

    All organisms have evolved strategies to regulate ion and pH homeostasis in response to developmental and environmental cues. One strategy is mediated by monovalent cation–proton antiporters (CPA) that are classified in two superfamilies. Many CPA1 genes from bacteria, fungi, metazoa, and plants have been functionally characterized; however, the roles of plant CPA2 genes encoding the K+-efflux antiporter (KEA) and cation/H+ exchanger (CHX) families are largely unknown. Phylogenetic analysis showed that three clades of the CPA1 Na+–H+ exchanger (NHX) family have been conserved from single-celled algae to Arabidopsis. These are (i) plasma membrane-bound SOS1/AtNHX7 that share ancestry with prokaryote NhaP, (ii) endosomal AtNHX5/6 that is part of the eukaryote Intracellular-NHE clade, and (iii) a vacuolar NHX clade (AtNHX1–4) specific to plants. Early diversification of KEA genes, possibly from an ancestral cyanobacterium gene, is suggested by three types seen in all plants. Intriguingly, CHX genes diversified from three to four members in one subclade of early land plants to 28 genes in eight subclades of Arabidopsis. Homologs from Spirogyra or Physcomitrella share high similarity with AtCHX20, suggesting that guard cell-specific AtCHX20 and its closest relatives are founders of the family, and pollen-expressed CHX genes appeared later in monocots and early eudicots. AtCHX proteins mediate K+ transport and pH homeostasis, and have been localized to intracellular and plasma membranes. Thus KEA genes are conserved from green algae to angiosperms, and their presence in red algae and secondary endosymbionts suggests a role in plastids. In contrast, the AtNHX1–4 subtype evolved in plant cells to handle ion homeostasis of vacuoles. The great diversity of CHX genes in land plants compared to metazoa, fungi, or algae would imply a significant role of ion and pH homeostasis at dynamic endomembranes in the vegetative and reproductive success of flowering plants.
This work was supported in part by National Science Foundation Grant IBN0209788 and US Department of Energy Grant BES DEFG0207ER15883 to Heven Sze, grant BIO2008-01691 from Spanish Plan Nacional I+D+I to Kees Venema, and a Royal Thai Government Graduate Fellowship to Salil Chanroj. Work of CFD was supported by NSF grants #MCB-0523719 and DEB1036506. Peer reviewed

    Evaluation of BLAST-based edge-weighting metrics used for homology inference with the Markov Clustering algorithm

    Clustering protein sequences according to inferred homology is a fundamental step in the analysis of many large data sets. Since the publication of the Markov Clustering (MCL) algorithm in 2002, it has been the centerpiece of several popular applications. Each of these approaches generates an undirected graph that represents sequences as nodes connected to each other by edges weighted with a BLAST-based metric. MCL is then used to infer clusters of homologous proteins by analyzing these graphs. The various approaches differ only by how they weight the edges, yet there has been very little direct examination of the relative performance of alternative edge-weighting metrics. This study compares the performance of four BLAST-based edge-weighting metrics: the bit score, bit score ratio (BSR), bit score over anchored length (BAL), and negative common log of the expectation value (NLE). Performance is tested using the Extended CEGMA KOGs (ECK) database, which we introduce here. All metrics performed similarly when analyzing full-length sequences, but dramatic differences emerged as progressively larger fractions of the test sequences were split into fragments. The BSR and BAL successfully rescued subsets of clusters by strengthening certain types of alignments between fragmented sequences, but also shifted the largest correct scores down near the range of scores generated from spurious alignments. This penalty outweighed the benefits in most test cases, and was greatly exacerbated by increasing the MCL inflation parameter, making these metrics less robust than the bit score or the more popular NLE. Notably, the bit score performed as well or better than the other three metrics in all scenarios. The results provide a strong case for use of the bit score, which appears to offer equivalent or superior performance to the more popular NLE. 
The insight that MCL-based clustering methods can be improved using a more tractable edge-weighting metric will greatly simplify future implementations. We demonstrate this with our own minimalist Python implementation, Porthos, which uses only standard libraries and can process a graph with 25 M+ edges connecting the 60 k+ KOG sequences in half a minute using less than half a gigabyte of memory. https://doi.org/10.1186/s12859-015-0625-x https://doi.org/10.1186/s12859-015-0690-
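
    As an illustration of the edge-weighting step described in the abstract above, the following is a minimal sketch and not the Porthos code. It turns BLAST hits into an undirected weighted graph using three of the four metrics. The function names, the tuple layout of a hit, the E-value cap of 200, and the choice to normalize the BSR by the smaller of the two self-hit bit scores are all assumptions made for this example; the abstract does not define them. BAL is omitted because the abstract does not give its formula.

```python
import math


def nle(evalue, cap=200.0):
    """Negative common log of the E-value, clamped to [0, cap].

    The cap for E-values reported as 0 is an assumption of this sketch.
    """
    if evalue <= 0.0:
        return cap
    return max(0.0, min(cap, -math.log10(evalue)))


def bit_score_ratio(bits, self_a, self_b):
    """Bit score divided by the smaller self-hit bit score (assumed definition)."""
    denom = min(self_a, self_b)
    return bits / denom if denom > 0 else 0.0


def weight_edges(hits, self_scores, metric="bit"):
    """Return {(a, b): weight} for an undirected graph, with a < b.

    hits: iterable of (query, target, bit_score, evalue) tuples (hypothetical layout).
    self_scores: dict mapping each sequence id to its self-hit bit score.
    When both directions of a pair are present, the larger weight is kept.
    """
    edges = {}
    for query, target, bits, evalue in hits:
        if query == target:
            continue  # self-hits are not edges
        if metric == "bit":
            weight = bits
        elif metric == "bsr":
            weight = bit_score_ratio(bits, self_scores[query], self_scores[target])
        elif metric == "nle":
            weight = nle(evalue)
        else:
            raise ValueError("unknown metric: " + metric)
        key = tuple(sorted((query, target)))
        edges[key] = max(edges.get(key, 0.0), weight)
    return edges
```

    The resulting dictionary could then be written out as a three-column edge list for a clustering tool. Keeping the metric behind a single switch makes it easy to compare weightings on the same set of hits.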

    Microbial Diversity in the Eukaryotic SAR Clade: Illuminating the Darkness Between Morphology and Molecular Data

    Despite their diversity and ecological importance, many areas of the SAR (Stramenopila, Alveolata, and Rhizaria) clade are poorly understood, as the majority (90%) of SAR species lack molecular data and only 5% of species are from well-sampled families. Here, we review and summarize the state of knowledge about the three major clades of SAR, describing the diversity within each clade and identifying synapomorphies when possible. We also assess the “dark area” of SAR: the morphologically described species that are missing molecular data. The majority of molecular data for SAR lineages are characterized from marine samples and vertebrate hosts, highlighting the need for additional research effort in areas such as freshwater and terrestrial habitats and “non-vertebrate” hosts. We also describe the paucity of data on the biogeography of SAR species, and point to opportunities to illuminate diversity in this major eukaryotic clade.

    Directional auxin transport mechanisms in early diverging land plants

    The emergence and radiation of multicellular land plants was driven by crucial innovations to their body plans [1]. The directional transport of the phytohormone auxin represents a key, plant-specific mechanism for polarization and patterning in complex seed plants [2, 3, 4 and 5]. Here, we show that already in the early diverging land plant lineage, as exemplified by the moss Physcomitrella patens, auxin transport by PIN transporters is operational and diversified into ER-localized and plasma membrane-localized PIN proteins. Gain-of-function and loss-of-function analyses revealed that PIN-dependent intercellular auxin transport in Physcomitrella mediates crucial developmental transitions in tip-growing filaments and waves of polarization and differentiation in leaf-like structures. Plasma membrane PIN proteins localize in a polar manner to the tips of moss filaments, revealing an unexpected relation between polarization mechanisms in moss tip-growing cells and multicellular tissues of seed plants. Our results trace the origins of polarization and auxin-mediated patterning mechanisms and highlight the crucial role of polarized auxin transport during the evolution of multicellular land plants

    New phylogenetic hypotheses for the core Chlorophyta based on chloroplast sequence data

    Phylogenetic relationships in the green algal phylum Chlorophyta have long been subject to debate, especially at higher taxonomic ranks (order, class). The relationships among three traditionally defined and well-studied classes, Chlorophyceae, Trebouxiophyceae, and Ulvophyceae are of particular interest, as these groups are species-rich and ecologically important worldwide. Different phylogenetic hypotheses have been proposed over the past two decades and the monophyly of the individual classes has been disputed on occasion. Our study seeks to test these hypotheses by combining high throughput sequencing data from the chloroplast genome with increased taxon sampling. Our results suggest that while many of the deep relationships are still problematic to resolve, the classes Trebouxiophyceae and Ulvophyceae are likely not monophyletic as currently defined. Our results also support relationships among several trebouxiophycean taxa that were previously unresolved. Finally, we propose that the common term for the grouping of the three classes, “UTC clade,” be replaced with the term “core Chlorophyta” for the well-supported clade containing Chlorophyceae, taxa belonging to Ulvophyceae and Trebouxiophyceae, and the classes Chlorodendrophyceae and Pedinophyceae

    Evolution of light-harvesting complex proteins from Chl c-containing algae

    Background: Light harvesting complex (LHC) proteins function in photosynthesis by binding chlorophyll (Chl) and carotenoid molecules that absorb light and transfer the energy to the reaction center Chl of the photosystem. Most research has focused on LHCs of plants and chlorophytes that bind Chl a and b, and extensive work on these proteins has uncovered a diversity of biochemical functions, expression patterns and amino acid sequences. We focus here on a less-studied family of LHCs that typically bind Chl a and c, and that are widely distributed in Chl c-containing and other algae. Previous phylogenetic analyses of these proteins suggested that individual algal lineages possess proteins from one or two subfamilies, and that most subfamilies are characteristic of a particular algal lineage, but genome-scale datasets had revealed that some species have multiple different forms of the gene. Such observations also suggested that there might have been an important influence of endosymbiosis in the evolution of LHCs. Results: We reconstruct a phylogeny of LHCs from Chl c-containing algae and related lineages using data from recent sequencing projects to give ~10-fold larger taxon sampling than previous studies. The phylogeny indicates that individual taxa possess proteins from multiple LHC subfamilies and that several LHC subfamilies are found in distantly related algal lineages. This phylogenetic pattern implies functional differentiation of the gene families, a hypothesis that is consistent with data on gene expression, carotenoid binding and physical associations with other LHCs. In all probability LHCs have undergone a complex history of evolution of function, gene transfer, and lineage-specific diversification. Conclusion: The analysis provides a strikingly different picture of LHC diversity than previous analyses of LHC evolution. Individual algal lineages possess proteins from multiple LHC subfamilies. Evolutionary relationships showed support for the hypothesized origin of Chl c plastids. This work also allows recent experimental findings about molecular function to be understood in a broader phylogenetic context.

    Broad Phylogenomic Sampling and the Sister Lineage of Land Plants

    The tremendous diversity of land plants all descended from a single charophyte green alga that colonized the land somewhere between 430 and 470 million years ago. Six orders of charophyte green algae, in addition to embryophytes, comprise the Streptophyta s.l. Previous studies have focused on reconstructing the phylogeny of organisms tied to this key colonization event, but wildly conflicting results have sparked a contentious debate over which lineage gave rise to land plants. The dominant view has been that ‘stoneworts,’ or Charales, are the sister lineage, but an alternative hypothesis supports the Zygnematales (often referred to as “pond scum”) as the sister lineage. In this paper, we provide a well-supported, 160-nuclear-gene phylogenomic analysis supporting the Zygnematales as the closest living relative to land plants. Our study makes two key contributions to the field: 1) the use of an unbiased method to collect a large set of orthologs from deeply diverging species and 2) the use of these data in determining the sister lineage to land plants. We anticipate this updated phylogeny not only will hugely impact lesson plans in introductory biology courses, but also will provide a solid phylogenetic tree for future green-lineage research, whether it be related to plants or green algae