38 research outputs found
Tackling Rapid Radiations With Targeted Sequencing
In phylogenetic studies across angiosperms, at various taxonomic levels, polytomies have persisted despite efforts to resolve them by increasing sampling of taxa and loci. The large amount of genomic data now available and statistical tools to analyze them provide unprecedented power for phylogenetic inference. Targeted sequencing has emerged as a strong tool for estimating species trees in the face of rapid radiations, lineage sorting, and introgression. Evolutionary relationships in Cyperaceae have been studied mostly using Sanger sequencing until recently. Despite ample taxon sampling, relationships in many genera remain poorly understood, hampered by diversification rates that outpace mutation rates in the loci used. The C4 Cyperus clade of the genus Cyperus has been particularly difficult to resolve. Previous studies based on a limited set of markers resolved relationships among Cyperus species using the C3 photosynthetic pathway, but not among C4 Cyperus clade taxa. We test the ability of two targeted sequencing kits to resolve relationships in the C4 Cyperus clade, the universal Angiosperms-353 kit and a Cyperaceae-specific kit. Sequences of the targeted loci were recovered from data generated with both kits and used to investigate overlap in data between kits and relative efficiency of the general and custom approaches. The power to resolve shallow-level relationships was tested using a summary species tree method and a concatenated maximum likelihood approach. High resolution and support are obtained using both approaches, but high levels of missing data disproportionately impact the latter. Targeted sequencing provides new insights into the evolution of morphology in the C4 Cyperus clade, demonstrating for example that the former segregate genus Alinula is polyphyletic despite its seeming morphological integrity. An unexpected result is that the Cyperus margaritaceus-Cyperus niveus complex comprises a clade separate from and sister to the core C4 Cyperus clade. Our results demonstrate that data generated with a family-specific kit do not necessarily have more power than those obtained with a universal kit, but that data generated with different targeted sequencing kits can often be merged for downstream analyses. Moreover, our study contributes to the growing consensus that targeted sequencing data are a powerful tool in resolving rapid radiationsEspaña Ministry of Economy and Competitiveness (project CGL2016- 77401-P
Phylogenomics of the pantropical Connaraceae: revised infrafamilial classification and the evolution of heterostyly
Connaraceae is a pantropical family of about 200 species containing lianas and small trees with remarkably diverse floral polymorphisms, including distyly, tristyly, homostyly, and dioecy. To date, relationships within the family have not been investigated using a targeted molecular phylogenetic treatment, severely limiting systematic understanding and reconstruction of trait evolution. Accordingly, their last infrafamilial classification was based only on morphological data. Here, we used phylogenomic data obtained using the Angiosperms353 nuclear target sequence capture probes, sampling all tribes and almost all genera, entirely from herbarium specimens, to revise infrafamilial classification and investigate the evolution of heterostyly. The backbone of the resulting molecular phylogenetic tree is almost entirely resolved. Connaraceae consists of two clades, one containing only the African genus Manotes (4 or 5 species), which we newly recognize at the subfamily level. Vegetative and reproductive synapomorphies are proposed for Manotoideae. Within Connaroideae, Connareae is expanded to include the former Jollydoreae. The backbone of Cnestideae, which contains more than half of the Connaraceae species, remains incompletely resolved. Reconstructions of reproductive system evolution are presented that tentatively support tristyly as the ancestral state for the family, with multiple parallel losses, in agreement with previous hypotheses, plus possible re-gains. However, the great diversity of stylar polymorphisms and their phylogenetic lability preclude a definitive answer. Overall, this study reinforces the usefulness of herbarium phylogenomics, and unlocks the reproductive diversity of Connaraceae as a model system for the evolution of complex biological phenomena
Phylogenomics and the rise of the angiosperms
Angiosperms are the cornerstone of most terrestrial ecosystems and human livelihoods1,2. A robust understanding of angiosperm evolution is required to explain their rise to ecological dominance. So far, the angiosperm tree of life has been determined primarily by means of analyses of the plastid genome3,4. Many studies have drawn on this foundational work, such as classification and first insights into angiosperm diversification since their Mesozoic origins5,6,7. However, the limited and biased sampling of both taxa and genomes undermines confidence in the tree and its implications. Here, we build the tree of life for almost 8,000 (about 60%) angiosperm genera using a standardized set of 353 nuclear genes8. This 15-fold increase in genus-level sampling relative to comparable nuclear studies9 provides a critical test of earlier results and brings notable change to key groups, especially in rosids, while substantiating many previously predicted relationships. Scaling this tree to time using 200 fossils, we discovered that early angiosperm evolution was characterized by high gene tree conflict and explosive diversification, giving rise to more than 80% of extant angiosperm orders. Steady diversification ensued through the remaining Mesozoic Era until rates resurged in the Cenozoic Era, concurrent with decreasing global temperatures and tightly linked with gene tree conflict. Taken together, our extensive sampling combined with advanced phylogenomic methods shows the deep history and full complexity in the evolution of a megadiverse clade
Tackling rapid radiations with targeted sequencing
In phylogenetic studies across angiosperms, at various taxonomic levels, polytomies have persisted despite efforts to resolve them by increasing sampling of taxa and loci. The large amount of genomic data now available and statistical tools to analyze them provide unprecedented power for phylogenetic inference. Targeted sequencing has emerged as a strong tool for estimating species trees in the face of rapid radiations, lineage sorting, and introgression. Evolutionary relationships in Cyperaceae have been studied mostly using Sanger sequencing until recently. Despite ample taxon sampling, relationships in many genera remain poorly understood, hampered by diversification rates that outpace mutation rates in the loci used. The C4 Cyperus clade of the genus Cyperus has been particularly difficult to resolve. Previous studies based on a limited set of markers resolved relationships among Cyperus species using the C3 photosynthetic pathway, but not among C4 Cyperus clade taxa. We test the ability of two targeted sequencing kits to resolve relationships in the C4 Cyperus clade, the universal Angiosperms-353 kit and a Cyperaceae-specific kit. Sequences of the targeted loci were recovered from data generated with both kits and used to investigate overlap in data between kits and relative efficiency of the general and custom approaches. The power to resolve shallow-level relationships was tested using a summary species tree method and a concatenated maximum likelihood approach. High resolution and support are obtained using both approaches, but high levels of missing data disproportionately impact the latter. Targeted sequencing provides new insights into the evolution of morphology in the C4 Cyperus clade, demonstrating for example that the former segregate genus Alinula is polyphyletic despite its seeming morphological integrity. An unexpected result is that the Cyperus margaritaceus-Cyperus niveus complex comprises a clade separate from and sister to the core C4 Cyperus clade. Our results demonstrate that data generated with a family-specific kit do not necessarily have more power than those obtained with a universal kit, but that data generated with different targeted sequencing kits can often be merged for downstream analyses. Moreover, our study contributes to the growing consensus that targeted sequencing data are a powerful tool in resolving rapid radiations
Targeted sequencing supports morphology and embryo features in resolving the classification of Cyperaceae tribe Fuireneae s.l.
Molecular phylogenetic studies based on Sanger sequences have shown that Cyperaceae tribe Fuireneae s.l. is paraphyletic. However, taxonomic sampling in these studies has been poor, topologies have been inconsistent, and support for the backbone of trees has been weak. Moreover, uncertainty still surrounds the morphological limits of Schoenoplectiella, a genus of mainly small, amphicarpic annuals that was recently segregated from Schoenoplectus. Consequently, despite ample evidence from molecular analyses that Fuireneae s.l. might consist of two to four tribal lineages, no taxonomic changes have yet been made. Here, we use the Angiosperms353 enrichment panel for targeted sequencing in order to: (1) clarify the relationships of Fuireneae s.l. with the related tribes Abildgaardieae, Eleocharideae and Cypereae; (2) define the limits of Fuireneae s.s., and (3) test the monophyly of Fuireneae s.l. genera with emphasis on Schoenoplectus and Schoenoplectiella. Using more than a third of Fuireneae s.l. diversity, our phylogenomic analyses strongly support six genera and four major Fuireneae s.l. clades that we recognise as tribes: Bolboschoeneae stat.nov., Fuireneae s.s., Schoenoplecteae, and Pseudoschoeneae tr.nov. These results are consistent with morphological, micromorphological (nutlet epidermal cell shape), and embryo differences detected for each tribe. At the generic level, most subâSaharan African perennials currently treated in Schoenoplectus are transferred to Schoenoplectiella. Our targeted sequencing results show that these species are nested in Schoenoplectiella, and their treatment here is consistent with micromorphological and embryo characters shared by all Schoenoplectiella species. Keys to recognised tribes and genera are provided
A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering
Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants).We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5â15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself
Phylogenomics and the rise of the angiosperms
Angiosperms are the cornerstone of most terrestrial ecosystems and human livelihoods1, 2. A robust understanding of angiosperm evolution is required to explain their rise to ecological dominance. So far, the angiosperm tree of life has been determined primarily by means of analyses of the plastid genome3, 4. Many studies have drawn on this foundational work, such as classification and first insights into angiosperm diversification since their Mesozoic origins5â7. However, the limited and biased sampling of both taxa and genomes undermines confidence in the tree and its implications. Here, we build the tree of life for almost 8,000 (about 60%) angiosperm genera using a standardized set of 353 nuclear genes8. This 15-fold increase in genus-level sampling relative to comparable nuclear studies9 provides a critical test of earlier results and brings notable change to key groups, especially in rosids, while substantiating many previously predicted relationships. Scaling this tree to time using 200 fossils, we discovered that early angiosperm evolution was characterized by high gene tree conflict and explosive diversification, giving rise to more than 80% of extant angiosperm orders. Steady diversification ensued through the remaining Mesozoic Era until rates resurged in the Cenozoic Era, concurrent with decreasing global temperatures and tightly linked with gene tree conflict. Taken together, our extensive sampling combined with advanced phylogenomic methods shows the deep history and full complexity in the evolution of a megadiverse clade
Phylogenomics and the rise of the angiosperms
Angiosperms are the cornerstone of most terrestrial ecosystems and human livelihoods. A robust understanding of angiosperm evolution is required to explain their rise to ecological dominance. So far, the angiosperm tree of life has been determined primarily by means of analyses of the plastid genome. Many studies have drawn on this foundational work, such as classification and first insights into angiosperm diversification since their Mesozoic origins. However, the limited and biased sampling of both taxa and genomes undermines confidence in the tree and its implications. Here, we build the tree of life for almost 8,000 (about 60%) angiosperm genera using a standardized set of 353 nuclear genes. This 15-fold increase in genus-level sampling relative to comparable nuclear studies provides a critical test of earlier results and brings notable change to key groups, especially in rosids, while substantiating many previously predicted relationships. Scaling this tree to time using 200 fossils, we discovered that early angiosperm evolution was characterized by high gene tree conflict and explosive diversification, giving rise to more than 80% of extant angiosperm orders. Steady diversification ensued through the remaining Mesozoic Era until rates resurged in the Cenozoic Era, concurrent with decreasing global temperatures and tightly linked with gene tree conflict. Taken together, our extensive sampling combined with advanced phylogenomic methods shows the deep history and full complexity in the evolution of a megadiverse clade
A new classification of Cyperaceae (Poales) supported by phylogenomic data
Cyperaceae (sedges) are the third largest monocot family and are of considerable economic and ecological importance. Sedges represent an ideal model family to study evolutionary biology because of their species richness, global distribution, large discrepancies in lineage diversity, broad range of ecological preferences, and adaptations including multiple origins of C4 photosynthesis and holocentric chromosomes. Goetghebeurâs seminal work on Cyperaceae published in 1998 provided the most recent complete classification at tribal and generic level, based on a morphological study of Cyperaceae inflorescence, spikelet, flower and embryo characters plus anatomical and other information. Since then, several familyâlevel molecular phylogenetic studies using Sanger sequence data have been published. Here, more than 20 years after the last comprehensive classification of the family, we present the first familyâwide phylogenomic study of Cyperaceae based on targeted sequencing using the Angiosperms353 probe kit sampling 311 accessions. Additionally, 62 accessions available from GenBank were mined for overlapping reads and included in the phylogenomic analyses. Informed by this backbone phylogeny, a new classification for the family at the tribal, subtribal and generic levels is proposed. The majority of previously recognized suprageneric groups are supported, and for the first time we establish support for tribe Cryptangieae as a clade including the genus Koyamaea. We provide a taxonomic treatment including identification keys and diagnoses for the 2 subfamilies, 24 tribes and 10 subtribes and basic information on the 95 genera. The classification includes five new subtribes in tribe Schoeneae: Anthelepidinae, Caustiinae, Gymnoschoeninae, Lepidospermatinae and Oreobolinae. This article is protected by copyright. All rights reserved
A nuclear phylogenomic study of the angiosperm order Myrtales, exploring the potential and limitations of the universal Angiosperms353 probe set
PREMISE: To further advance the understanding of the species- rich, economically and ecologically important angiosperm order Myrtales in the rosid clade, comprising nine families, approximately 400 genera and almost 14,000 species occurring on all continents (except Antarctica), we tested the Angiosperms353 probe kit.METHODS: We combined high- throughput sequencing and target enrichment with the Angiosperms353 probe kit to evaluate a sample of 485 species across 305 genera (76% of all genera in the order).RESULTS: Results provide the most comprehensive phylogenetic hypothesis for the order to date. Relationships at all ranks, such as the relationship of the early-diverging families, often reflect previous studies, but gene conflict is evident, and relationships previously found to be uncertain often remain so. Technical considerations for processing HTS data are also discussed.CONCLUSIONS: High- throughput sequencing and the Angiosperms353 probe kit are powerful tools for phylogenomic analysis, but better understanding of the genetic data available is required to identify genes and gene trees that account for likely incomplete lineage sorting and/or hybridization events