34 research outputs found
A New Method for Handling Missing Species in Diversification Analysis Applicable to Randomly or Nonrandomly Sampled Phylogenies
Chronograms from molecular dating are increasingly being used to infer rates of diversification and their change over time. A major limitation in such analyses is incomplete species sampling that moreover is usually nonrandom. While the widely used γ statistic with the Monte Carlo constant-rates test or the birth-death likelihood analysis with the δ AICrc test statistic are appropriate for comparing the fit of different diversification models in phylogenies with random species sampling, no objective automated method has been developed for fitting diversification models to nonrandomly sampled phylogenies. Here, we introduce a novel approach, CorSiM, which involves simulating missing splits under a constant rate birth-death model and allows the user to specify whether species sampling in the phylogeny being analyzed is random or nonrandom. The completed trees can be used in subsequent model-fitting analyses. This is fundamentally different from previous diversification rate estimation methods, which were based on null distributions derived from the incomplete trees. CorSiM is automated in an R package and can easily be applied to large data sets. We illustrate the approach in two Araceae clades, one with a random species sampling of 52% and one with a nonrandom sampling of 55%. In the latter clade, the CorSiM approach detects and quantifies an increase in diversification rate, whereas classic approaches prefer a constant rate model; in the former clade, results do not differ among methods (as indeed expected since the classic approaches are valid only for randomly sampled phylogenies). The CorSiM method greatly reduces the type I error in diversification analysis, but type II error remains a methodological proble
Reevaluation of the cox1 group I intron in Araceae and angiosperms indicates a history dominated by loss rather than horizontal transfer
The origin and modes of transmission of introns remain matters of much debate. Previous studies of the group I intron in the angiosperm cox1 gene inferred frequent angiosperm-to-angiosperm horizontal transmission of the intron from apparent incongruence between intron phylogenies and angiosperm phylogenies, patchy distribution of the intron among angiosperms, and differences between cox1 exonic coconversion tracts (the first 22 nt downstream of where the intron inserted). We analyzed the cox1 gene in 179 angiosperms, 110 of them containing the intron (intron+) and 69 lacking it (intron-). Our taxon sampling in Araceae is especially dense to test hypotheses about vertical and horizontal intron transmission put forward by Cho and Palmer (1999. Multiple acquisitions via horizontal transfer of a group I intron in the mitochondrial coxl gene during evolution of the Araceae family. Mol Biol Evol. 16:1155–1165). Maximum likelihood trees of Araceae cox1 introns, and also of all angiosperm cox1 introns, are largely congruent with known phylogenetic relationships in these taxa. The exceptions can be explained by low signal in the intron and long-branch attraction among a few taxa with high mitochondrial substitution rates. Analysis of the 179 coconversion tracts reveals 20 types of tracts (11 of them only found in single species, all involving silent substitutions). The distribution of these tracts on the angiosperm phylogeny shows a common ancestral type, characterizing most intron+ and some intron- angiosperms, and several derivative tract types arising from gradual back mutation of the coconverted nucleotides. Molecular clock dating of small intron+ and intron- sister clades suggests that coconversion tracts have persisted for 70 Myr in Araceae, whose cox1 sequences evolve comparatively slowly. Sequence similarity among the 110 introns ranges from 91% to identical, whereas putative homologs from fungi are highly different, but sampling in fungi is still sparse. Together, these results suggest that the cox1 intron entered angiosperms once, has largely or entirely been transmitted vertically, and has been lost numerous times, with coconversion tract footprints providing unreliable signal of former intron presence
Assembled Plastid and Mitochondrial Genomes, as well as Nuclear Genes, Place the Parasite Family Cynomoriaceae in the Saxifragales
Cynomoriaceae, one of the last unplaced families of flowering plants, comprise one or two species or subspecies of root parasites that occur from the Mediterranean to the Gobi Desert. Using Illumina sequencing, we assembled the mitochondrial and plastid genomes as well as some nuclear genes of a Cynomorium specimen from Italy. Selected genes were also obtained by Sanger sequencing from individuals collected in China and Iran, resulting in matrices of 33 mitochondrial, 6 nuclear, and 14 plastid genes and rDNAs enlarged to include a representative angiosperm taxon sampling based on data available in GenBank. We also compiled a new geographic map to discern possible discontinuities in the parasites' occurrence. Cynomorium has large genomes of 13.70-13.61 (Italy) to 13.95-13.76 pg (China). Its mitochondrial genome consists of up to 49 circular subgenomes and has an overall gene content similar to that of photosynthetic angiosperms, while its plastome retains only 27 of the normally 116 genes. Nuclear, plastid and mitochondrial phylogenies place Cynomoriaceae in Saxifragales, and we found evidence for several horizontal gene transfers from different hosts, as well as intracellular gene transfers
Assembled Plastid and Mitochondrial Genomes, as well as Nuclear Genes, Place the Parasite Family Cynomoriaceae in the Saxifragales
Cynomoriaceae, one of the last unplaced families of flowering plants, comprise one or two species or subspecies of root parasites that occur from the Mediterranean to the Gobi Desert. Using Illumina sequencing, we assembled the mitochondrial and plastid genomes as well as some nuclear genes of a Cynomorium specimen from Italy. Selected genes were also obtained by Sanger sequencing from individuals collected in China and Iran, resulting in matrices of 33 mitochondrial, 6 nuclear, and 14 plastid genes and rDNAs enlarged to include a representative angiosperm taxon sampling based on data available in GenBank. We also compiled a new geographic map to discern possible discontinuities in the parasites' occurrence. Cynomorium has large genomes of 13.70-13.61 (Italy) to 13.95-13.76 pg (China). Its mitochondrial genome consists of up to 49 circular subgenomes and has an overall gene content similar to that of photosynthetic angiosperms, while its plastome retains only 27 of the normally 116 genes. Nuclear, plastid and mitochondrial phylogenies place Cynomoriaceae in Saxifragales, and we found evidence for several horizontal gene transfers from different hosts, as well as intracellular gene transfers
Relationships within the Araceae: comparison of morphological patterns with molecular phylogenies.
Premise of the study: The first family-wide molecular phylogeny of the Araceae, a family of about 3800 published species in 120 genera, became available in 1995, followed by a cladistic analysis of morpho-anatomical data in 1997. The most recent and comprehensive family-wide molecular phylogeny was published in 2008 and included species from 102 genera. We reanalyzed the molecular data with a more complete genus sampling and compared the resulting phylogeny with morphological and anatomical data, with a view to contributing to a new formal classification of the Araceae.
• Methods: We analyzed 113 aroid genera and 4494 aligned nucleotides that resulted from adding 11 genera to the 2008 molecular matrix. We also analyzed 81 morphological characters in the context of the molecular phylogeny, using an extended version of the 1997 morpho-anatomical data set.
• Key results: The resulting maximum-likelihood phylogeny is well resolved and supported, and most of the 44 larger clades also have morphological or anatomical synapomorphies as well as ecological or geographic cohesion. Of the 44 clades, 16 are here newly circumscribed and informally named. However, some relationships remain poorly supported within the Aroideae subfamily. The most problematic placement is Calla within Aroideae, which conflicts with the distribution of morphological, anatomical, and palynological character states.
• Conclusions: The comparison of the molecular analysis with morphological and anatomical data presented here represents an important basis for a new formal classification for the Araceae and for the understanding of the evolution of this ancient family, a monocot group known in the fossil record from the early Cretaceous
Slowdowns in diversification rates from real phylogenies may not be real
Studies of diversification patterns often find a slowing in lineage accumulation toward the present. This seemingly pervasive pattern of rate downturns has been taken as evidence for adaptive radiations, density-dependent regulation, and metacommunity species interactions. The significance of rate downturns is evaluated with statistical tests (the γ statistic and Monte Carlo constant rates (MCCR) test; birth–death likelihood models and Akaike Information Criterion [AIC] scores) that rely on null distributions, which assume that the included species are a random sample of the entire clade. Sampling in real phylogenies, however, often is nonrandom because systematists try to include early-diverging species or representatives of previous intrataxon classifications. We studied the effects of biased sampling, structured sampling, and random sampling by experimentally pruning simulated trees (60 and 150 species) as well as a completely sampled empirical tree (58 species) and then applying the γ statistic/MCCR test and birth–death likelihood models/AIC scores to assess rate changes. For trees with random species sampling, the true model (i.e., the one fitting the complete phylogenies) could be inferred in most cases. Oversampling deep nodes, however, strongly biases inferences toward downturns, with simulations of structured and biased sampling suggesting that this occurs when sampling percentages drop below 80%. The magnitude of the effect and the sensitivity of diversification rate models is such that a useful rule of thumb may be not to infer rate downturns from real trees unless they have >80% species sampling
Appendix_S3B_plastid_genes_94_1971
The alignment underlying Fig. S3
Appendix_S4D_plastid_genes_11_5567
The alignment underlying Fig. S4
Appendix_S2F_cox1_exon_ed_sites_excl_180_1307
The alignment underlying Fig. S2