
    The importance of data partitioning and the utility of Bayes factors in Bayesian phylogenetics

    As larger, more complex data sets are being used to infer phylogenies, accuracy of these phylogenies increasingly requires models of evolution that accommodate heterogeneity in the processes of molecular evolution. We investigated the effect of improper data partitioning on phylogenetic accuracy, as well as the type I error rate and sensitivity of Bayes factors, a commonly used method for choosing among different partitioning strategies in Bayesian analyses. We also used Bayes factors to test empirical data for the need to divide data in a manner that has no expected biological meaning. Posterior probability estimates are misleading when an incorrect partitioning strategy is assumed. The error was greatest when the assumed model was underpartitioned. These results suggest that model partitioning is important for large data sets. Bayes factors performed well, giving a 5% type I error rate, which is remarkably consistent with standard frequentist hypothesis tests. The sensitivity of Bayes factors was found to be quite high when the across-class model heterogeneity reflected that of empirical data. These results suggest that Bayes factors represent a robust method of choosing among partitioning strategies. Lastly, results of tests for the inclusion of unexpected divisions in empirical data mirrored the simulation results, although the outcome of such tests is highly dependent on accounting for rate variation among classes. We conclude by discussing other approaches for partitioning data, as well as other applications of Bayes factors. Copyright © Society of Systematic Biologists
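As a rough illustration of how Bayes factors are typically compared between two partitioning strategies, the sketch below computes 2 ln(BF) from two log marginal likelihoods. The harmonic-mean estimator and all function names are assumptions made for this example; the abstract does not say how the marginal likelihoods were estimated, and the harmonic mean is known to be a crude estimator.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) for a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def harmonic_mean_log_marginal(log_likelihoods):
    """Harmonic-mean estimate of the log marginal likelihood from
    log-likelihood values sampled by MCMC (an assumed estimator)."""
    n = len(log_likelihoods)
    return math.log(n) - log_sum_exp([-x for x in log_likelihoods])


def two_ln_bayes_factor(ln_marginal_1, ln_marginal_0):
    """2 ln(BF_10): positive values favor model 1 over model 0."""
    return 2.0 * (ln_marginal_1 - ln_marginal_0)


# Hypothetical log marginal likelihoods for a partitioned and an
# unpartitioned model; the numbers are invented for illustration.
print(two_ln_bayes_factor(-1000.0, -1010.0))
```

Under a commonly cited convention, values of 2 ln(BF) above roughly 10 are read as very strong support for the more complex model; the appropriate cutoff for a given study is a judgment call.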

    When trees grow too long: Investigating the causes of highly inaccurate Bayesian branch-length estimates

    A surprising number of recent Bayesian phylogenetic analyses contain branch-length estimates that are several orders of magnitude longer than corresponding maximum-likelihood estimates. The levels of divergence implied by such branch lengths are unreasonable for studies using biological data and are known to be false for studies using simulated data. We conducted additional Bayesian analyses and studied approximate-posterior surfaces to investigate the causes underlying these large errors. We manipulated the starting parameter values of the Markov chain Monte Carlo (MCMC) analyses, the moves used by the MCMC analyses, and the prior-probability distribution on branch lengths. We demonstrate that inaccurate branch-length estimates result from either 1) poor mixing of MCMC chains or 2) posterior distributions with excessive weight at long tree lengths. Both effects are caused by a rapid increase in the volume of branch-length space as branches become longer. In the former case, both an MCMC move that scales all branch lengths in the tree simultaneously and the use of overdispersed starting branch lengths allow the chain to accurately sample the posterior distribution and should be used in Bayesian analyses of phylogeny. In the latter case, branch-length priors can have strong effects on resulting inferences and should be carefully chosen to reflect biological expectations. We provide a formula to calculate an exponential rate parameter for the branch-length prior that should eliminate inference of biased branch lengths in many cases. In any phylogenetic analysis, the biological plausibility of branch-length output must be carefully considered.

    Polyploidy breaks speciation barriers in Australian burrowing frogs Neobatrachus

    Polyploidy has played an important role in evolution across the tree of life, but it is still unclear how polyploid lineages may persist after their initial formation. While polyploidy is both common and well studied in plants, it is rare in animals and generally less understood. The Australian burrowing frog genus Neobatrachus comprises six diploid and three polyploid species and offers a powerful animal polyploid model system. We generated exome-capture sequence data from 87 individuals representing all nine species of Neobatrachus to investigate species-level relationships, the origin and inheritance mode of polyploid species, and the population genomic effects of polyploidy on genus-wide demography. We describe rapid speciation of diploid Neobatrachus species and show that the three independently originated polyploid species have tetrasomic or mixed inheritance. We document higher genetic diversity in tetraploids, resulting from widespread gene flow between the tetraploids, asymmetric inter-ploidy gene flow directed from sympatric diploids to tetraploids, and isolation of diploid species from each other. We also constructed models of ecologically suitable areas for each species to investigate the impact of climate on differing ploidy levels. These models suggest substantial change in suitable areas compared to past climate, which corresponds to population genomic estimates of demographic histories. We propose that Neobatrachus diploids may be suffering the early genomic impacts of climate-induced habitat loss, while tetraploids appear to be avoiding this fate, possibly due to widespread gene flow. Finally, we demonstrate that Neobatrachus is an attractive model to study the effects of ploidy on the evolution of adaptation in animals.

    Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes

    Background: Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86% of Madagascar’s serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci. Methods: Using Sanger sequencing and an anchored phylogenomics approach, we sequenced datasets comprising 5 and 377 loci, respectively, for 23 pseudoxyrhophiine taxa. For each dataset, we estimated phylogenies using both gene-tree (concatenation) and species-tree (STAR, MP-EST) approaches. We determined the similarity of resulting tree topologies from the different datasets using Robinson-Foulds distances. In addition, we examined how subsets of these data performed compared to the complete Sanger and anchored datasets for phylogenetic accuracy using the same tree inference methodologies, as well as the program *BEAST to determine if a full coalescent model for species tree estimation could generate robust results with fewer loci compared to the summary statistics species tree approaches. We also examined the individual gene trees in comparison to the 377-locus species tree using the program MetaTree. Results: Using the full anchored dataset under a variety of methods gave us the same, well-supported phylogeny for pseudoxyrhophiines. The African pseudoxyrhophiine Duberria is the sister taxon to the Malagasy pseudoxyrhophiine genera, providing evidence for a monophyletic radiation in Madagascar. In addition, within Madagascar, the two major clades inferred correspond largely to the aglyphous and opisthoglyphous genera, suggesting that feeding specializations associated with tooth venom delivery may have played a major role in the early diversification of this radiation. The comparison of tree topologies from the concatenated and species-tree methods using different datasets indicated the 5-locus dataset cannot be used to infer a correct phylogeny for the pseudoxyrhophiines under any method tested here and that summary statistics methods require 50 or more loci to consistently recover the species tree inferred using the complete anchored dataset. However, as few as 15 loci may infer the correct topology when using the full coalescent species tree method *BEAST. MetaTree analyses of each gene tree from the Sanger and anchored datasets found that none of the individual gene trees matched the 377-locus species tree, and that no gene trees were identical with respect to topology. Conclusions: Our results suggest that ≥50 loci may be necessary to confidently infer phylogenies when using summary species-tree methods, but that the coalescent-based method *BEAST consistently recovers the same topology using only 15 loci. These results reinforce that datasets with small numbers of markers may result in misleading topologies, and further, that the method of inference used to generate a phylogeny also has a major influence on the number of loci necessary to infer robust species trees. Electronic supplementary material: The online version of this article (doi:10.1186/s12862-015-0503-1) contains supplementary material, which is available to authorized users.
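The Robinson-Foulds distance mentioned in this abstract counts the bipartitions (splits) present in one tree but not the other. The following is a minimal sketch, assuming unrooted comparison, trees written as nested Python tuples of leaf names, and identical leaf sets in both trees; the function names and the toy trees are hypothetical and are not taken from the study.

```python
def leaf_sets(tree, out):
    """Return the leaf set of `tree`, appending the leaf set of every
    internal node (including the root) to `out`."""
    if isinstance(tree, str):
        return frozenset([tree])
    leaves = frozenset()
    for child in tree:
        leaves |= leaf_sets(child, out)
    out.append(leaves)
    return leaves


def splits(tree):
    """Non-trivial bipartitions of a tree, each stored as the side
    that does not contain a fixed reference leaf."""
    clades = []
    all_leaves = leaf_sets(tree, clades)
    ref = min(all_leaves)  # arbitrary but deterministic reference leaf
    result = set()
    for clade in clades:
        side = all_leaves - clade if ref in clade else clade
        # Skip trivial splits that separate zero or one leaf.
        if 1 < len(side) < len(all_leaves) - 1:
            result.add(side)
    return result


def robinson_foulds(tree_a, tree_b):
    """Number of splits found in exactly one of the two trees."""
    return len(splits(tree_a) ^ splits(tree_b))


# Toy five-taxon trees that differ in the placement of B and C.
t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", "C"), "B"), ("D", "E"))
print(robinson_foulds(t1, t2))
```

A distance of zero means the two topologies are identical; published analyses often report a normalized version, which this sketch does not compute.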

    A pilot study applying the plant Anchored Hybrid Enrichment method to New World sages (Salvia subgenus Calosphace; Lamiaceae)

    We conducted a pilot study using Anchored Hybrid Enrichment to resolve relationships among a mostly Neotropical sage lineage that may have undergone a recent evolutionary radiation. Conventional markers (ITS, trnL-trnF and trnH-psbA) have not been able to resolve the relationships among species or within portions of the backbone of the lineage. We sampled 12 representative species of subgenus Calosphace and included one species of Lepechinia, the closest relative of Salvia s.l., as outgroup. Hybrid enrichment and sequencing were successful, yielding 448 alignments of individual loci with an average length of 704 bp. The performance of the phylogenomic data in phylogenetic reconstruction was superior to that of conventional markers, increasing both support and resolution. Because the captured loci vary in the amount of net phylogenetic informativeness at different phylogenetic depths, these data are promising in phylogenetic reconstruction of this group and likely other lineages within Lamiales. However, special attention should be placed on the amount of phylogenetic noise that the data could potentially contain. A prior exploration step using phylogenetic informativeness profiles to detect loci with sites with disproportionately high substitution rates (showing "phantom" spikes) and, if required, the ensuing filtering of the problematic data is recommended. In our dataset, filtering resulted in increased support and resolution for the shallow nodes in maximum likelihood phylogenetic trees resulting from concatenated analyses of all the loci. Additionally, it is expected that an increase in sampling (loci and taxa) will aid in resolving weakly supported, short deep internal branches.
    Affiliation: Fragoso Martínez, Itzi. Universidad Nacional Autónoma de México; Mexico. Institute Of Biology Of Unam. Affiliation: Salazar, Gerardo A. Universidad Nacional Autónoma de México; Mexico. Affiliation: Martínez Gordillo, Martha. Universidad Nacional Autónoma de México; Mexico. Affiliation: Magallón, Susana. Universidad Nacional Autónoma de México; Mexico. Affiliation: Sánchez-Reyes, Luna. Universidad Nacional Autónoma de México; Mexico. Affiliation: Moriarty Lemmon, Emily. Florida State University; United States. Affiliation: Lemmon, Alan R. Florida State University; United States. Affiliation: Sazatornil, Federico David. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto Multidisciplinario de Biología Vegetal. Universidad Nacional de Córdoba. Facultad de Ciencias Exactas Físicas y Naturales. Instituto Multidisciplinario de Biología Vegetal; Argentina. Affiliation: Granados Mendoza, Carolina. Instituto Potosino de Investigación Científica y Tecnológica; Mexico. Universidad Nacional Autónoma de México; Mexico.

    Are 100 enough? Inferring acanthomorph teleost phylogeny using Anchored Hybrid Enrichment

    BACKGROUND: The past decade has witnessed remarkable progress towards resolution of the Tree of Life. However, despite the increased use of genomic scale datasets, some phylogenetic relationships remain difficult to resolve. Here we employ anchored phylogenomics to capture 107 nuclear loci in 29 species of acanthomorph teleost fishes, with 25 of these species sampled from the recently delimited clade Ovalentaria. Previous studies employing multilocus nuclear exon datasets have not been able to resolve the nodes at the base of the Ovalentaria tree with confidence. Here we test whether a phylogenomic approach will provide better support for these nodes, and if not, why this may be. RESULTS: After using a novel method to account for paralogous loci, we estimated phylogenies with maximum likelihood and species tree methods using DNA sequence alignments of over 80,000 base pairs. Several key relationships within Ovalentaria are well resolved, including 1) the sister taxon relationship between Cichlidae and Pholidichthys, 2) a clade containing blennies, grammas, clingfishes, and jawfishes, and 3) monophyly of Atherinomorpha (topminnows, flyingfishes, and silversides). However, many nodes in the phylogeny associated with the early diversification of Ovalentaria are poorly resolved in several analyses. Through the use of rarefaction curves we show that limited phylogenetic resolution among the earliest nodes in the Ovalentaria phylogeny does not appear to be due to a deficiency of data, as average global node support ceases to increase when only 1/3rd of the sampled loci are used in analyses. Instead this lack of resolution may be driven by model misspecification as a Bayesian mixed model analysis of the amino acid dataset provided good support for parts of the base of the Ovalentaria tree. 
CONCLUSIONS: Although it does not appear that the limited phylogenetic resolution among the earliest nodes in the Ovalentaria phylogeny is due to a deficiency of data, it may be that both stochastic and systematic error resulting from model misspecification play a role in the poor resolution at the base of the Ovalentaria tree as a Bayesian approach was able to resolve some of the deeper nodes, where the other methods failed. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-015-0415-0) contains supplementary material, which is available to authorized users

    Anchored enrichment dataset for true flies (order Diptera) reveals insights into the phylogeny of flower flies (family Syrphidae)

    Background: Anchored hybrid enrichment is a form of next-generation sequencing that uses oligonucleotide probes to target conserved regions of the genome flanked by less conserved regions in order to acquire data useful for phylogenetic inference from a broad range of taxa. Once a probe kit is developed, anchored hybrid enrichment is superior to traditional PCR-based Sanger sequencing in terms of both the amount of genomic data that can be recovered and effective cost. Due to their incredibly diverse nature, importance as pollinators, and historical instability with regard to subfamilial and tribal classification, Syrphidae (flower flies or hoverflies) are an ideal candidate for anchored hybrid enrichment-based phylogenetics, especially since recent molecular phylogenies of the syrphids using only a few markers have resulted in highly unresolved topologies. Over 6200 syrphids are currently known and uncovering their phylogeny will help us to understand how these species have diversified, providing insight into an array of ecological processes, from the development of adult mimicry, the origin of adult migration, to pollination patterns and the evolution of larval resource utilization. Results: We present the first use of anchored hybrid enrichment in insect phylogenetics on a dataset containing 30 flower fly species from across all four subfamilies and 11 tribes out of 15. To produce a phylogenetic hypothesis, 559 loci were sampled to produce a final dataset containing 217,702 sites. We recovered a well resolved topology with bootstrap support values that were almost universally >95 %. The subfamily Eristalinae is recovered as paraphyletic, with the strongest support for this hypothesis to date. The ant predators in the Microdontinae are sister to all other syrphids. Syrphinae and Pipizinae are monophyletic and sister to each other. Larval predation on soft-bodied hemipterans evolved only once in this family. 
Conclusions: Anchored hybrid enrichment was successful in producing a robustly supported phylogenetic hypothesis for the syrphids. Subfamilial reconstruction is concordant with recent phylogenetic hypotheses, but with much higher support values. With the newly designed probe kit this analysis could be rapidly expanded with further sampling, opening the door to more comprehensive analyses targeting problem areas in syrphid phylogenetics and ecology. Peer reviewed.

    Off-target capture data, endosymbiont genes and morphology reveal a relict lineage that is sister to all other singing cicadas

    Phylogenetic asymmetry is common throughout the tree of life and results from contrasting patterns of speciation and extinction in the paired descendant lineages of ancestral nodes. On the depauperate side of a node, we find extant 'relict' taxa that sit atop long, unbranched lineages. Here, we show that a tiny, pale green, inconspicuous and poorly known cicada in the genus Derotettix, endemic to degraded salt-plain habitats in arid regions of central Argentina, is a relict lineage that is sister to all other modern cicadas. Nuclear and mitochondrial phylogenies of cicadas inferred from probe-based genomic hybrid capture data of both target and non-target loci and a morphological cladogram support this hypothesis. We strengthen this conclusion with genomic data from one of the cicada nutritional bacterial endosymbionts, Sulcia, an ancient and obligate endosymbiont of the larger plant-sucking bugs (Auchenorrhyncha) and an important source of maternally inherited phylogenetic data. We establish Derotettiginae subfam. nov. as a new, monogeneric, fifth cicada subfamily, and compile existing and new data on the distribution, ecology and diet of Derotettix. Our consideration of the palaeoenvironmental literature and host-plant phylogenetics allows us to predict what might have led to the relict status of Derotettix over 100 Myr of habitat change in South America.
    Affiliation: Simon, Chris. University of Connecticut; United States. Affiliation: Gordon, Eric R. L. University of Connecticut; United States. Affiliation: Moulds, M.S. Australian Museum Research Institute; Australia. Affiliation: Cole, Jeffrey A. Pasadena City College; United States. Affiliation: Haji, Diler. University of Connecticut; United States. Affiliation: Lemmon, Alan R. Florida State University; United States. Affiliation: Lemmon, Emily Moriarty. Florida State University; United States. Affiliation: Kortyna, Michelle. Florida State University; United States. Affiliation: Nazario, Katherine. University of Connecticut; United States. Affiliation: Wade, Elizabeth J. Curry College. Department of Natural Sciences and Mathematics; United States. University of Connecticut; United States. Affiliation: Meister, Russell C. University of Connecticut; United States. Affiliation: Goemans, Geert. University of Connecticut; United States. Affiliation: Chiswell, Stephen M. National Institute of Water and Atmospheric Research; New Zealand. Affiliation: Pessacq, Pablo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Patagonia Norte. Centro de Investigación Esquel de Montaña y Estepa Patagónica. Universidad Nacional de la Patagonia "San Juan Bosco". Centro de Investigación Esquel de Montaña y Estepa Patagónica; Argentina. Affiliation: Veloso, Claudio. Universidad de Chile; Chile. Affiliation: McCutcheon, John P. University of Montana; United States. Affiliation: Lukasik, Piotr. University of Montana; United States. Swedish Museum of Natural History. Department of Bioinformatics and Genetics; Sweden.

    Phylogenomics Reveals Ancient Gene Tree Discordance in the Amphibian Tree of Life

    The Author(s) 2020. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. Molecular phylogenies have yielded strong support for many parts of the amphibian Tree of Life, but poor support for the resolution of deeper nodes, including relationships among families and orders. To clarify these relationships, we provide a phylogenomic perspective on amphibian relationships by developing a taxon-specific Anchored Hybrid Enrichment protocol targeting hundreds of conserved exons which are effective across the class. After obtaining data from 220 loci for 286 species (representing 94% of the families and 44% of the genera), we estimate a phylogeny for extant amphibians and identify gene tree-species tree conflict across the deepest branches of the amphibian phylogeny. We perform locus-by-locus genealogical interrogation of alternative topological hypotheses for amphibian monophyly, focusing on interordinal relationships. We find that phylogenetic signal deep in the amphibian phylogeny varies greatly across loci in a manner that is consistent with incomplete lineage sorting in the ancestral lineage of extant amphibians. Our results overwhelmingly support amphibian monophyly and a sister relationship between frogs and salamanders, consistent with the Batrachia hypothesis. Species tree analyses converge on a small set of topological hypotheses for the relationships among extant amphibian families. These results clarify several contentious portions of the amphibian Tree of Life, which in conjunction with a set of vetted fossil calibrations, support a surprisingly younger timescale for crown and ordinal amphibian diversification than previously reported. 
More broadly, our study provides insight into the sources, magnitudes, and heterogeneity of support across loci in phylogenomic data sets. [AIC; Amphibia; Batrachia; Phylogeny; gene tree-species tree discordance; genomics; information theory.] This work was supported by grants from a graduate student research award from the Society of Systematic Biologists and the University of Kentucky G.F. Ribble Endowment (to P.M.H.), by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES/BEX 2806/09-6 to P.L.V.P.), and by the National Science Foundation (DEB-0949532 and DEB-1355000 to D.W.W., DEB-1120516 to E.M.L., IIP-1313554 to A.R.L. and E.M.L., DEB-1355071 to J.M.B., DEB-1441719 to R.A.P., DEB-1311442 to P.L.V.P., DEB-1354506 to R.C.T., DEB-1021247 to E.P. and C.J.R., DEB-1021299 to K.M. Kjer, and DEB-1257610, DEB-0641023, DEB-0423286, and DEB-9984496 to C.J.R.), and the Australian Research Council (DP120104146 to J.S.K. and S.C.D.). S.R.R. thanks SENESCYT (Arca de Noé Initiative; SRR and O. Torres-Carvajal principal investigators) for funding for tissue collection. J.L. was supported by the Systematics Association and the Linnean Society Systematics Research Fund. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program (DGE-3048109801 to P.M.H.) and by the National Science Foundation-supported National Center for Supercomputing Applications Blue Waters Graduate Research Fellowship Program (under Grant No. 0725070, subaward 15836, to P.M.H.). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

    Long-Term Effects of April, August, or October Prescribed Fire on Yearling Stocker Cattle Performance and Native Rangeland Plant Composition in the Kansas Flint Hills

    Objective: The objective of our experiment was to determine if prescribed fire applied in April, August, or October influenced stocker growth performance or plant community characteristics in the Kansas Flint Hills over a 6-year period. Study Description: A total of 1,939 yearling stocker cattle were assigned to one of three prescribed-burn treatments: spring (April 11 ± 5.7 days), summer (August 25 ± 6.2 days), or fall (October 2 ± 9.0 days) over a 5-year period. Calves were grazed from May to August for 90 days. Individual body weights were recorded at the start and end of the grazing season. Native plant composition and soil cover were evaluated annually in June using a modified step-point method. The Bottom Line: Shifting prescribed fire from April to August or October reduced yearling stocker cattle weight gains by 10 to 14 lb during a 90-day grazing season. Ranchers are encouraged to consider the cost associated with herbicides versus the costs associated with reduced growth performance when developing a strategy for sericea lespedeza (Lespedeza cuneata) control.