157 research outputs found

    Trees of Unusual Size: Biased Inference of Early Bursts from Large Molecular Phylogenies

    Get PDF
    An early burst of speciation followed by a subsequent slowdown in the rate of diversification is commonly inferred from molecular phylogenies. This pattern is consistent with some verbal theory of ecological opportunity and adaptive radiations. One often-overlooked source of bias in these studies is that of sampling at the level of whole clades, as researchers tend to choose large, speciose clades to study. In this paper, we investigate the performance of common methods across the distribution of clade sizes that can be generated by a constant-rate birth-death process. Clades which are larger than expected for a given constant-rate branching process tend to show a pattern of an early burst even when both speciation and extinction rates are constant through time. All methods evaluated were susceptible to detecting this false signature when extinction was low. Under moderate extinction, both the gamma-statistic and diversity-dependent models did not detect such a slowdown but only because the signature of a slowdown was masked by subsequent extinction. Some models which estimate time-varying speciation rates are able to detect early bursts under higher extinction rates, but are extremely prone to sampling bias. We suggest that examining clades in isolation may result in spurious inferences that rates of diversification have changed through time.Comment: 17 pages, 5 figure

    Bacterial diversification through geological time

    Get PDF
    Numerous studies have estimated plant and animal diversification dynamics; however, no comparable rigorous estimates exist for bacteria—the most ancient and widespread form of life on Earth. Here, we analyse phylogenies comprising up to 448,112 bacterial lineages to reconstruct global bacterial diversification dynamics. To handle such large phylogenies, we developed methods based on the statistical properties of infinitely large trees. We further analysed sequencing data from 60 environmental studies to determine the fraction of extant bacterial diversity missing from the phylogenies—a crucial parameter for estimating speciation and extinction rates. We estimate that there are about 1.4–1.9 million extant bacterial lineages when lineages are defined by 99% similarity in the 16S ribosomal RNA gene, and that bacterial diversity has been continuously increasing over the past 1 billion years (Gyr). Recent bacterial extinction rates are estimated at 0.03–0.05 per lineage per million years (lineage^(–1) Myr^(–1)), and are only slightly below estimated recent bacterial speciation rates. Most bacterial lineages ever to have inhabited this planet are estimated to be extinct. Our findings disprove the notion that bacteria are unlikely to go extinct, and provide a valuable perspective on the evolutionary history of a domain of life with a sparse and cryptic fossil record

    Bacterial diversification through geological time

    Get PDF
    Numerous studies have estimated plant and animal diversification dynamics; however, no comparable rigorous estimates exist for bacteria—the most ancient and widespread form of life on Earth. Here, we analyse phylogenies comprising up to 448,112 bacterial lineages to reconstruct global bacterial diversification dynamics. To handle such large phylogenies, we developed methods based on the statistical properties of infinitely large trees. We further analysed sequencing data from 60 environmental studies to determine the fraction of extant bacterial diversity missing from the phylogenies—a crucial parameter for estimating speciation and extinction rates. We estimate that there are about 1.4–1.9 million extant bacterial lineages when lineages are defined by 99% similarity in the 16S ribosomal RNA gene, and that bacterial diversity has been continuously increasing over the past 1 billion years (Gyr). Recent bacterial extinction rates are estimated at 0.03–0.05 per lineage per million years (lineage^(–1) Myr^(–1)), and are only slightly below estimated recent bacterial speciation rates. Most bacterial lineages ever to have inhabited this planet are estimated to be extinct. Our findings disprove the notion that bacteria are unlikely to go extinct, and provide a valuable perspective on the evolutionary history of a domain of life with a sparse and cryptic fossil record

    Reply to: “Global Conservation of Phylogenetic Diversity Captures More Than Just Functional Diversity”

    Get PDF
    Academic biologists have long advocated for conserving phylogenetic diversity (PD), often (but not exclusively) on the basis that PD is a useful proxy for “feature diversity”, defined as the variety of forms and functions represented in set of organisms (see below for an extended discussion of this definition). In a recent paper, we assess the extent to which this proxy (which we coined the “phylogenetic gambit”) holds in three empirical datasets (terrestrial mammals, birds, and tropical marine fishes) when using functional traits and functional diversity (FD) to operationalize feature diversity. Owen et al. offer a criticism of our methods for quantifying feature diversity with FD and disagree with our conclusions. We are grateful that Owen et al. have engaged thoughtfully with our work, but we believe there are more points of agreement than Owen et al. imply

    Prioritizing Phylogenetic Diversity Captures Functional Diversity Unreliably

    Get PDF
    In the face of the biodiversity crisis, it is argued that we should prioritize species in order to capture high functional diversity (FD). Because species traits often reflect shared evolutionary history, many researchers have assumed that maximizing phylogenetic diversity (PD) should indirectly capture FD, a hypothesis that we name the “phylogenetic gambit”. Here, we empirically test this gambit using data on ecologically relevant traits from \u3e15,000 vertebrate species. Specifically, we estimate a measure of surrogacy of PD for FD. We find that maximizing PD results in an average gain of 18% of FD relative to random choice. However, this average gain obscures the fact that in over one-third of the comparisons, maximum PD sets contain less FD than randomly chosen sets of species. These results suggest that, while maximizing PD protection can help to protect FD, it represents a risky conservation strategy

    Phylotastic! Making Tree-of-Life Knowledge Accessible, Reusable and Convenient

    Get PDF
    Scientists rarely reuse expert knowledge of phylogeny, in spite of years of effort to assemble a great "Tree of Life" (ToL). A notable exception involves the use of Phylomatic, which provides tools to generate custom phylogenies from a large, pre-computed, expert phylogeny of plant taxa. This suggests great potential for a more generalized system that, starting with a query consisting of a list of any known species, would rectify non-standard names, identify expert phylogenies containing the implicated taxa, prune away unneeded parts, and supply branch lengths and annotations, resulting in a custom phylogeny suited to the user's needs. Such a system could become a sustainable community resource if implemented as a distributed system of loosely coupled parts that interact through clearly defined interfaces. Results: With the aim of building such a "phylotastic" system, the NESCent Hackathons, Interoperability, Phylogenies (HIP) working group recruited 2 dozen scientist-programmers to a weeklong programming hackathon in June 2012. During the hackathon (and a three-month follow-up period), 5 teams produced designs, implementations, documentation, presentations, and tests including: (1) a generalized scheme for integrating components; (2) proof-of-concept pruners and controllers; (3) a meta-API for taxonomic name resolution services; (4) a system for storing, finding, and retrieving phylogenies using semantic web technologies for data exchange, storage, and querying; (5) an innovative new service, DateLife.org, which synthesizes pre-computed, time-calibrated phylogenies to assign ages to nodes; and (6) demonstration projects. These outcomes are accessible via a public code repository (GitHub.com), a website (www.phylotastic.org), and a server image. Conclusions: Approximately 9 person-months of effort (centered on a software development hackathon) resulted in the design and implementation of proof-of-concept software for 4 core phylotastic components, 3 controllers, and 3 end-user demonstration tools. While these products have substantial limitations, they suggest considerable potential for a distributed system that makes phylogenetic knowledge readily accessible in computable form. Widespread use of phylotastic systems will create an electronic marketplace for sharing phylogenetic knowledge that will spur innovation in other areas of the ToL enterprise, such as annotation of sources and methods and third-party methods of quality assessment.NESCent (the National Evolutionary Synthesis Center)NSF EF-0905606iPlant Collaborative (NSF) DBI-0735191Biodiversity Synthesis Center (BioSync) of the Encyclopedia of LifeComputer Science

    Phylotastic! Making tree-of-life knowledge accessible, reusable and convenient

    Full text link
    Abstract Background Scientists rarely reuse expert knowledge of phylogeny, in spite of years of effort to assemble a great “Tree of Life” (ToL). A notable exception involves the use of Phylomatic, which provides tools to generate custom phylogenies from a large, pre-computed, expert phylogeny of plant taxa. This suggests great potential for a more generalized system that, starting with a query consisting of a list of any known species, would rectify non-standard names, identify expert phylogenies containing the implicated taxa, prune away unneeded parts, and supply branch lengths and annotations, resulting in a custom phylogeny suited to the user’s needs. Such a system could become a sustainable community resource if implemented as a distributed system of loosely coupled parts that interact through clearly defined interfaces. Results With the aim of building such a “phylotastic” system, the NESCent Hackathons, Interoperability, Phylogenies (HIP) working group recruited 2 dozen scientist-programmers to a weeklong programming hackathon in June 2012. During the hackathon (and a three-month follow-up period), 5 teams produced designs, implementations, documentation, presentations, and tests including: (1) a generalized scheme for integrating components; (2) proof-of-concept pruners and controllers; (3) a meta-API for taxonomic name resolution services; (4) a system for storing, finding, and retrieving phylogenies using semantic web technologies for data exchange, storage, and querying; (5) an innovative new service, DateLife.org, which synthesizes pre-computed, time-calibrated phylogenies to assign ages to nodes; and (6) demonstration projects. These outcomes are accessible via a public code repository (GitHub.com), a website ( http://www.phylotastic.org ), and a server image. Conclusions Approximately 9 person-months of effort (centered on a software development hackathon) resulted in the design and implementation of proof-of-concept software for 4 core phylotastic components, 3 controllers, and 3 end-user demonstration tools. While these products have substantial limitations, they suggest considerable potential for a distributed system that makes phylogenetic knowledge readily accessible in computable form. Widespread use of phylotastic systems will create an electronic marketplace for sharing phylogenetic knowledge that will spur innovation in other areas of the ToL enterprise, such as annotation of sources and methods and third-party methods of quality assessment.http://deepblue.lib.umich.edu/bitstream/2027.42/112888/1/12859_2013_Article_5897.pd

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts
    corecore