149 research outputs found

    The Dawn of Open Access to Phylogenetic Data

    The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Phylogenies are increasingly estimated from large, genome-scale datasets using complex statistical methods that demand growing expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation minus extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for ~60% of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor; and (3) the data are requested from faculty rather than students. Although the situation appears dire, our analyses suggest that it is far from hopeless: recent initiatives by the scientific community, including policy changes by journals and funding agencies, are improving the state of affairs.

    Phylogenetic relationships of cone snails endemic to Cabo Verde based on mitochondrial genomes

    Background: Due to their great species and ecological diversity as well as their capacity to produce hundreds of different toxins, cone snails are of interest to evolutionary biologists, pharmacologists and amateur naturalists alike. Taxonomic identification of cone snails still relies mostly on the shape, color, and banding patterns of the shell. However, these phenotypic traits are prone to homoplasy. Nevertheless, the consistent use of genetic data for species delimitation and phylogenetic inference in this apparently hyperdiverse group is still largely lacking. Here, we reconstruct the phylogeny of the cones endemic to the Cabo Verde archipelago, a well-known radiation of the group, using mitochondrial (mt) genomes. Results: The reconstructed phylogeny grouped the analyzed species into two main clades, one including Kalloconus from West Africa sister to Trovaoconus from Cabo Verde and the other with a paraphyletic Lautoconus due to the sister group relationship of Africonus from Cabo Verde and Lautoconus ventricosus from the Mediterranean Sea and neighboring Atlantic Ocean to the exclusion of Lautoconus endemic to Senegal (plus Lautoconus guanche from Mauritania, Morocco, and Canary Islands). Within Trovaoconus, up to three main lineages could be distinguished. The clade of Africonus included four main lineages (named I to IV), each further subdivided into two monophyletic groups. The reconstructed phylogeny allowed inferring the evolution of the radula in the studied lineages as well as biogeographic patterns. The number of cone species endemic to Cabo Verde was revised in light of sequence divergence data and the inferred phylogenetic relationships. Conclusions: The sequence divergence between continental members of the genus Kalloconus and island endemics ascribed to the genus Trovaoconus is low, which supports synonymizing the latter. The genus Lautoconus is paraphyletic. Lautoconus ventricosus is the closest living sister group of genus Africonus. Diversification of Africonus occurred in allopatry, owing to the direct development of their larvae, and was mainly triggered by eustatic sea level changes during the Miocene-Pliocene. Our study confirms the diversity of cones endemic to Cabo Verde but significantly reduces the number of valid species. Applying a sequence divergence threshold, the number of valid species within the sampled Africonus is reduced by half. Funding: Spanish Ministry of Science and Innovation [CGL2013-45211-C2-2-P, CGL2016-75255-C2-1-P, BES-2011-051469, BES-2014-069575, Doctorado Nacional-567].

    Dating Phylogenies with Hybrid Local Molecular Clocks

    BACKGROUND: Because rates of evolution and species divergence times cannot be estimated directly from molecular data, all current dating methods require that specific assumptions be made before inferring any divergence time. These assumptions typically bear either on rates of molecular evolution (molecular clock hypothesis, local clock models) or on both rates and times (penalized likelihood, Bayesian methods). However, most of these assumptions can affect estimated dates, often because they underestimate large amounts of rate change. PRINCIPAL FINDINGS: A significant modification to a recently proposed ad hoc rate-smoothing algorithm is described, in which local molecular clocks are automatically placed on a phylogeny. This modification makes use of hybrid approaches that borrow from recent theoretical developments in microarray data analysis. An ad hoc integration of phylogenetic uncertainty under these local clock models is also described. The performance and accuracy of the new methods are evaluated by reanalyzing three published data sets. CONCLUSIONS: It is shown that the new maximum likelihood hybrid methods can perform better than penalized likelihood and almost as well as uncorrelated Bayesian models. However, the new methods still tend to underestimate the actual amount of rate change. This work demonstrates the difficulty of estimating divergence times using local molecular clocks.

    Diversity dynamics in New Caledonia: towards the end of the museum model?

    Background: The high diversity of New Caledonia has traditionally been seen as a result of its Gondwanan origin, old age and long isolation under stable climatic conditions (the museum model). Under this scenario, we would expect species diversification to follow a constant rate model. Alternatively, if New Caledonia was completely submerged after its breakup from Gondwana, as geological evidence indicates, we would expect species diversification to show a characteristic slowdown over time according to a diversity-dependent model where species accumulation decreases as space is filled. Results: We reanalyze available datasets for New Caledonia and reconstruct the phylogenies using standardized methodologies; we use two ultrametrization alternatives; and we take into account phylogenetic uncertainty as well as incomplete taxon sampling when conducting diversification rate constancy tests. Our results indicate that for 8 of the 9 available phylogenies, there is significant evidence for a diversification slowdown. For the youngest group under investigation, the apparent lack of evidence of a significant slowdown could be because we are still observing the early phase of a logistic growth (i.e. the clade may be too young to exhibit a change in diversification rates). Conclusions: Our results are consistent with a diversity-dependent model of diversification in New Caledonia. In opposition to the museum model, our results provide additional evidence that original New Caledonian biodiversity was wiped out during the episode of submersion, providing an open and empty space facilitating evolutionary radiations.

    Genomic Diversity and Evolution of the Lyssaviruses

    Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full-length genome sequences provide the best discriminatory power for genotype classification within the lyssaviruses.

    The Evolution of the Major Hepatitis C Genotypes Correlates with Clinical Response to Interferon Therapy

    Patients chronically infected with hepatitis C virus (HCV) require significantly different durations of therapy and achieve substantially different sustained virologic response rates to interferon-based therapies, depending on the HCV genotype with which they are infected. There currently exists no systematic framework that explains these genotype-specific response rates. Since humans are the only known natural hosts for HCV (a virus that is at least hundreds of years old), one possibility is that over the time frame of this relationship, HCV accumulated adaptive mutations that confer increasing resistance to the human immune system. Given that interferon therapy functions by triggering an immune response, we hypothesized that clinical response rates are a reflection of viral evolutionary adaptations to the immune system. We have performed the first phylogenetic analysis to include all available full-length HCV genomic sequences (n = 345). This resulted in a new cladogram of HCV. This tree establishes for the first time the relative evolutionary ages of the major HCV genotypes. The outcome data from prospective clinical trials that studied interferon and ribavirin therapy were then mapped onto this new tree. This mapping revealed a correlation between genotype-specific responses to therapy and respective genotype age. This correlation allows us to predict that genotypes 5 and 6, for which there currently are no published prospective trials, will likely have intermediate response rates, similar to genotype 3. Ancestral protein sequence reconstruction was also performed, which identified the HCV proteins E2 and NS5A as potential determinants of genotype-specific clinical outcome. Biochemical studies have independently identified these same two proteins as having genotype-specific abilities to inhibit the innate immune factor double-stranded RNA-dependent protein kinase (PKR). An evolutionary analysis of all available HCV genomes supports the hypothesis that immune selection was a significant driving force in the divergence of the major HCV genotypes and that viral factors that acquired the ability to inhibit the immune response may play a role in determining genotype-specific response rates to interferon therapy.

    Molecular phylogenetics and temporal diversification in the genus Aeromonas based on the sequences of five housekeeping genes

    Several approaches have been developed to estimate both the relative and absolute rates of speciation and extinction within clades based on molecular phylogenetic reconstructions of evolutionary relationships, according to an underlying model of diversification. However, the macroevolutionary models established for eukaryotes have scarcely been used with prokaryotes. We have investigated the rate and pattern of cladogenesis in the genus Aeromonas (γ-Proteobacteria, Proteobacteria, Bacteria) using the sequences of five housekeeping genes and an uncorrelated relaxed-clock approach. To our knowledge, this analysis has never before been applied to all the species described in a bacterial genus, and it thus opens up the possibility of establishing models of speciation from sequence data commonly used in phylogenetic studies of prokaryotes. Our results suggest that the genus Aeromonas began to diverge between 248 and 266 million years ago, exhibiting a constant divergence rate through the Phanerozoic, which could be described as a pure birth process.
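
For readers unfamiliar with the term, a pure birth (Yule) process assumes a constant speciation rate and no extinction. The sketch below is illustrative only and is not the analysis used in the study above: the function names `yule_rate` and `expected_lineages` are hypothetical, and the estimator assumes a fully bifurcating, ultrametric tree that starts from two lineages at the crown node.

```python
import math


def yule_rate(n_tips, total_branch_length):
    """Maximum-likelihood birth rate under a pure birth model.

    Assumes the tree starts with two lineages at the crown node, so
    n_tips - 2 speciation events occur along the summed branch length.
    """
    if n_tips < 3:
        raise ValueError("need at least 3 tips to estimate a rate")
    if total_branch_length <= 0:
        raise ValueError("total branch length must be positive")
    return (n_tips - 2) / total_branch_length


def expected_lineages(n_start, rate, elapsed):
    """Expected lineage count after `elapsed` time units with no extinction."""
    return n_start * math.exp(rate * elapsed)
```

A constant rate implies exponential growth in lineage number, which is why a straight line on a log-scaled lineages-through-time plot is commonly read as consistent with pure birth.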

    A Comparison of the Effects of Random and Selective Mass Extinctions on Erosion of Evolutionary History in Communities of Digital Organisms

    The effect of mass extinctions on phylogenetic diversity and branching history of clades remains poorly understood in paleobiology. We examined the phylogenies of communities of digital organisms undergoing open-ended evolution as we subjected them to instantaneous “pulse” extinctions, choosing survivors at random, and to prolonged “press” extinctions involving a period of low resource availability. We measured age of the phylogenetic root and tree stemminess, and evaluated how branching history of the phylogenetic trees was affected by the extinction treatments. We found that strong random (pulse) and strong selective extinction (press) both left clear long-term signatures in root age distribution and tree stemminess, and eroded deep branching history to a greater degree than did weak extinction and control treatments. The widely used Pybus-Harvey gamma statistic showed a clear short-term response to extinction and recovery, but differences between treatments diminished over time and did not show a long-term signature. The characteristics of post-extinction phylogenies were often affected as much by the recovery interval as by the extinction episode itself.
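
The Pybus-Harvey gamma statistic mentioned above summarizes where branching events fall between the root and the tips of an ultrametric tree. The sketch below is a minimal illustration, not code from the study: the function name `gamma_statistic` and its input convention (a list of internode intervals ordered from the root, where the first entry is the time spent with two lineages) are assumptions made for this example.

```python
import math


def gamma_statistic(intervals):
    """Pybus-Harvey gamma from internode intervals g_2 ... g_n.

    intervals[0] is the time during which the tree has 2 lineages,
    intervals[1] the time with 3 lineages, and so on up to n lineages.
    """
    n = len(intervals) + 1  # number of tips
    if n < 3:
        raise ValueError("gamma is undefined for fewer than 3 tips")
    # k * g_k: branch length accumulated while k lineages exist
    weighted = [k * g for k, g in enumerate(intervals, start=2)]
    total = sum(weighted)
    if total <= 0:
        raise ValueError("intervals must sum to a positive length")
    running = 0.0
    sum_of_partials = 0.0
    for w in weighted[:-1]:  # partial sums for i = 2 .. n-1
        running += w
        sum_of_partials += running
    mean_partial = sum_of_partials / (n - 2)
    return (mean_partial - total / 2) / (total * math.sqrt(1 / (12 * (n - 2))))
```

Under a pure birth model gamma is approximately standard normal; negative values indicate that branching events are concentrated near the root, the pattern usually interpreted as a diversification slowdown.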