371 research outputs found

    Neighborhoods of trees in circular orderings

    Get PDF
    In phylogenetics, a common strategy used to construct an evolutionary tree for a set of species X is to search in the space of all such trees for one that optimizes some given score function (such as the minimum evolution, parsimony or likelihood score). As this can be computationally intensive, it was recently proposed to restrict such searches to the set of all those trees that are compatible with some circular ordering of the set X. To inform the design of efficient algorithms to perform such searches, it is therefore of interest to find bounds for the number of trees compatible with a fixed ordering in the neighborhood of a tree that is determined by certain tree operations commonly used to search for trees: the nearest neighbor interchange (nni), the subtree prune and regraft (spr) and the tree bisection and reconnection (tbr) operations. We show that the size of such a neighborhood of a binary tree associated with the nni operation is independent of the tree’s topology, but that this is not the case for the spr and tbr operations. We also give tight upper and lower bounds for the size of the neighborhood of a binary tree for the spr and tbr operations and characterize those trees for which these bounds are attained

    Synonymous Substitution Rates Predict HIV Disease Progression as a Result of Underlying Replication Dynamics

    Get PDF
    Upon HIV transmission, some patients develop AIDS in only a few months, while others remain disease free for 20 or more years. This variation in the rate of disease progression is poorly understood and has been attributed to host genetics, host immune responses, co-infection, viral genetics, and adaptation. Here, we develop a new “relaxed-clock” phylogenetic method to estimate absolute rates of synonymous and nonsynonymous substitution through time. We identify an unexpected association between the synonymous substitution rate of HIV and disease progression parameters. Since immune activation is the major determinant of HIV disease progression, we propose that this process can also determine viral generation times, by creating favourable conditions for HIV replication. These conclusions may apply more generally to HIV evolution, since we also observed an overall low synonymous substitution rate for HIV-2, which is known to be less pathogenic than HIV-1 and capable of tempering the detrimental effects of immune activation. Humoral immune responses, on the other hand, are the major determinant of nonsynonymous rate changes through time in the envelope gene, and our relaxed-clock estimates support a decrease in selective pressure as a consequence of immune system collapse

    Single Cell Analysis of Lymph Node Tissue from HIV-1 Infected Patients Reveals that the Majority of CD4<sup>+</sup> T-cells Contain One HIV-1 DNA Molecule

    Get PDF
    Genetic recombination contributes to the diversity of human immunodeficiency virus (HIV-1). Productive HIV-1 recombination is, however, dependent on both the number of HIV-1 genomes per infected cell and the genetic relationship between these viral genomes. A detailed analysis of the number of proviruses and their genetic relationship in infected cells isolated from peripheral blood and tissue compartments is therefore important for understanding HIV-1 recombination, genetic diversity and the dynamics of HIV-1 infection. To address these issues, we used a previously developed single-cell sequencing technique to quantify and genetically characterize individual HIV-1 DNA molecules from single cells in lymph node tissue and peripheral blood. Analysis of memory and naïve CD4+ T cells from paired lymph node and peripheral blood samples from five untreated chronically infected patients revealed that the majority of these HIV-1-infected cells (>90%) contain only one copy of HIV-1 DNA, implying a limited potential for productive recombination in virus produced by these cells in these two compartments. Phylogenetic analysis revealed genetic similarity of HIV-1 DNA in memory and naïve CD4+ T-cells from lymph node, peripheral blood and HIV-1 RNA from plasma, implying exchange of virus and/or infected cells between these compartments in untreated chronic infection

    HIV Types, Groups, Subtypes and Recombinant Forms: Errors in Replication, Selection Pressure and Quasispecies

    Get PDF
    HIV-1 is a chimpanzee virus which was transmitted to humans by several zoonotic events resulting in infection with HIV-1 groups M P, and in parallel transmission events from sooty mangabey monkey viruses leading to infections with HIV-2 groups A H. Both viruses have circulated in the human population for about 80 years. In the infected patient, HIV mutates, and by elimination of some of the viruses by the action of the immune system individual quasispecies are formed. Along with the selection of the fittest viruses, mutation and recombination after superinfection with HIV from different groups or subtypes have resulted in the diversity of their patterns of geographic distribution. Despite the high variability observed, some essential parts of the HIV genome are highly conserved. Viral diversity is further facilitated in some parts of the HIV genome by drug selection pressure and may also be enhanced by different genetic factors, including HLA in patients from different regions of the world. Viral and human genetic factors influence pathogenesis. Viral genetic factors are proteins such as Tat, Vif and Rev. Human genetic factors associated with a better clinical outcome are proteins such as APOBEC, langerin, tetherin and chemokine receptor 5 (CCR5) and HLA B27, B57, DRB1{*}1303, KIR and PARD3B. Copyright (C) 2012 S. Karger AG, Base

    Natural selection in the evolution of SARS-CoV-2 in bats created a generalist virus and highly capable human pathogen

    Get PDF
    Virus host shifts are generally associated with novel adaptations to exploit the cells of the new host species optimally. Surprisingly, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has apparently required little to no significant adaptation to humans since the start of the Coronavirus Disease 2019 (COVID-19) pandemic and to October 2020. Here we assess the types of natural selection taking place in Sarbecoviruses in horseshoe bats versus the early SARS-CoV-2 evolution in humans. While there is moderate evidence of diversifying positive selection in SARS-CoV-2 in humans, it is limited to the early phase of the pandemic, and purifying selection is much weaker in SARS-CoV-2 than in related bat Sarbecoviruses. In contrast, our analysis detects evidence for significant positive episodic diversifying selection acting at the base of the bat virus lineage SARS-CoV-2 emerged from, accompanied by an adaptive depletion in CpG composition presumed to be linked to the action of antiviral mechanisms in these ancestral bat hosts. The closest bat virus to SARS-CoV-2, RmYN02 (sharing an ancestor about 1976), is a recombinant with a structure that includes differential CpG content in Spike; clear evidence of coinfection and evolution in bats without involvement of other species. While an undiscovered “facilitating” intermediate species cannot be discounted, collectively, our results support the progenitor of SARS-CoV-2 being capable of efficient human–human transmission as a consequence of its adaptive evolutionary history in bats, not humans, which created a relatively generalist virus

    Phylogeography of Japanese encephalitis virus:genotype is associated with climate

    Get PDF
    The circulation of vector-borne zoonotic viruses is largely determined by the overlap in the geographical distributions of virus-competent vectors and reservoir hosts. What is less clear are the factors influencing the distribution of virus-specific lineages. Japanese encephalitis virus (JEV) is the most important etiologic agent of epidemic encephalitis worldwide, and is primarily maintained between vertebrate reservoir hosts (avian and swine) and culicine mosquitoes. There are five genotypes of JEV: GI-V. In recent years, GI has displaced GIII as the dominant JEV genotype and GV has re-emerged after almost 60 years of undetected virus circulation. JEV is found throughout most of Asia, extending from maritime Siberia in the north to Australia in the south, and as far as Pakistan to the west and Saipan to the east. Transmission of JEV in temperate zones is epidemic with the majority of cases occurring in summer months, while transmission in tropical zones is endemic and occurs year-round at lower rates. To test the hypothesis that viruses circulating in these two geographical zones are genetically distinct, we applied Bayesian phylogeographic, categorical data analysis and phylogeny-trait association test techniques to the largest JEV dataset compiled to date, representing the envelope (E) gene of 487 isolates collected from 12 countries over 75 years. We demonstrated that GIII and the recently emerged GI-b are temperate genotypes likely maintained year-round in northern latitudes, while GI-a and GII are tropical genotypes likely maintained primarily through mosquito-avian and mosquito-swine transmission cycles. This study represents a new paradigm directly linking viral molecular evolution and climate

    Model Averaging and Bayes Factor Calculation of Relaxed Molecular Clocks in Bayesian Phylogenetics

    Get PDF
    We describe a procedure for model averaging of relaxed molecular clock models in Bayesian phylogenetics. Our approach allows us to model the distribution of rates of substitution across branches, averaged over a set of models, rather than conditioned on a single model. We implement this procedure and test it on simulated data to show that our method can accurately recover the true underlying distribution of rates. We applied the method to a set of alignments taken from a data set of 12 mammalian species and uncovered evidence that lognormally distributed rates better describe this data set than do exponentially distributed rates. Additionally, our implementation of model averaging permits accurate calculation of the Bayes factor(s) between two or more relaxed molecular clock models. Finally, we introduce a new computational approach for sampling rates of substitution across branches that improves the convergence of our Markov chain Monte Carlo algorithms in this context. Our methods are implemented under the BEAST 1.6 software package, available at http://beast-mcmc.googlecode.com
    corecore