25 research outputs found

    Inferring Transmission Bottleneck Size from Viral Sequence Data Using a Novel Haplotype Reconstruction Method

    Get PDF
    The transmission bottleneck is defined as the number of viral particles that transmit from one host to establish an infection in another. Genome sequence data have been used to evaluate the size of the transmission bottleneck between humans infected with the influenza virus; however, the methods used to make these estimates have some limitations. Specifically, viral allele frequencies, which form the basis of many calculations, may not fully capture a process which involves the transmission of entire viral genomes. Here, we set out a novel approach for inferring viral transmission bottlenecks; our method combines an algorithm for haplotype reconstruction with maximum likelihood methods for bottleneck inference. This approach allows for rapid calculation and performs well when applied to data from simulated transmission events; errors in the haplotype reconstruction step did not adversely affect inferences of the population bottleneck. Applied to data from a previous household transmission study of influenza A infection, we confirm the result that the majority of transmission events involve a small number of viruses, albeit with slightly looser bottlenecks being inferred, with between 1 and 13 particles transmitted in the majority of cases. While influenza A transmission involves a tight population bottleneck, the bottleneck is not so tight as to universally prevent the transmission of within-host viral diversity. IMPORTANCE Viral populations undergo a repeated cycle of within-host growth followed by transmission. Viral evolution is affected by each stage of this cycle. The number of viral particles transmitted from one host to another, known as the transmission bottleneck, is an important factor in determining how the evolutionary dynamics of the population play out, restricting the extent to which the evolved diversity of the population can be passed from one host to another. Previous study of viral sequence data has suggested that the transmission bottleneck size for influenza A transmission between human hosts is small. Reevaluating these data using a novel and improved method, we largely confirm this result, albeit that we infer a slightly higher bottleneck size in some cases, of between 1 and 13 virions. While a tight bottleneck operates in human influenza transmission, it is not extreme in nature; some diversity can be meaningfully retained between hosts.Peer reviewe

    Estimating SARS-CoV-2 variant fitness and the impact of interventions in England using statistical and geo-spatial agent-based models

    Get PDF
    The SARS-CoV-2 epidemic has been extended by the evolution of more transmissible viral variants. In autumn 2020, the B.1.177 lineage became the dominant variant in England, before being replaced by the B.1.1.7 (Alpha) lineage in late 2020, with the sweep occurring at different times in each region. This period coincided with a large number of non-pharmaceutical interventions (e.g. lockdowns) to control the epidemic, making it difficult to estimate the relative transmissibility of variants. In this paper, we model the spatial spread of these variants in England using a meta-population agent-based model which correctly characterizes the regional variation in cases and distribution of variants. As a test of robustness, we additionally estimated the relative transmissibility of multiple variants using a statistical model based on the renewal equation, which simultaneously estimates the effective reproduction number R. Relative to earlier variants, the transmissibility of B.1.177 is estimated to have increased by 1.14 (1.12–1.16) and that of Alpha by 1.71 (1.65–1.77). The vaccination programme starting in December 2020 is also modelled. Counterfactual simulations demonstrate that the vaccination programme was essential for reopening in March 2021, and that if the January lockdown had started one month earlier, up to 30 k (24 k–38 k) deaths could have been prevented

    Revisiting the origins of the Sobemovirus genus: A case for ancient origins of plant viruses

    Get PDF
    The discrepancy between short- and long-term rate estimates, known as the time-dependent rate phenomenon (TDRP), poses a challenge to extrapolating evolutionary rates over time and reconstructing evolutionary history of viruses. The TDRP reveals a decline in evolutionary rate estimates with the measurement timescale, explained empirically by a power-law rate decay, notably observed in animal and human viruses. A mechanistic evolutionary model, the Prisoner of War (PoW) model, has been proposed to address TDRP in viruses. Although TDRP has been studied in animal viruses, its impact on plant virus evolutionary history remains largely unexplored. Here, we investigated the consequences of TDRP in plant viruses by applying the PoW model to reconstruct the evolutionary history of sobemoviruses, plant pathogens with significant importance due to their impact on agriculture and plant health. Our analysis showed that the Sobemovirus genus dates back over four million years, indicating an ancient origin. We found evidence that supports deep host jumps to Poaceae, Fabaceae, and Solanaceae occurring between tens to hundreds of thousand years ago, followed by specialization. Remarkably, the TDRP-corrected evolutionary history of sobemoviruses was extended far beyond previous estimates that had suggested their emergence nearly 9,000 years ago, a time coinciding with the Neolithic period in the Near East. By incorporating sequences collected through metagenomic analyses, the resulting phylogenetic tree showcases increased genetic diversity, reflecting a deep history of sobemovirus species. We identified major radiation events beginning between 4,600 to 2,000 years ago, which aligns with the Neolithic period in various regions, suggesting a period of rapid diversification from then to the present. Our findings make a case for the possibility of deep evolutionary origins of plant viruses

    Prevalence of persistent SARS-CoV-2 in a large community surveillance study

    Get PDF
    Persistent SARS-CoV-2 infections may act as viral reservoirs that could seed future outbreaks 1–5, give rise to highly divergent lineages 6–8, and contribute to cases with post-acute COVID-19 sequelae (Long Covid) 9,10. However, the population prevalence of persistent infections, their viral load kinetics, and evolutionary dynamics over the course of infections remain largely unknown. Using viral sequence data collected as part of a national infection survey, we identified 381 individuals with SARS-CoV-2 RNA at high titre persisting for at least 30 days, of which 54 had viral RNA persisting at least 60 days. We refer to these as ‘persistent infections’ since available evidence suggests they represent ongoing viral replication, although the persistence of non-replicating RNA cannot be ruled out in all. Persistently infected individuals had more than 50% higher odds of self-reporting Long Covid compared to non-persistently infected individuals. We estimate that 0.1- 0.5% of infections may become persistent with typically rebounding high viral loads and last for at least 60 days. In some individuals, we identified many viral amino acid substitutions, indicating periods of strong positive selection, while others had no consensus change in the sequences for prolonged periods, consistent with weak selection. Substitutions included mutations that are lineage-defining for SARS-CoV-2 variants, at target sites for monoclonal antibodies, and/or commonly found in immunocompromised patients 11–14. This work has significant implications for understanding and characterising SARS-CoV-2 infection, epidemiology, and evolution

    Estimating SARS-CoV-2 variant fitness and the impact of interventions in England using statistical and geo-spatial agent-based models

    Get PDF
    The SARS-CoV-2 epidemic has been extended by the evolution of more transmissible viral variants. In autumn 2020, the B.1.177 lineage became the dominant variant in England, before being replaced by the B.1.1.7 (Alpha) lineage in late 2020, with the sweep occurring at different times in each region. This period coincided with a large number of non-pharmaceutical interventions (e.g. lockdowns) to control the epidemic, making it difficult to estimate the relative transmissibility of variants. In this paper, we model the spatial spread of these variants in England using a meta-population agent-based model which correctly characterizes the regional variation in cases and distribution of variants. As a test of robustness, we additionally estimated the relative transmissibility of multiple variants using a statistical model based on the renewal equation, which simultaneously estimates the effective reproduction number R. Relative to earlier variants, the transmissibility of B.1.177 is estimated to have increased by 1.14 (1.12-1.16) and that of Alpha by 1.71 (1.65-1.77). The vaccination programme starting in December 2020 is also modelled. Counterfactual simulations demonstrate that the vaccination programme was essential for reopening in March 2021, and that if the January lockdown had started one month earlier, up to 30 k (24 k-38 k) deaths could have been prevented. This article is part of the theme issue 'Technical challenges of modelling real-life epidemics and examples of overcoming these'

    Luria–Delbrück, revisited: the classic experiment does not rule out Lamarckian evolution

    No full text
    We re-examined data from the classic Luria–Delbrück fluctuation experiment, which is often credited with establishing a Darwinian basis for evolution. We argue that, for the Lamarckian model of evolution to be ruled out by the experiment, the experiment must favor pure Darwinian evolution over both the Lamarckian model and a model that allows both Darwinian and Lamarckian mechanisms (as would happen for bacteria with CRISPR-Cas immunity). Analysis of the combined model was not performed in the original 1943 paper. The Luria–Delbrück paper also did not consider the possibility of neither model fitting the experiment. Using Bayesian model selection, we find that the Luria–Delbrück experiment, indeed, favors the Darwinian evolution over purely Lamarckian. However, our analysis does not rule out the combined model, and hence cannot rule out Lamarckian contributions to the evolutionary dynamics

    The evolution of SARS-CoV-2

    No full text
    Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused millions of deaths and substantial morbidity worldwide. Intense scientific effort to understand the biology of SARS-CoV-2 has resulted in daunting numbers of genomic sequences. We witnessed evolutionary events that could mostly be inferred indirectly before, such as the emergence of variants with distinct phenotypes, for example transmissibility, severity and immune evasion. This Review explores the mechanisms that generate genetic variation in SARS-CoV-2, underlying the within-host and population-level processes that underpin these events. We examine the selective forces that likely drove the evolution of higher transmissibility and, in some cases, higher severity during the first year of the pandemic and the role of antigenic evolution during the second and third years, together with the implications of immune escape and reinfections, and the increasing evidence for and potential relevance of recombination. In order to understand how major lineages, such as variants of concern (VOCs), are generated, we contrast the evidence for the chronic infection model underlying the emergence of VOCs with the possibility of an animal reservoir playing a role in SARS-CoV-2 evolution, and conclude that the former is more likely. We evaluate uncertainties and outline scenarios for the possible future evolutionary trajectories of SARS-CoV-2

    Purifying Selection Determines the Short-Term Time Dependency of Evolutionary Rates in SARS-CoV-2 and pH1N1 Influenza

    No full text
    High-throughput sequencing enables rapid genome sequencing during infectious disease outbreaks and provides an opportunity to quantify the evolutionary dynamics of pathogens in near real-time. One difficulty of undertaking evolutionary analyses over short timescales is the dependency of the inferred evolutionary parameters on the timespan of observation. Crucially, there are an increasing number of molecular clock analyses using external evolutionary rate priors to infer evolutionary parameters. However, it is not clear which rate prior is appropriate for a given time window of observation due to the time-dependent nature of evolutionary rate estimates. Here, we characterize the molecular evolutionary dynamics of SARS-CoV-2 and 2009 pandemic H1N1 (pH1N1) influenza during the first 12 months of their respective pandemics. We use Bayesian phylogenetic methods to estimate the dates of emergence, evolutionary rates, and growth rates of SARS-CoV-2 and pH1N1 over time and investigate how varying sampling window and data set sizes affect the accuracy of parameter estimation. We further use a generalized McDonald–Kreitman test to estimate the number of segregating nonneutral sites over time. We find that the inferred evolutionary parameters for both pandemics are time dependent, and that the inferred rates of SARS-CoV-2 and pH1N1 decline by ∼50% and ∼100%, respectively, over the course of 1 year. After at least 4 months since the start of sequence sampling, inferred growth rates and emergence dates remain relatively stable and can be inferred reliably using a logistic growth coalescent model. We show that the time dependency of the mean substitution rate is due to elevated substitution rates at terminal branches which are 2–4 times higher than those of internal branches for both viruses. The elevated rate at terminal branches is strongly correlated with an increasing number of segregating nonneutral sites, demonstrating the role of purifying selection in generating the time dependency of evolutionary parameters during pandemics.ISSN:0737-4038ISSN:1537-171
    corecore