90 research outputs found

    Inference of evolutionary and ecological processes from reticulate evolution in RNA viruses

    Get PDF
    RNA viruses have the fastest evolutionary rates amongst protein-coding organisms on the planet. Ease of sequencing, advanced techniques of analysis and global health and economic concerns have all contributed to the recognition of RNA viruses as a robust research platform. Phylogenetic methods have been at the forefront of analytical techniques used to understand the dynamics of RNA viruses - during natural circulation in populations and in individual hosts, within epidemics, across species barriers and over billions of years that viruses have been around. Most of the work presented in this thesis employs phylogenetic incongruity arising from reassortment and recombination to gain insights into the genomes and populations of RNA viruses. Chapter 2 explores the selection regimes Ebola virus has experienced following a year of circulation in humans inWest Africa, as well as its recent history. Chapter 3 investigates the extent of recombination in MERS-CoV, a novel human pathogen with an obscure epidemiology, which is suggestive of frequent co-infection of some hosts. Chapter 4, on the other hand, documents a pattern of non-intuitive linkage between some segments of the human-endemic influenza B virus genome and explores its potential to speciate. Chapter 5 builds upon chapter 4 and attempts to describe small-scale reassortment between two segments of influenza B virus and the overall migration patterns of influenza B virus in Scotland. Chapter 6 exploits the independence of segments of influenza D virus, a recently described cattle pathogen, and coalescent theory to disentangle the origins of this virus. This thesis exemplifies the success of modern sequencing methods, which, together with the use of sophisticated analytical techniques, have uncovered a wealth of information hidden away in molecular sequences of RNA viruses. The work presented herein demonstrates how reticulate evolution can be exploited as a reliable, and sometimes indispensable, marker to improve inference of evolutionary forces in RNA viruses

    Fitting stochastic epidemic models to gene genealogies using linear noise approximation

    Get PDF
    Phylodynamics is a set of population genetics tools that aim at reconstructing demographic history of a population based on molecular sequences of individuals sampled from the population of interest. One important task in phylodynamics is to estimate changes in (effective) population size. When applied to infectious disease sequences such estimation of population size trajectories can provide information about changes in the number of infections. To model changes in the number of infected individuals, current phylodynamic methods use non-parametric approaches, parametric approaches, and stochastic modeling in conjunction with likelihood-free Bayesian methods. The first class of methods yields results that are hard-to-interpret epidemiologically. The second class of methods provides estimates of important epidemiological parameters, such as infection and removal/recovery rates, but ignores variation in the dynamics of infectious disease spread. The third class of methods is the most advantageous statistically, but relies on computationally intensive particle filtering techniques that limits its applications. We propose a Bayesian model that combines phylodynamic inference and stochastic epidemic models, and achieves computational tractability by using a linear noise approximation (LNA) --- a technique that allows us to approximate probability densities of stochastic epidemic model trajectories. LNA opens the door for using modern Markov chain Monte Carlo tools to approximate the joint posterior distribution of the disease transmission parameters and of high dimensional vectors describing unobserved changes in the stochastic epidemic model compartment sizes (e.g., numbers of infectious and susceptible individuals). We apply our estimation technique to Ebola genealogies estimated using viral genetic data from the 2014 epidemic in Sierra Leone and Liberia.Comment: 43 pages, 6 figures in the main tex

    Phylogenetic Analysis of Guinea 2014 EBOV Ebolavirus Outbreak

    Get PDF
    Members of the genus Ebolavirus have caused outbreaks of haemorrhagic fever in humans in Africa. The most recent outbreak in Guinea, which began in February of 2014, is still ongoing. Recently published analyses of sequences from this outbreak suggest that the outbreak in Guinea is caused by a divergent lineage of Zaire ebolavirus. We report evidence that points to the same Zaire ebolavirus lineage that has previously caused outbreaks in the Democratic Republic of Congo, the Republic of Congo and Gabon as the culprit behind the outbreak in Guinea

    Reassortment between Influenza B Lineages and the Emergence of a Coadapted PB1-PB2-HA Gene Complex

    Get PDF
    Influenza B viruses make a considerable contribution to morbidity attributed to seasonal influenza. Currently circulating influenza B isolates are known to belong to two antigenically distinct lineages referred to as B/Victoria and B/Yamagata. Frequent exchange of genomic segments of these two lineages has been noted in the past, but the observed patterns of reassortment have not been formalized in detail. We investigate interlineage reassortments by comparing phylogenetic trees across genomic segments. Our analyses indicate that of the eight segments of influenza B viruses only segments coding for polymerase basic 1 and 2 (PB1 and PB2) and hemagglutinin (HA) proteins have maintained separate Victoria and Yamagata lineages and that currently circulating strains possess PB1, PB2, and HA segments derived entirely from one or the other lineage; other segments have repeatedly reassorted between lineages thereby reducing genetic diversity. We argue that this difference between segments is due to selection against reassortant viruses with mixed-lineage PB1, PB2, and HA segments. Given sufficient time and continued recruitment to the reassortment-isolated PB1–PB2–HA gene complex, we expect influenza B viruses to eventually undergo sympatric speciation

    Global disparities in SARS-CoV-2 genomic surveillance

    Full text link
    Genomic sequencing is essential to track the evolution and spread of SARS-CoV-2, optimize molecular tests, treatments, vaccines, and guide public health responses. To investigate the global SARS-CoV-2 genomic surveillance, we used sequences shared via GISAID to estimate the impact of sequencing intensity and turnaround times on variant detection in 189 countries. In the first two years of the pandemic, 78% of high-income countries sequenced >0.5% of their COVID-19 cases, while 42% of low- and middle-income countries reached that mark. Around 25% of the genomes from high income countries were submitted within 21 days, a pattern observed in 5% of the genomes from low- and middle-income countries. We found that sequencing around 0.5% of the cases, with a turnaround time <21 days, could provide a benchmark for SARS-CoV-2 genomic surveillance. Socioeconomic inequalities undermine the global pandemic preparedness, and efforts must be made to support low- and middle-income countries improve their local sequencing capacity

    Conserved ancestral tropical niche but different continental histories explain the latitudinal diversity gradient in brush-footed butterflies.

    Get PDF
    The global increase in species richness toward the tropics across continents and taxonomic groups, referred to as the latitudinal diversity gradient, stimulated the formulation of many hypotheses to explain the underlying mechanisms of this pattern. We evaluate several of these hypotheses to explain spatial diversity patterns in a butterfly family, the Nymphalidae, by assessing the contributions of speciation, extinction, and dispersal, and also the extent to which these processes differ among regions at the same latitude. We generate a time-calibrated phylogeny containing 2,866 nymphalid species (~45% of extant diversity). Neither speciation nor extinction rate variations consistently explain the latitudinal diversity gradient among regions because temporal diversification dynamics differ greatly across longitude. The Neotropical diversity results from low extinction rates, not high speciation rates, and biotic interchanges with other regions are rare. Southeast Asia is also characterized by a low speciation rate but, unlike the Neotropics, is the main source of dispersal events through time. Our results suggest that global climate change throughout the Cenozoic, combined with tropical niche conservatism, played a major role in generating the modern latitudinal diversity gradient of nymphalid butterflies

    MERS coronaviruses from camels in Africa exhibit region-dependent genetic diversity

    Get PDF
    International audienceMiddle East respiratory syndrome coronavirus (MERS-CoV) causes a zoonotic respiratory disease of global public health concern, and dromedary camels are the only proven source of zoonotic infection. Although MERS-CoV infection is ubiquitous in dromedaries across Africa as well as in the Arabian Peninsula, zoonotic disease appears confined to the Arabian Peninsula. MERS-CoVs from Africa have hitherto been poorly studied. We genetically and phenotypically characterized MERS-CoV from dromedaries sampled in Morocco, Burkina Faso, Nigeria, and Ethiopia. Viruses from Africa (clade C) are phylogenetically distinct from contemporary viruses from the Arabian Peninsula (clades A and B) but remain antigenically similar in microneutralization tests. Viruses from West (Nigeria, Burkina Faso) and North (Morocco) Africa form a subclade, C1, that shares clade-defining genetic signatures including deletions in the accessory gene ORF4b. Compared with human and camel MERS-CoV from Saudi Arabia, virus isolates from Burkina Faso (BF785) and Nigeria (Nig1657) had lower virus replication competence in Calu-3 cells and in ex vivo cultures of human bronchus and lung. BF785 replicated to lower titer in lungs of human DPP4-transduced mice. A reverse genetics-derived recombinant MERS-CoV (EMC) lacking ORF4b elicited higher type I and III IFN responses than the isogenic EMC virus in Calu-3 cells. However, ORF4b deletions may not be the major determinant of the reduced replication competence of BF785 and Nig1657. Genetic and phenotypic differences in West African viruses may be relevant to zoonotic potential. There is an urgent need for studies of MERS-CoV at the animal-human interface

    Genome-wide evolutionary dynamics of influenza B viruses on a global scale

    Get PDF
    The global-scale epidemiology and genome-wide evolutionary dynamics of influenza B remain poorly understood compared with influenza A viruses. We compiled a spatio-temporally comprehensive dataset of influenza B viruses, comprising over 2,500 genomes sampled worldwide between 1987 and 2015, including 382 newly-sequenced genomes that fill substantial gaps in previous molecular surveillance studies. Our contributed data increase the number of available influenza B virus genomes in Europe, Africa and Central Asia, improving the global context to study influenza B viruses. We reveal Yamagata-lineage diversity results from co-circulation of two antigenically-distinct groups that also segregate genetically across the entire genome, without evidence of intra-lineage reassortment. In contrast, Victoria-lineage diversity stems from geographic segregation of different genetic clades, with variability in the degree of geographic spread among clades. Differences between the lineages are reflected in their antigenic dynamics, as Yamagata-lineage viruses show alternating dominance between antigenic groups, while Victoria-lineage viruses show antigenic drift of a single lineage. Structural mapping of amino acid substitutions on trunk branches of influenza B gene phylogenies further supports these antigenic differences and highlights two potential mechanisms of adaptation for polymerase activity. Our study provides new insights into the epidemiological and molecular processes shaping influenza B virus evolution globally

    Global disparities in SARS-CoV-2 genomic surveillance

    Get PDF
    Genomic sequencing is essential to track the evolution and spread of SARS-CoV-2, optimize molecular tests, treatments, vaccines, and guide public health responses. To investigate the global SARS-CoV-2 genomic surveillance, we used sequences shared via GISAID to estimate the impact of sequencing intensity and turnaround times on variant detection in 189 countries. In the first two years of the pandemic, 78% of high-income countries sequenced >0.5% of their COVID-19 cases, while 42% of low- and middle-income countries reached that mark. Around 25% of the genomes from high income countries were submitted within 21 days, a pattern observed in 5% of the genomes from low- and middle-income countries. We found that sequencing around 0.5% of the cases, with a turnaround time <21 days, could provide a benchmark for SARS-CoV-2 genomic surveillance. Socioeconomic inequalities undermine the global pandemic preparedness, and efforts must be made to support low- and middle-income countries improve their local sequencing capacity
    • 

    corecore