134 research outputs found

    The evolutionary dynamics of norovirus

    Full text link
    Norovirus (NoV) is the leading cause of both outbreak and sporadic community acquired acute gastroenteritis. The overall incidence of NoV infection has grown dramatically since the emergence of epidemic NoV strains of the GII.4 lineage in the mid-1990s that have been associated with five pandemics and account for 80% of NoV infections. This thesis aimed to describe the mechanisms of evolution that facilitate the emergence of epidemic GII.4 variants and to elucidate factors that contribute to their higher epidemiological fitness. Two molecular epidemiological studies were performed to characterise the NoV strains linked to epidemics in New South Wales, Australia and those in circulation globally between 2007 and 2010. The pandemic GII.4 variant 2006b was identified as the cause of the 2007 and 2008 epidemics and the GII.4 variant New Orleans 2010 was the aetiological agent of the epidemics of 2009 and 2010. Each variant demonstrated antigenic drift in the capsid P2 domain that likely contributed to their epidemic potential. These studies also highlighted the role that recombination played in the emergence of New Orleans 2010. A number of factors were identified that may have contributed to the higher epidemiological fitness of the pandemic NoV GII.4 variants. Firstly, by comparing the enzymatic properties of different NoV polymerases, including replication efficiency and fidelity, it was shown that GII.4 variants have higher replication and mutation rates. It was also shown that polymerases from more prevalent genotypes, such as GII.4 and GII.b, are phosphorylated by an important cellular kinase, Akt, at a residue (Thr33) that decreases the de novo polymerase activity. Using next-generation sequencing technology, patterns of intra-host evolution were compared between acute and chronic NoV infections. Extensive heterogeneity and toggling at antigenic sites of the viral capsid was observed in the chronic patient, which suggests that immune-compromised individuals with chronic NoV infections could be a source for novel antigenic variants. In the same study, a transmission cluster was also examined and a strong genetic bottleneck was identified at the point of transmission. Overall, this thesis suggests that a complex pattern of mutation, recombination and adaptation drive NoV evolution in response to herd immunity

    A divergent articulavirus in an Australian gecko identified using meta-transcriptomics and protein structure comparisons

    Get PDF
    The discovery of highly divergent RNA viruses is compromised by their limited sequence similarity to known viruses. Evolutionary information obtained from protein structural modelling offers a powerful approach to detect distantly related viruses based on the conservation of tertiary structures in key proteins such as the RNA-dependent RNA polymerase (RdRp). We utilised a template-based approach for protein structure prediction from amino acid sequences to identify distant evolutionary relationships among viruses detected in meta-transcriptomic sequencing data from Australian wildlife. The best predicted protein structural model was compared with the results of similarity searches against protein databases. Using this combination of meta-transcriptomics and protein structure prediction we identified the RdRp (PB1) gene segment of a divergent negative-sense RNA virus, denoted Lauta virus (LTAV), in a native Australian gecko (Gehyra lauta). The presence of this virus was confirmed by PCR and Sanger sequencing. Phylogenetic analysis revealed that Lauta virus likely represents a newly described genus within the family Amnoonviridae, order Articulavirales, that is most closely related to the fish virus Tilapia tilapinevirus (TiLV). These findings provide important insights into the evolution of negative-sense RNA viruses and structural conservation of the viral replicase among members of the order Articulavirales.This research was funded by the Australian Research Council, grant number FL170100022

    The elusive source of HIV-1 rebound after treatment interruption

    Get PDF
    Identifying the source of viral rebound during a monitored analytical treatment interruption (ATI) would reveal potential targets for cure strategies. Therefore, we examined the genetic composition of proviral DNA in different subsets from participants on antiretroviral therapy and compared this to rebounding virus after an ATI. Eleven participants underwent a monitored ATI and were sampled from different anatomical sites prior to and after the ATI. From the peripheral blood, NaĂŻve (TNA), central (TCM), transitional (TTM) and effector (TEM) memory CD4+ T cells were sorted as were CD45 cells from gut-associated lymphoid tissue (GALT). Using single-genome sequencing (SGS) the env region of HIV DNA and plasma-derived RNA was sequenced. In an ongoing study, Full-Length Individual Proviral Sequencing (FLIPS) and Integration Site Loop Amplification (ISLA) assays were performed on the T cell subsets from 2 participants. For participant STAR10, 87 integration sites (IS) and 113 proviral genomes were sequenced while only 3 unique intact proviruses (3%) were identified. A cluster of 17 identical defective proviruses were linked to an IS (9% of all IS) in STAT5B located in TCM, TNA, TEM and TTM. When comparing the FLIPS to SGS env sequences a 100% match was found between one defective provirus and one plasma HIV RNA sequence after rebound. For participant STAR11, 37 IS and 105 proviral genomes were sequenced yielding 14 intact proviruses (13%) with the highest proportion found predominantly in the TEM subset (n=13, 45%). Four different clusters of identical sequences could be identified of which 2 (n=3 and n=9) consisted of intact TEM sequences with the smaller cluster linked to an IS in ZNF274. A 99% match between 2 env from rebounding plasma RNA and this smaller cluster of intact proviral genomes was identified. Comparing proviral sequences and their IS to plasma-derived RNA sequences after an ATI reveals additional information in terms of the source of viral rebound. However, this comparison is complicated by multiple factors. For example, we found a plasma-derived RNA sequence obtained during viral rebound matched a defective proviral sequence which highlights the problem of using one HIV RNA subgenomic region for identifying replication-competent virus. In addition, ongoing viral replication during rebound may prevent a 100% match with genetically intact proviral sequences making it challenging to determine the absolute source of rebound

    Cross-validation to select Bayesian hierarchical models in phylogenetics.

    Get PDF
    BACKGROUND: Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. RESULTS: We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. CONCLUSIONS: Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult

    Large-scale GWAS reveals insights into the genetic architecture of same-sex sexual behavior.

    Get PDF
    Twin and family studies have shown that same-sex sexual behavior is partly genetically influenced, but previous searches for specific genes involved have been underpowered. We performed a genome-wide association study (GWAS) on 477,522 individuals, revealing five loci significantly associated with same-sex sexual behavior. In aggregate, all tested genetic variants accounted for 8 to 25% of variation in same-sex sexual behavior, only partially overlapped between males and females, and do not allow meaningful prediction of an individual's sexual behavior. Comparing these GWAS results with those for the proportion of same-sex to total number of sexual partners among nonheterosexuals suggests that there is no single continuum from opposite-sex to same-sex sexual behavior. Overall, our findings provide insights into the genetics underlying same-sex sexual behavior and underscore the complexity of sexuality

    Rapid Evolution of Pandemic Noroviruses of the GII.4 Lineage

    Get PDF
    Over the last fifteen years there have been five pandemics of norovirus (NoV) associated gastroenteritis, and the period of stasis between each pandemic has been progressively shortening. NoV is classified into five genogroups, which can be further classified into 25 or more different human NoV genotypes; however, only one, genogroup II genotype 4 (GII.4), is associated with pandemics. Hence, GII.4 viruses have both a higher frequency in the host population and greater epidemiological fitness. The aim of this study was to investigate if the accuracy and rate of replication are contributing to the increased epidemiological fitness of the GII.4 strains. The replication and mutation rates were determined using in vitro RNA dependent RNA polymerase (RdRp) assays, and rates of evolution were determined by bioinformatics. GII.4 strains were compared to the second most reported genotype, recombinant GII.b/GII.3, the rarely detected GII.3 and GII.7 and as a control, hepatitis C virus (HCV). The predominant GII.4 strains had a higher mutation rate and rate of evolution compared to the less frequently detected GII.b, GII.3 and GII.7 strains. Furthermore, the GII.4 lineage had on average a 1.7-fold higher rate of evolution within the capsid sequence and a greater number of non-synonymous changes compared to other NoVs, supporting the theory that it is undergoing antigenic drift at a faster rate. Interestingly, the non-synonymous mutations for all three NoV genotypes were localised to common structural residues in the capsid, indicating that these sites are likely to be under immune selection. This study supports the hypothesis that the ability of the virus to generate genetic diversity is vital for viral fitness

    Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation

    Get PDF
    Atrial fibrillation affects more than 33 million people worldwide and increases the risk of stroke, heart failure, and death. Fourteen genetic loci have been associated with atrial fibrillation in European and Asian ancestry groups. To further define the genetic basis of atrial fibrillation, we performed large-scale, trans-ancestry meta-analyses of common and rare variant association studies. The genome-wide association studies (GWAS) included 17,931 individuals with atrial fibrillation and 115,142 referents; the exome-wide association studies (ExWAS) and rare variant association studies (RVAS) involved 22,346 cases and 132,086 referents. We identified 12 new genetic loci that exceeded genome-wide significance, implicating genes involved in cardiac electrical and structural remodeling. Our results nearly double the number of known genetic loci for atrial fibrillation, provide insights into the molecular basis of atrial fibrillation, and may facilitate the identification of new potential targets for drug discovery

    New insights into the genetic etiology of Alzheimer's disease and related dementias

    Get PDF
    Characterization of the genetic landscape of Alzheimer's disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/'proxy' AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE Δ4 allele
    • 

    corecore