498 research outputs found

    jpHMM: Improving the reliability of recombination prediction in HIV-1

    Get PDF
    Previously, we developed jumping profile hidden Markov model (jpHMM), a new method to detect recombinations in HIV-1 genomes. The jpHMM predicts recombination breakpoints in a query sequence and assigns to each position of the sequence one of the major HIV-1 subtypes. Since incorrect subtype assignment or recombination prediction may lead to wrong conclusions in epidemiological or vaccine research, information about the reliability of the predicted parental subtypes and breakpoint positions is valuable. For this reason, we extended the output of jpHMM to include such information in terms of ‘uncertainty’ regions in the recombination prediction and an interval estimate of the breakpoint. Both types of information are computed based on the posterior probabilities of the subtypes at each query sequence position. Our results show that this extension strongly improves the reliability of the jpHMM recombination prediction. The jpHMM is available online at http://jphmm.gobics.de/

    HIV Types, Groups, Subtypes and Recombinant Forms: Errors in Replication, Selection Pressure and Quasispecies

    Get PDF
    HIV-1 is a chimpanzee virus which was transmitted to humans by several zoonotic events resulting in infection with HIV-1 groups M P, and in parallel transmission events from sooty mangabey monkey viruses leading to infections with HIV-2 groups A H. Both viruses have circulated in the human population for about 80 years. In the infected patient, HIV mutates, and by elimination of some of the viruses by the action of the immune system individual quasispecies are formed. Along with the selection of the fittest viruses, mutation and recombination after superinfection with HIV from different groups or subtypes have resulted in the diversity of their patterns of geographic distribution. Despite the high variability observed, some essential parts of the HIV genome are highly conserved. Viral diversity is further facilitated in some parts of the HIV genome by drug selection pressure and may also be enhanced by different genetic factors, including HLA in patients from different regions of the world. Viral and human genetic factors influence pathogenesis. Viral genetic factors are proteins such as Tat, Vif and Rev. Human genetic factors associated with a better clinical outcome are proteins such as APOBEC, langerin, tetherin and chemokine receptor 5 (CCR5) and HLA B27, B57, DRB1{*}1303, KIR and PARD3B. Copyright (C) 2012 S. Karger AG, Base

    Selection at a single locus leads to widespread expansion of toxoplasma gondii lineages that are virulent in mice

    Get PDF
    The determinants of virulence are rarely defined for eukaryotic parasites such as T. gondii, a widespread parasite of mammals that also infects humans, sometimes with serious consequences. Recent laboratory studies have established that variation in a single secreted protein, a serine/threonine kinase known as ROPO18, controls whether or not mice survive infection. Here, we establish the extent and nature of variation in ROP18among a collection of parasite strains from geographically diverse regions. Compared to other genes, ROP18 showed extremely high levels of diversification and changes in expression level, which correlated with severity of infection in mice. Comparison with an out-group demonstrated that changes in the upstream region that regulates expression of ROP18 led to an historical increase in the expression and exposed the protein to diversifying selective pressure. Surprisingly, only three atypically distinct protein variants exist despite marked genetic divergence elsewhere in the genome. These three forms of ROP18 are likely adaptations for different niches in nature, and they confer markedly different virulence to mice. The widespread distribution of a single mouse-virulent allele among geographically and genetically disparate parasites may have consequences for transmission and disease in other hosts, including humans

    T-Cell Epitope Prediction: Rescaling Can Mask Biological Variation between MHC Molecules

    Get PDF
    Theoretical methods for predicting CD8+ T-cell epitopes are an important tool in vaccine design and for enhancing our understanding of the cellular immune system. The most popular methods currently available produce binding affinity predictions across a range of MHC molecules. In comparing results between these MHC molecules, it is common practice to apply a normalization procedure known as rescaling, to correct for possible discrepancies between the allelic predictors. Using two of the most popular prediction software packages, NetCTL and NetMHC, we tested the hypothesis that rescaling removes genuine biological variation from the predicted affinities when comparing predictions across a number of MHC molecules. We found that removing the condition of rescaling improved the prediction software's performance both qualitatively, in terms of ranking epitopes, and quantitatively, in the accuracy of their binding affinity predictions. We suggest that there is biologically significant variation among class 1 MHC molecules and find that retention of this variation leads to significantly more accurate epitope prediction

    Validation of an NSP-based (negative selection pattern) gene family identification strategy

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene family identification from ESTs can be a valuable resource for analysis of genome evolution but presents unique challenges in organisms for which the entire genome is not yet sequenced. We have developed a novel gene family identification method based on negative selection patterns (NSP) between family members to screen EST-generated contigs. This strategy was tested on five known gene families in Arabidopsis to see if individual paralogs could be identified with accuracy from EST data alone when compared to the actual gene sequences in this fully sequenced genome.</p> <p>Results</p> <p>The NSP method uniquely identified family members in all the gene families tested. Two members of the FtsH gene family, three members each of the PAL, RF1, and ribosomal L6 gene families, and four members of the CAD gene family were correctly identified. Additionally all ESTs from the representative contigs when checked against MapViewer data successfully identify the gene locus predicted.</p> <p>Conclusion</p> <p>We demonstrate the effectiveness of the NSP strategy in identifying specific gene family members in Arabidopsis using only EST data and we describe how this strategy can be used to identify many gene families in agronomically important crop species where they are as yet undiscovered.</p

    On Evaluating MHC-II Binding Peptide Prediction Methods

    Get PDF
    Choice of one method over another for MHC-II binding peptide prediction is typically based on published reports of their estimated performance on standard benchmark datasets. We show that several standard benchmark datasets of unique peptides used in such studies contain a substantial number of peptides that share a high degree of sequence identity with one or more other peptide sequences in the same dataset. Thus, in a standard cross-validation setup, the test set and the training set are likely to contain sequences that share a high degree of sequence identity with each other, leading to overly optimistic estimates of performance. Hence, to more rigorously assess the relative performance of different prediction methods, we explore the use of similarity-reduced datasets. We introduce three similarity-reduced MHC-II benchmark datasets derived from MHCPEP, MHCBN, and IEDB databases. The results of our comparison of the performance of three MHC-II binding peptide prediction methods estimated using datasets of unique peptides with that obtained using their similarity-reduced counterparts shows that the former can be rather optimistic relative to the performance of the same methods on similarity-reduced counterparts of the same datasets. Furthermore, our results demonstrate that conclusions regarding the superiority of one method over another drawn on the basis of performance estimates obtained using commonly used datasets of unique peptides are often contradicted by the observed performance of the methods on the similarity-reduced versions of the same datasets. These results underscore the importance of using similarity-reduced datasets in rigorously comparing the performance of alternative MHC-II peptide prediction methods

    Impact of Clade, Geography, and Age of the Epidemic on HIV-1 Neutralization by Antibodies

    Get PDF
    Neutralizing antibodies (nAbs) are a high priority for vaccines that aim to prevent the acquisition of HIV-1 infection. Vaccine effectiveness will depend on the extent to which induced antibodies neutralize the global diversity of circulating HIV-1 variants. Using large panels of genetically and geographically diverse HIV-1 Env-pseudotyped viruses and chronic infection plasma samples, we unambiguously show that cross-clade nAb responses are commonly induced in response to infection by any virus clade. Nonetheless, neutralization was significantly greater when the plasma clade matched the clade of the virus being tested. This within-clade advantage was diminished in older, more-diverse epidemics in southern Africa, the United States, and Europe compared to more recent epidemics in Asia. It was most pronounced for circulating recombinant form (CRF) 07_BC, which is common in China and is the least-divergent lineage studied; this was followed by the slightly more diverse Asian CRF01_AE. We found no evidence that transmitted/founder viruses are generally more susceptible to neutralization and are therefore easier targets for vaccination than chronic viruses. Features of the gp120 V1V2 loop, in particular, length, net charge, and number of N-linked glycans, were associated with Env susceptibility and plasma neutralization potency in a manner consistent with neutralization escape being a force that drives viral diversification and plasma neutralization breadth. The overall susceptibility of Envs and potencies of plasma samples were highly predictive of the neutralization outcome of any single virus-plasma combination. These findings highlight important considerations for the design and testing of candidate HIV-1 vaccines that aim to elicit effective nAbs

    The Evolutionary Analysis of Emerging Low Frequency HIV-1 CXCR4 Using Variants through Time—An Ultra-Deep Approach

    Get PDF
    Large-scale parallel pyrosequencing produces unprecedented quantities of sequence data. However, when generated from viral populations current mapping software is inadequate for dealing with the high levels of variation present, resulting in the potential for biased data loss. In order to apply the 454 Life Sciences' pyrosequencing system to the study of viral populations, we have developed software for the processing of highly variable sequence data. Here we demonstrate our software by analyzing two temporally sampled HIV-1 intra-patient datasets from a clinical study of maraviroc. This drug binds the CCR5 coreceptor, thus preventing HIV-1 infection of the cell. The objective is to determine viral tropism (CCR5 versus CXCR4 usage) and track the evolution of minority CXCR4-using variants that may limit the response to a maraviroc-containing treatment regimen. Five time points (two prior to treatment) were available from each patient. We first quantify the effects of divergence on initial read k-mer mapping and demonstrate the importance of utilizing population-specific template sequences in relation to the analysis of next-generation sequence data. Then, in conjunction with coreceptor prediction algorithms that infer HIV tropism, our software was used to quantify the viral population structure pre- and post-treatment. In both cases, low frequency CXCR4-using variants (2.5–15%) were detected prior to treatment. Following phylogenetic inference, these variants were observed to exist as distinct lineages that were maintained through time. Our analysis, thus confirms the role of pre-existing CXCR4-using virus in the emergence of maraviroc-insensitive HIV. The software will have utility for the study of intra-host viral diversity and evolution of other fast evolving viruses, and is available from http://www.bioinf.manchester.ac.uk/segminator/
    corecore