133 research outputs found

    Increased Breadth and Depth of Cytotoxic T Lymphocytes Responses against HIV-1-B Nef by Inclusion of Epitope Variant Sequences

    Get PDF
    Different vaccine approaches cope with HIV-1 diversity, ranging from centralized1–4 to variability-encompassing5–7 antigens. For all these strategies, a concern remains: how does HIV-1 diversity impact epitope recognition by the immune system? We studied the relationship between HIV-1 diversity and CD8+ T Lymphocytes (CTL) targeting of HIV-1 subtype B Nef using 944 peptides (10-mers overlapping by nine amino acids (AA)) that corresponded to consensus peptides and their most common variants in the HIV-1-B virus population. IFN-γ ELISpot assays were performed using freshly isolated PBMC from 26 HIV-1-infected persons. Three hundred and fifty peptides elicited a response in at least one individual. Individuals targeted a median of 7 discrete regions. Overall, 33% of responses were directed against viral variants but not elicited against consensus-based test peptides. However, there was no significant relationship between the frequency of a 10-mer in the viral population and either its frequency of recognition (Spearman's correlation coefficient ρ = 0.24) or the magnitude of the responses (ρ = 0.16). We found that peptides with a single mutation compared to the consensus were likely to be recognized (especially if the change was conservative) and to elicit responses of similar magnitude as the consensus peptide. Our results indicate that cross-reactivity between rare and frequent variants is likely to play a role in the expansion of CTL responses, and that maximizing antigenic diversity in a vaccine may increase the breadth and depth of CTL responses. However, since there are few obvious preferred pathways to virologic escape, the diversity that may be required to block all potential escape pathways may be too large for a realistic vaccine to accommodate. Furthermore, since peptides were not recognized based on their frequency in the population, it remains unclear by which mechanisms variability-inclusive antigens (i.e., constructs enriched with frequent variants) expand CTL recognition

    HIV-Specific Probabilistic Models of Protein Evolution

    Get PDF
    Comparative sequence analyses, including such fundamental bioinformatics techniques as similarity searching, sequence alignment and phylogenetic inference, have become a mainstay for researchers studying type 1 Human Immunodeficiency Virus (HIV-1) genome structure and evolution. Implicit in comparative analyses is an underlying model of evolution, and the chosen model can significantly affect the results. In general, evolutionary models describe the probabilities of replacing one amino acid character with another over a period of time. Most widely used evolutionary models for protein sequences have been derived from curated alignments of hundreds of proteins, usually based on mammalian genomes. It is unclear to what extent these empirical models are generalizable to a very different organism, such as HIV-1–the most extensively sequenced organism in existence. We developed a maximum likelihood model fitting procedure to a collection of HIV-1 alignments sampled from different viral genes, and inferred two empirical substitution models, suitable for describing between-and within-host evolution. Our procedure pools the information from multiple sequence alignments, and provided software implementation can be run efficiently in parallel on a computer cluster. We describe how the inferred substitution models can be used to generate scoring matrices suitable for alignment and similarity searches. Our models had a consistently superior fit relative to the best existing models and to parameter-rich data-driven models when benchmarked on independent HIV-1 alignments, demonstrating evolutionary biases in amino-acid substitution that are unique to HIV, and that are not captured by the existing models. The scoring matrices derived from the models showed a marked difference from common amino-acid scoring matrices. The use of an appropriate evolutionary model recovered a known viral transmission history, whereas a poorly chosen model introduced phylogenetic error. We argue that our model derivation procedure is immediately applicable to other organisms with extensive sequence data available, such as Hepatitis C and Influenza A viruses

    Nematode endoparasites do not codiversify with their stick insect hosts.

    Get PDF
    Host-parasite coevolution stems from reciprocal selection on host resistance and parasite infectivity, and can generate some of the strongest selective pressures known in nature. It is widely seen as a major driver of diversification, the most extreme case being parallel speciation in hosts and their associated parasites. Here, we report on endoparasitic nematodes, most likely members of the mermithid family, infecting different Timema stick insect species throughout California. The nematodes develop in the hemolymph of their insect host and kill it upon emergence, completely impeding host reproduction. Given the direct exposure of the endoparasites to the host's immune system in the hemolymph, and the consequences of infection on host fitness, we predicted that divergence among hosts may drive parallel divergence in the endoparasites. Our phylogenetic analyses suggested the presence of two differentiated endoparasite lineages. However, independently of whether the two lineages were considered separately or jointly, we found a complete lack of codivergence between the endoparasitic nematodes and their hosts in spite of extensive genetic variation among hosts and among parasites. Instead, there was strong isolation by distance among the endoparasitic nematodes, indicating that geography plays a more important role than host-related adaptations in driving parasite diversification in this system. The accumulating evidence for lack of codiversification between parasites and their hosts at macroevolutionary scales contrasts with the overwhelming evidence for coevolution within populations, and calls for studies linking micro- versus macroevolutionary dynamics in host-parasite interactions

    Non-Negative Matrix Factorization for Learning Alignment-Specific Models of Protein Evolution

    Get PDF
    Models of protein evolution currently come in two flavors: generalist and specialist. Generalist models (e.g. PAM, JTT, WAG) adopt a one-size-fits-all approach, where a single model is estimated from a number of different protein alignments. Specialist models (e.g. mtREV, rtREV, HIVbetween) can be estimated when a large quantity of data are available for a single organism or gene, and are intended for use on that organism or gene only. Unsurprisingly, specialist models outperform generalist models, but in most instances there simply are not enough data available to estimate them. We propose a method for estimating alignment-specific models of protein evolution in which the complexity of the model is adapted to suit the richness of the data. Our method uses non-negative matrix factorization (NNMF) to learn a set of basis matrices from a general dataset containing a large number of alignments of different proteins, thus capturing the dimensions of important variation. It then learns a set of weights that are specific to the organism or gene of interest and for which only a smaller dataset is available. Thus the alignment-specific model is obtained as a weighted sum of the basis matrices. Having been constrained to vary along only as many dimensions as the data justify, the model has far fewer parameters than would be required to estimate a specialist model. We show that our NNMF procedure produces models that outperform existing methods on all but one of 50 test alignments. The basis matrices we obtain confirm the expectation that amino acid properties tend to be conserved, and allow us to quantify, on specific alignments, how the strength of conservation varies across different properties. We also apply our new models to phylogeny inference and show that the resulting phylogenies are different from, and have improved likelihood over, those inferred under standard models

    CodonTest: Modeling Amino Acid Substitution Preferences in Coding Sequences

    Get PDF
    Codon models of evolution have facilitated the interpretation of selective forces operating on genomes. These models, however, assume a single rate of non-synonymous substitution irrespective of the nature of amino acids being exchanged. Recent developments have shown that models which allow for amino acid pairs to have independent rates of substitution offer improved fit over single rate models. However, these approaches have been limited by the necessity for large alignments in their estimation. An alternative approach is to assume that substitution rates between amino acid pairs can be subdivided into rate classes, dependent on the information content of the alignment. However, given the combinatorially large number of such models, an efficient model search strategy is needed. Here we develop a Genetic Algorithm (GA) method for the estimation of such models. A GA is used to assign amino acid substitution pairs to a series of rate classes, where is estimated from the alignment. Other parameters of the phylogenetic Markov model, including substitution rates, character frequencies and branch lengths are estimated using standard maximum likelihood optimization procedures. We apply the GA to empirical alignments and show improved model fit over existing models of codon evolution. Our results suggest that current models are poor approximations of protein evolution and thus gene and organism specific multi-rate models that incorporate amino acid substitution biases are preferred. We further anticipate that the clustering of amino acid substitution rates into classes will be biologically informative, such that genes with similar functions exhibit similar clustering, and hence this clustering will be useful for the evolutionary fingerprinting of genes

    Modeling HIV-1 Drug Resistance as Episodic Directional Selection

    Get PDF
    The evolution of substitutions conferring drug resistance to HIV-1 is both episodic, occurring when patients are on antiretroviral therapy, and strongly directional, with site-specific resistant residues increasing in frequency over time. While methods exist to detect episodic diversifying selection and continuous directional selection, no evolutionary model combining these two properties has been proposed. We present two models of episodic directional selection (MEDS and EDEPS) which allow the a priori specification of lineages expected to have undergone directional selection. The models infer the sites and target residues that were likely subject to directional selection, using either codon or protein sequences. Compared to its null model of episodic diversifying selection, MEDS provides a superior fit to most sites known to be involved in drug resistance, and neither one test for episodic diversifying selection nor another for constant directional selection are able to detect as many true positives as MEDS and EDEPS while maintaining acceptable levels of false positives. This suggests that episodic directional selection is a better description of the process driving the evolution of drug resistance

    Genome-Wide Interaction Analysis of Air Pollution Exposure and Childhood Asthma with Functional Follow-up

    Get PDF
    Rationale: The evidence supporting an association between traffic-related air pollution exposure and incident childhood asthma is inconsistent and may depend on genetic factors. Objectives: To identify gene–environment interaction effects on childhood asthma using genome-wide single-nucleotide polymorphism (SNP) data and air pollution exposure. Identified loci were further analyzed at epigenetic and transcriptomic levels. Methods: We used land use regression models to estimate individual air pollution exposure (represented by outdoor NO2 levels) at the birth address and performed a genome-wide interaction study for doctors’ diagnoses of asthma up to 8 years in three European birth cohorts (n = 1,534) with look-up for interaction in two separate North American cohorts, CHS (Children’s Health Study) and CAPPS/SAGE (Canadian Asthma Primary Prevention Study/Study of Asthma, Genetics and Environment) (n = 1,602 and 186 subjects, respectively). We assessed expression quantitative trait locus effects in human lung specimens and blood, as well as associations among air pollution exposure, methylation, and transcriptomic patterns. Measurements and Main Results: In the European cohorts, 186 SNPs had an interaction P < 1 × 10−4 and a look-up evaluation of these disclosed 8 SNPs in 4 loci, with an interaction P < 0.05 in the large CHS study, but not in CAPPS/SAGE. Three SNPs within adenylate cyclase 2 (ADCY2) showed the same direction of the interaction effect and were found to influence ADCY2 gene expression in peripheral blood (P = 4.50 × 10−4). One other SNP with P < 0.05 for interaction in CHS, rs686237, strongly influenced UDP-Gal:betaGlcNAc β-1,4-galactosyltransferase, polypeptide 5 (B4GALT5) expression in lung tissue (P = 1.18 × 10−17). Air pollution exposure was associated with differential discs, large homolog 2 (DLG2) methylation and expression. Conclusions: Our results indicated that gene–environment interactions are important for asthma development and provided supportive evidence for interaction with air pollution for ADCY2, B4GALT5, and DLG2

    Genetic Associations and Architecture of Asthma-COPD Overlap

    Get PDF
    BACKGROUND: Some people have characteristics of both asthma and COPD (asthma-COPD overlap), and evidence suggests they experience worse outcomes than those with either condition alone. RESEARCH QUESTION: What is the genetic architecture of asthma-COPD overlap, and do the determinants of risk for asthma-COPD overlap differ from those for COPD or asthma? STUDY DESIGN AND METHODS: We conducted a genome-wide association study in 8,068 asthma-COPD overlap case subjects and 40,360 control subjects without asthma or COPD of European ancestry in UK Biobank (stage 1). We followed up promising signals (P < 5 × 10-6) that remained associated in analyses comparing (1) asthma-COPD overlap vs asthma-only control subjects, and (2) asthma-COPD overlap vs COPD-only control subjects. These variants were analyzed in 12 independent cohorts (stage 2). RESULTS: We selected 31 independent variants for further investigation in stage 2, and discovered eight novel signals (P < 5 × 10-8) for asthma-COPD overlap (meta-analysis of stage 1 and 2 studies). These signals suggest a spectrum of shared genetic influences, some predominantly influencing asthma (FAM105A, GLB1, PHB, TSLP), others predominantly influencing fixed airflow obstruction (IL17RD, C5orf56, HLA-DQB1). One intergenic signal on chromosome 5 had not been previously associated with asthma, COPD, or lung function. Subgroup analyses suggested that associations at these eight signals were not driven by smoking or age at asthma diagnosis, and in phenome-wide scans, eosinophil counts, atopy, and asthma traits were prominent. INTERPRETATION: We identified eight signals for asthma-COPD overlap, which may represent loci that predispose to type 2 inflammation, and serious long-term consequences of asthma
    corecore