318 research outputs found

    Aggregation of biological knowledge for immunological and virological applications

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH

    Conservation and Diversity of Influenza A H1N1 HLA-Restricted T Cell Epitope Candidates for Epitope-Based Vaccines

    Get PDF
    Background: The immune-related evolution of influenza viruses is exceedingly complex and current vaccines against influenza must be reformulated for each influenza season because of the high degree of antigenic drift among circulating influenza strains. Delay in vaccine production is a serious problem in responding to a pandemic situation, such as that of the current H1N1 strain. Immune escape is generally attributed to reduced antibody recognition of the viral hemagglutinin and neuraminidase proteins whose rate of mutation is much greater than that of the internal non-structural proteins. As a possible alternative, vaccines directed at T cell epitope domains of internal influenza proteins, that are less susceptible to antigenic variation, have been investigated. Methodology/Principal Findings: HLA transgenic mouse strains expressing HLA class I A*0201, A*2402, and B*0702, and class II DRB1*1501, DRB1*0301 and DRB1*0401 were immunized with 196 influenza H1N1 peptides that contained residues of highly conserved proteome sequences of the human H1N1, H3N2, H1N2, H5N1, and avian influenza A strains. Fifty-four (54) peptides that elicited 63 HLA-restricted peptide-specific T cell epitope responses were identified by IFN-γ ELISpot assay. The 54 peptides were compared to the 2007-2009 human H1N1 sequences for selection of sequences in the design of a new candidate H1N1 vaccine, specifically targeted to highly-conserved HLA-restricted T cell epitopes. Conclusions/Significance: Seventeen (17) T cell epitopes in PB1, PB2, and M1 were selected as vaccine targets based on sequence conservation over the past 30 years, high functional avidity, non-identity to human peptides, clustered localization, and promiscuity to multiple HLA alleles. These candidate vaccine antigen sequences may be applicable to any avian or human influenza A virus. © 2010 Tan et al

    Complete-Proteome Mapping of Human Influenza A Adaptive Mutations: Implications for Human Transmissibility of Zoonotic Strains

    Get PDF
    BACKGROUND: There is widespread concern that H5N1 avian influenza A viruses will emerge as a pandemic threat, if they become capable of human-to-human (H2H) transmission. Avian strains lack this capability, which suggests that it requires important adaptive mutations. We performed a large-scale comparative analysis of proteins from avian and human strains, to produce a catalogue of mutations associated with H2H transmissibility, and to detect their presence in avian isolates. METHODOLOGY/PRINCIPAL FINDINGS: We constructed a dataset of influenza A protein sequences from 92,343 public database records. Human and avian sequence subsets were compared, using a method based on mutual information, to identify characteristic sites where human isolates present conserved mutations. The resulting catalogue comprises 68 characteristic sites in eight internal proteins. Subtype variability prevented the identification of adaptive mutations in the hemagglutinin and neuraminidase proteins. The high number of sites in the ribonucleoprotein complex suggests interdependence between mutations in multiple proteins. Characteristic sites are often clustered within known functional regions, suggesting their functional roles in cellular processes. By isolating and concatenating characteristic site residues, we defined adaptation signatures, which summarize the adaptive potential of specific isolates. Most adaptive mutations emerged within three decades after the 1918 pandemic, and have remained remarkably stable thereafter. Two lineages with stable internal protein constellations have circulated among humans without reassorting. On the contrary, H5N1 avian and swine viruses reassort frequently, causing both gains and losses of adaptive mutations. CONCLUSIONS: Human host adaptation appears to be complex and systemic, involving nearly all influenza proteins. Adaptation signatures suggest that the ability of H5N1 strains to infect humans is related to the presence of an unusually high number of adaptive mutations. However, these mutations appear unstable, suggesting low pandemic potential of H5N1 in its current form. In addition, adaptation signatures indicate that pandemic H1N1/09 strain possesses multiple human-transmissibility mutations, though not an unusually high number with respect to swine strains that infected humans in the past. Adaptation signatures provide a novel tool for identifying zoonotic strains with the potential to infect humans

    Rule-based knowledge aggregation for large-scale protein sequence analysis of influenza A viruses

    Get PDF
    Background: The explosive growth of biological data provides opportunities for new statistical and comparative analyses of large information sets, such as alignments comprising tens of thousands of sequences. In such studies, sequence annotations frequently play an essential role, and reliable results depend on metadata quality. However, the semantic heterogeneity and annotation inconsistencies in biological databases greatly increase the complexity of aggregating and cleaning metadata. Manual curation of datasets, traditionally favoured by life scientists, is impractical for studies involving thousands of records. In this study, we investigate quality issues that affect major public databases, and quantify the effectiveness of an automated metadata extraction approach that combines structural and semantic rules. We applied this approach to more than 90,000 influenza A records, to annotate sequences with protein name, virus subtype, isolate, host, geographic origin, and year of isolation. Results: Over 40,000 annotated Influenza A protein sequences were collected by combining information from more than 90,000 documents from NCBI public databases. Metadata values were automatically extracted, aggregated and reconciled from several document fields by applying user-defined structural rules. For each property, values were recovered from ≥88.8% of records, with accuracy exceeding 96% in most cases. Because of semantic heterogeneity, each property required up to six different structural rules to be combined. Significant quality differences between databases were found: GenBank documents yield values more reliably than documents extracted from GenPept. Using a simple set of semantic rules and a reasoner, we reconstructed relationships between sequences from the same isolate, thus identifying 7640 isolates. Validation of isolate metadata against a simple ontology highlighted more than 400 inconsistencies, leading to over 3,000 property value corrections. Conclusion: To overcome the quality issues inherent in public databases, automated knowledge aggregation with embedded intelligence is needed for large-scale analyses. Our results show that user-controlled intuitive approaches, based on combination of simple rules, can reliably automate various curation tasks, reducing the need for manual corrections to approximately 5% of the records. Emerging semantic technologies possess desirable features to support today's knowledge aggregation tasks, with a potential to bring immediate benefits to this field

    Conservation and Variability of West Nile Virus Proteins

    Get PDF
    West Nile virus (WNV) has emerged globally as an increasingly important pathogen for humans and domestic animals. Studies of the evolutionary diversity of the virus over its known history will help to elucidate conserved sites, and characterize their correspondence to other pathogens and their relevance to the immune system. We describe a large-scale analysis of the entire WNV proteome, aimed at identifying and characterizing evolutionarily conserved amino acid sequences. This study, which used 2,746 WNV protein sequences collected from the NCBI GenPept database, focused on analysis of peptides of length 9 amino acids or more, which are immunologically relevant as potential T-cell epitopes. Entropy-based analysis of the diversity of WNV sequences, revealed the presence of numerous evolutionarily stable nonamer positions across the proteome (entropy value of ≤1). The representation (frequency) of nonamers variant to the predominant peptide at these stable positions was, generally, low (≤10% of the WNV sequences analyzed). Eighty-eight fragments of length 9–29 amino acids, representing ∼34% of the WNV polyprotein length, were identified to be identical and evolutionarily stable in all analyzed WNV sequences. Of the 88 completely conserved sequences, 67 are also present in other flaviviruses, and several have been associated with the functional and structural properties of viral proteins. Immunoinformatic analysis revealed that the majority (78/88) of conserved sequences are potentially immunogenic, while 44 contained experimentally confirmed human T-cell epitopes. This study identified a comprehensive catalogue of completely conserved WNV sequences, many of which are shared by other flaviviruses, and majority are potential epitopes. The complete conservation of these immunologically relevant sequences through the entire recorded WNV history suggests they will be valuable as components of peptide-specific vaccines or other therapeutic applications, for sequence-specific diagnosis of a wide-range of Flavivivirus infections, and for studies of homologous sequences among other flaviviruses

    Implementing parasite genotyping into national surveillance frameworks: Feedback from control programmes and researchers in the Asia-pacific region

    Get PDF
    The Asia-Pacific region faces formidable challenges in achieving malaria elimination by the proposed target in 2030. Molecular surveillance of Plasmodium parasites can provide important information on malaria transmission and adaptation, which can inform national malaria control programmes (NMCPs) in decision-making processes. In November 2019 a parasite genotyping workshop was held in Jakarta, Indonesia, to review molecular approaches for parasite surveillance and explore ways in which these tools can be integrated into public health systems and inform policy. The meeting was attended by 70 participants from 8 malaria-endemic countries and partners of the Asia Pacific Malaria Elimination Network. The participants acknowledged the utility of multiple use cases for parasite genotyping including: quantifying the prevalence of drug resistant parasites, predicting risks of treatment failure, identifying major routes and reservoirs of infection, monitoring imported malaria and its contribution to local transmission, characterizing the origins and dynamics of malaria outbreaks, and estimating the frequency of Plasmodium vivax relapses. However, the priority of each use case varies with different endemic settings. Although a one-size-fits-all approach to molecular surveillance is unlikely to be applicable across the Asia-Pacific region, consensus on the spectrum of added-value activities will help support data sharing across national boundaries. Knowledge exchange is needed to establish local expertise in different laboratory-based methodologies and bioinformatics processes. Collaborative research involving local and international teams will help maximize the impact of analytical outputs on the operational needs of NMCPs. Research is also needed to explore the cost-effectiveness of genetic epidemiology for different use cases to help to leverage funding for wide-scale implementation. Engagement between NMCPs and local researchers will be critical throughout this process

    Conservation and Variability of Dengue Virus Proteins: Implications for Vaccine Design

    Get PDF
    Dengue viruses (DENVs) circulate in nature as a population of 4 distinct types, each with multiple genotypes and variants, and represent an increasing global public health issue with no prophylactic and therapeutic formulations currently available. Viral genomes contain sites that are evolutionarily stable and therefore highly conserved, presumably because changes in these sites have deleterious effects on viral fitness and survival. The identification and characterization of the historical dynamics of these sites in DENV have relevance to several applications such as diagnosis and drug and vaccine development. In this study, we have identified sequence fragments that were conserved across the majority of available DENV sequences, analyzed their historical dynamics, and evaluated their relevance as candidate vaccine targets, using various bioinformatics-based methods and immune assay in human leukocyte antigen (HLA) transgenic mice. This approach provides a framework for large-scale and systematic analysis of other human pathogens

    Genome-wide SNP analysis of Plasmodium falciparum shows differentiation at drug-resistance-associated loci among malaria transmission settings in southern Mali.

    Get PDF
    Plasmodium falciparum malaria cases in Africa represent over 90% of the global burden with Mali being amongst the 11 highest burden countries that account for 70% of this annual incidence. The persistence of P. falciparum despite massive global interventions is because of its genetic diversity that drives its ability to adapt to environmental changes, develop resistance to drugs, and evade the host immune system. Knowledge on P. falciparum genetic diversity across populations and intervention landscape is thus critical for the implementation of new strategies to eliminate malaria. This study assessed genetic variation with 12,177 high-quality SNPs from 830 Malian P. falciparum isolates collected between 2007 and 2017 from seven locations. The complexity of infections remained high, varied between sites, and showed a trend toward overall decreasing complexity over the decade. Though there was no significant substructure, allele frequencies varied geographically, partly driven by temporal variance in sampling, particularly for drug resistance and antigen loci. Thirty-two mutations in known drug resistance markers (pfcrt, pfdhps, pfdhfr, pfmdr1, pfmdr2, and pfk13) attained a frequency of at least 2% in the populations. SNPs within and around the major markers of resistance to quinolines (pfmdr1 and pfcrt) and antifolates (pfdhfr and pfdhps) varied temporally and geographically, with strong linkage disequilibrium and signatures of directional selection in the genome. These geo-temporal populations also differentiated at alleles in immune-related loci, including, protein E140, pfsurfin8, pfclag8, and pfceltos, as well as pftrap, which showed signatures of haplotype differentiation between populations. Several regions across the genomes, including five known drug resistance loci, showed signatures of differential positive selection. These results suggest that drugs and immune pressure are dominant selective forces against P. falciparum in Mali, but their effect on the parasite genome varies temporally and spatially. Interventions interacting with these genomic variants need to be routinely evaluated as malaria elimination strategies are implemented

    Molecular markers for artemisinin and partner drug resistance in natural Plasmodium falciparum populations following increased insecticide treated net coverage along the slope of mount Cameroon: cross-sectional study.

    Get PDF
    BACKGROUND: Drug resistance is one of the greatest challenges of malaria control programmes, with the monitoring of parasite resistance to artemisinins or to Artemisinin Combination Therapy (ACT) partner drugs critical to elimination efforts. Markers of resistance to a wide panel of antimalarials were assessed in natural parasite populations from southwestern Cameroon. METHODS: Individuals with asymptomatic parasitaemia or uncomplicated malaria were enrolled through cross-sectional surveys from May 2013 to March 2014 along the slope of mount Cameroon. Plasmodium falciparum malaria parasitaemic blood, screened by light microscopy, was depleted of leucocytes using CF11 cellulose columns and the parasite genotype ascertained by sequencing on the Illumina HiSeq platform. RESULTS: A total of 259 participants were enrolled in this study from three different altitudes. While some alleles associated with drug resistance in pfdhfr, pfmdr1 and pfcrt were highly prevalent, less than 3% of all samples carried mutations in the pfkelch13 gene, none of which were amongst those associated with slow artemisinin parasite clearance rates in Southeast Asia. The most prevalent haplotypes were triple mutants Pfdhfr I 51 R 59 N 108 I 164(99%), pfcrt- C72V73 I 74 E 75 T 76 (47.3%), and single mutants PfdhpsS436 G 437K540A581A613(69%) and Pfmdr1 N86 F 184D1246 (53.2%). CONCLUSIONS: The predominance of the Pf pfcrt CVIET and Pf dhfr IRN triple mutant parasites and absence of pfkelch13 resistance alleles suggest that the amodiaquine and pyrimethamine components of AS-AQ and SP may no longer be effective in their role while chloroquine resistance still persists in southwestern Cameroon
    corecore