227 research outputs found

    Protein Meta-Functional Signatures from Combining Sequence, Structure, Evolution, and Amino Acid Property Information

    Get PDF
    Protein function is mediated by different amino acid residues, both their positions and types, in a protein sequence. Some amino acids are responsible for the stability or overall shape of the protein, playing an indirect role in protein function. Others play a functionally important role as part of active or binding sites of the protein. For a given protein sequence, the residues and their degree of functional importance can be thought of as a signature representing the function of the protein. We have developed a combination of knowledge- and biophysics-based function prediction approaches to elucidate the relationships between the structural and the functional roles of individual residues and positions. Such a meta-functional signature (MFS), which is a collection of continuous values representing the functional significance of each residue in a protein, may be used to study proteins of known function in greater detail and to aid in experimental characterization of proteins of unknown function. We demonstrate the superior performance of MFS in predicting protein functional sites and also present four real-world examples to apply MFS in a wide range of settings to elucidate protein sequence–structure–function relationships. Our results indicate that the MFS approach, which can combine multiple sources of information and also give biological interpretation to each component, greatly facilitates the understanding and characterization of protein function

    Molecular clock-like evolution of human immunodeficiency virus type 1

    Get PDF
    AbstractThe molecular clock hypothesis states that the rate of nucleotide substitution per generation is constant across lineages. If generation times were equal across lineages, samples obtained at the same calendar time would have experienced the same number of generations since their common ancestor. However, if sequences are not derived from contemporaneous samples, differences in the number of generations may be misinterpreted as variation in substitution rates and hence may lead to false rejection of the molecular clock hypothesis. A recent study has called into doubt the validity of clock-like evolution for HIV-1, using molecular sequences derived from noncontemporaneous samples. However, after separating their within-individual data according to sampling time, we found that what appeared to be nonclock-like behavior could be attributed, in most cases, to noncontemporaneous sampling, with contributions also likely to derive from recombination. Natural selection alone did not appear to obscure the clock-like evolution of HIV-1

    Coping with Viral Diversity in HIV Vaccine Design

    Get PDF
    The ability of human immunodeficiency virus type 1 (HIV-1) to develop high levels of genetic diversity, and thereby acquire mutations to escape immune pressures, contributes to the difficulties in producing a vaccine. Possibly no single HIV-1 sequence can induce sufficiently broad immunity to protect against a wide variety of infectious strains, or block mutational escape pathways available to the virus after infection. The authors describe the generation of HIV-1 immunogens that minimizes the phylogenetic distance of viral strains throughout the known viral population (the center of tree [COT]) and then extend the COT immunogen by addition of a composite sequence that includes high-frequency variable sites preserved in their native contexts. The resulting COT(+) antigens compress the variation found in many independent HIV-1 isolates into lengths suitable for vaccine immunogens. It is possible to capture 62% of the variation found in the Nef protein and 82% of the variation in the Gag protein into immunogens of three gene lengths. The authors put forward immunogen designs that maximize representation of the diverse antigenic features present in a spectrum of HIV-1 strains. These immunogens should elicit immune responses against high-frequency viral strains as well as against most mutant forms of the virus

    Recognition of HIV-1 Peptides by Host CTL Is Related to HIV-1 Similarity to Human Proteins

    Get PDF
    Background: While human immunodeficiency virus type 1 (HIV-1)-specific cytotoxic T lymphocytes preferentially target specific regions of the viral proteome, HIV-1 features that contribute to immune recognition are not well understood. One hypothesis is that similarities between HIV and human proteins influence the host immune response, i.e., resemblance between viral and host peptides could preclude reactivity against certain HIV epitopes. Methodology/Principal Findings: We analyzed the extent of similarity between HIV-1 and the human proteome. Proteins from the HIV-1 B consensus sequence from 2001 were dissected into overlapping k-mers, which were then probed against a non-redundant database of the human proteome in order to identify segments of high similarity. We tested the relationship between HIV-1 similarity to host encoded peptides and immune recognition in HIV-infected individuals, and found that HIV immunogenicity could be partially modulated by the sequence similarity to the host proteome. ELISpot responses to peptides spanning the entire viral proteome evaluated in 314 individuals showed a trend indicating an inverse relationship between the similarity to the host proteome and the frequency of recognition. In addition, analysis of responses by a group of 30 HIV-infected individuals against 944 overlapping peptides representing a broad range of individual HIV-1B Nef variants, affirmed that the degree of similarity to the host was significantly lower for peptides with reactive epitopes than for those that were not recognized. Conclusions/Significance: Our results suggest that antigenic motifs that are scarcely represented in human proteins might represent more immunogenic CTL targets not selected against in the host. This observation could provide guidance in the design of more effective HIV immunogens, as sequences devoid of host-like features might afford superior immune reactivity

    Refining susceptibility loci of chronic obstructive pulmonary disease with lung eqtls

    Get PDF
    Chronic obstructive pulmonary disease (COPD) is the fourth leading cause of mortality worldwide. Recent genome-wide association studies (GWAS) have identified robust susceptibility loci associated with COPD. However, the mechanisms mediating the risk conferred by these loci remain to be found. The goal of this study was to identify causal genes/variants within susceptibility loci associated with COPD. In the discovery cohort, genome-wide gene expression profiles of 500 non-tumor lung specimens were obtained from patients undergoing lung surgery. Blood-DNA from the same patients were genotyped for 1,2 million SNPs. Following genotyping and gene expression quality control filters, 409 samples were analyzed. Lung expression quantitative trait loci (eQTLs) were identified and overlaid onto three COPD susceptibility loci derived from GWAS; 4q31 (HHIP), 4q22 (FAM13A), and 19q13 (RAB4B, EGLN2, MIA, CYP2A6). Significant eQTLs were replicated in two independent datasets (n = 363 and 339). SNPs previously associated with COPD and lung function on 4q31 (rs1828591, rs13118928) were associated with the mRNA expression of HHIP. An association between mRNA expression level of FAM13A and SNP rs2045517 was detected at 4q22, but did not reach statistical significance. At 19q13, significant eQTLs were detected with EGLN2. In summary, this study supports HHIP, FAM13A, and EGLN2 as the most likely causal COPD genes on 4q31, 4q22, and 19q13, respectively. Strong lung eQTL SNPs identified in this study will need to be tested for association with COPD in case-control studies. Further functional studies will also be needed to understand the role of genes regulated by disease-related variants in COPD

    Neutralization of Diverse Human Cytomegalovirus Strains Conferred by Antibodies Targeting Viral gH/gL/pUL128-131 Pentameric Complex

    Get PDF
    Human cytomegalovirus (HCMV) is the leading cause of congenital viral infection, and developing a prophylactic vaccine is of high priority to public health. We recently reported a replication-defective human cytomegalovirus with restored pentameric complex glycoprotein H (gH)/gL/pUL128-131 for prevention of congenital HCMV infection. While the quantity of vaccine-induced antibody responses can be measured in a viral neutralization assay, assessing the quality of such responses, including the ability of vaccine-induced antibodies to cross-neutralize the field strains of HCMV, remains a challenge. In this study, with a panel of neutralizing antibodies from three healthy human donors with natural HCMV infection or a vaccinated animal, we mapped eight sites on the dominant virus-neutralizing antigen-the pentameric complex of glycoprotein H (gH), gL, and pUL128, pUL130, and pUL131. By evaluating the site-specific antibodies in vaccine immune sera, we demonstrated that vaccination elicited functional antiviral antibodies to multiple neutralizing sites in rhesus macaques, with quality attributes comparable to those of CMV hyperimmune globulin. Furthermore, these immune sera showed antiviral activities against a panel of genetically distinct HCMV clinical isolates. These results highlighted the importance of understanding the quality of vaccine-induced antibody responses, which includes not only the neutralizing potency in key cell types but also the ability to protect against the genetically diverse field strains. IMPORTANCE HCMV is the leading cause of congenital viral infection, and development of a preventive vaccine is a high public health priority. To understand the strain coverage of vaccine-induced immune responses in comparison with natural immunity, we used a panel of broadly neutralizing antibodies to identify the immunogenic sites of a dominant viral antigen-the pentameric complex. We further demonstrated that following vaccination of a replication-defective virus with the restored pentameric complex, rhesus macaques can develop broadly neutralizing antibodies targeting multiple immunogenic sites of the pentameric complex. Such analyses of site-specific antibody responses are imperative to our assessment of the quality of vaccine-induced immunity in clinical studies

    The Overlap of Lung Tissue Transcriptome of Smoke Exposed Mice with Human Smoking and COPD

    Get PDF
    Genome-wide mRNA profiling in lung tissue from human and animal models can provide novel insights into the pathogenesis of chronic obstructive pulmonary disease (COPD). While 6 months of smoke exposure are widely used, shorter durations were also reported. The overlap of short term and long-term smoke exposure in mice is currently not well understood, and their representation of the human condition is uncertain. Lung tissue gene expression profiles of six murine smoking experiments (n = 48) were obtained from the Gene Expression Omnibus (GEO) and analyzed to identify the murine smoking signature. The 'human smoking' gene signature containing 386 genes was previously published in the lung eQTL study (n = 1,111). A signature of mild COPD containing 7 genes was also identified in the same study. The lung tissue gene signature of 'severe COPD' (n = 70) contained 4,071 genes and was previously published. We detected 3,723 differentially expressed genes in the 6 month-exposure mice datasets (FDR <0.1). Of those, 184 genes (representing 48% of human smoking) and 1,003 (representing 27% of human COPD) were shared with the human smoking-related genes and the COPD severity-related genes, respectively. There was 4-fold over-representation of human and murine smoking-related genes (P = 6.7 × 10-26) and a 1.4 fold in the severe COPD -related genes (P = 2.3 × 10-12). There was no significant enrichment of the mice and human smoking-related genes in mild COPD signature. These data suggest that murine smoke models are strongly representative of molecular processes of human smoking but less of COPD
    corecore