68,037 research outputs found

    An Alternative Model of Amino Acid Replacement

    Full text link
    The observed correlations between pairs of homologous protein sequences are typically explained in terms of a Markovian dynamic of amino acid substitution. This model assumes that every location on the protein sequence has the same background distribution of amino acids, an assumption that is incompatible with the observed heterogeneity of protein amino acid profiles and with the success of profile multiple sequence alignment. We propose an alternative model of amino acid replacement during protein evolution based upon the assumption that the variation of the amino acid background distribution from one residue to the next is sufficient to explain the observed sequence correlations of homologs. The resulting dynamical model of independent replacements drawn from heterogeneous backgrounds is simple and consistent, and provides a unified homology match score for sequence-sequence, sequence-profile and profile-profile alignment.Comment: Minor improvements. Added figure and reference

    Deriving a mutation index of carcinogenicity using protein structure and protein interfaces

    Get PDF
    With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/

    Pairwise alignment incorporating dipeptide covariation

    Full text link
    Motivation: Standard algorithms for pairwise protein sequence alignment make the simplifying assumption that amino acid substitutions at neighboring sites are uncorrelated. This assumption allows implementation of fast algorithms for pairwise sequence alignment, but it ignores information that could conceivably increase the power of remote homolog detection. We examine the validity of this assumption by constructing extended substitution matrixes that encapsulate the observed correlations between neighboring sites, by developing an efficient and rigorous algorithm for pairwise protein sequence alignment that incorporates these local substitution correlations, and by assessing the ability of this algorithm to detect remote homologies. Results: Our analysis indicates that local correlations between substitutions are not strong on the average. Furthermore, incorporating local substitution correlations into pairwise alignment did not lead to a statistically significant improvement in remote homology detection. Therefore, the standard assumption that individual residues within protein sequences evolve independently of neighboring positions appears to be an efficient and appropriate approximation

    Extreme genetic fragility of the HIV-1 capsid

    Get PDF
    Genetic robustness, or fragility, is defined as the ability, or lack thereof, of a biological entity to maintain function in the face of mutations. Viruses that replicate via RNA intermediates exhibit high mutation rates, and robustness should be particularly advantageous to them. The capsid (CA) domain of the HIV-1 Gag protein is under strong pressure to conserve functional roles in viral assembly, maturation, uncoating, and nuclear import. However, CA is also under strong immunological pressure to diversify. Therefore, it would be particularly advantageous for CA to evolve genetic robustness. To measure the genetic robustness of HIV-1 CA, we generated a library of single amino acid substitution mutants, encompassing almost half the residues in CA. Strikingly, we found HIV-1 CA to be the most genetically fragile protein that has been analyzed using such an approach, with 70% of mutations yielding replication-defective viruses. Although CA participates in several steps in HIV-1 replication, analysis of conditionally (temperature sensitive) and constitutively non-viable mutants revealed that the biological basis for its genetic fragility was primarily the need to coordinate the accurate and efficient assembly of mature virions. All mutations that exist in naturally occurring HIV-1 subtype B populations at a frequency >3%, and were also present in the mutant library, had fitness levels that were >40% of WT. However, a substantial fraction of mutations with high fitness did not occur in natural populations, suggesting another form of selection pressure limiting variation in vivo. Additionally, known protective CTL epitopes occurred preferentially in domains of the HIV-1 CA that were even more genetically fragile than HIV-1 CA as a whole. The extreme genetic fragility of HIV-1 CA may be one reason why cell-mediated immune responses to Gag correlate with better prognosis in HIV-1 infection, and suggests that CA is a good target for therapy and vaccination strategies

    Functional characterization and structure-guided mutational analysis of the transsulfuration enzyme cystathionine Ξ³-lyase from toxoplasma gondii

    Get PDF
    Sulfur-containing amino acids play essential roles in many organisms. The protozoan parasite Toxoplasma gondii includes the genes for cystathionine Ξ²-synthase and cystathionine Ξ³-lyase (TgCGL), as well as for cysteine synthase, which are crucial enzymes of the transsulfuration and de novo pathways for cysteine biosynthesis, respectively. These enzymes are specifically expressed in the oocyst stage of T. gondii. However, their functionality has not been investigated. Herein, we expressed and characterized the putative CGL from T. gondii. Recombinant TgCGL almost exclusively catalyses the Ξ±,Ξ³-hydrolysis of L-cystathionine to form L-cysteine and displays marginal reactivity toward L-cysteine. Structure-guided homology modelling revealed two striking amino acid differences between the human and parasite CGL active-sites (Glu59 and Ser340 in human to Ser77 and Asn360 in toxoplasma). Mutation of Asn360 to Ser demonstrated the importance of this residue in modulating the specificity for the catalysis of Ξ±,Ξ²-versus Ξ±,Ξ³-elimination of L-cystathionine. Replacement of Ser77 by Glu completely abolished activity towards L-cystathionine. Our results suggest that CGL is an important functional enzyme in T. gondii, likely implying that the reverse transsulfuration pathway is operative in the parasite; we also probed the roles of active-site architecture and substrate binding conformations as determinants of reaction specificity in transsulfuration enzymes

    Inferring stabilizing mutations from protein phylogenies : application to influenza hemagglutinin

    Get PDF
    One selection pressure shaping sequence evolution is the requirement that a protein fold with sufficient stability to perform its biological functions. We present a conceptual framework that explains how this requirement causes the probability that a particular amino acid mutation is fixed during evolution to depend on its effect on protein stability. We mathematically formalize this framework to develop a Bayesian approach for inferring the stability effects of individual mutations from homologous protein sequences of known phylogeny. This approach is able to predict published experimentally measured mutational stability effects (ΔΔG values) with an accuracy that exceeds both a state-of-the-art physicochemical modeling program and the sequence-based consensus approach. As a further test, we use our phylogenetic inference approach to predict stabilizing mutations to influenza hemagglutinin. We introduce these mutations into a temperature-sensitive influenza virus with a defect in its hemagglutinin gene and experimentally demonstrate that some of the mutations allow the virus to grow at higher temperatures. Our work therefore describes a powerful new approach for predicting stabilizing mutations that can be successfully applied even to large, complex proteins such as hemagglutinin. This approach also makes a mathematical link between phylogenetics and experimentally measurable protein properties, potentially paving the way for more accurate analyses of molecular evolution

    Dynamic control of selectivity in the ubiquitination pathway revealed by an ASP to GLU substitution in an intra-molecular salt-bridge network

    Get PDF
    Ubiquitination relies on a subtle balance between selectivity and promiscuity achieved through specific interactions between ubiquitin-conjugating enzymes (E2s) and ubiquitin ligases (E3s). Here, we report how a single aspartic to glutamic acid substitution acts as a dynamic switch to tip the selectivity balance of human E2s for interaction toward E3 RING-finger domains. By combining molecular dynamic simulations, experimental yeast-two-hybrid screen of E2-E3 (RING) interactions and mutagenesis, we reveal how the dynamics of an internal salt-bridge network at the rim of the E2-E3 interaction surface controls the balance between an β€œopen”, binding competent, and a β€œclosed”, binding incompetent state. The molecular dynamic simulations shed light on the fine mechanism of this molecular switch and allowed us to identify its components, namely an aspartate/glutamate pair, a lysine acting as the central switch and a remote aspartate. Perturbations of single residues in this network, both inside and outside the interaction surface, are sufficient to switch the global E2 interaction selectivity as demonstrated experimentally. Taken together, our results indicate a new mechanism to control E2-E3 interaction selectivity at an atomic level, highlighting how minimal changes in amino acid side-chain affecting the dynamics of intramolecular salt-bridges can be crucial for protein-protein interactions. These findings indicate that the widely accepted sequence-structure-function paradigm should be extended to sequence-structure-dynamics-function relationship and open new possibilities for control and fine-tuning of protein interaction selectivity
    • …
    corecore