246 research outputs found

    Protein structures and optimal folding emerging from a geometrical variational principle

    Full text link
    Novel numerical techniques, validated by an analysis of barnase and chymotrypsin inhibitor, are used to elucidate the paramount role played by the geometry of the protein backbone in steering the folding to the correct native state. It is found that, irrespective of the sequence, the native state of a protein has exceedingly large number of conformations with a given amount of structural overlap compared to other compact artificial backbones; moreover the conformational entropies of unrelated proteins of the same length are nearly equal at any given stage of folding. These results are suggestive of an extremality principle underlying protein evolution, which, in turn, is shown to be associated with the emergence of secondary structures.Comment: Revtex, 5 pages, 5 postscript figure

    Geometric and Statistical Properties of the Mean-Field HP Model, the LS Model and Real Protein Sequences

    Get PDF
    Lattice models, for their coarse-grained nature, are best suited for the study of the ``designability problem'', the phenomenon in which most of the about 16,000 proteins of known structure have their native conformations concentrated in a relatively small number of about 500 topological classes of conformations. Here it is shown that on a lattice the most highly designable simulated protein structures are those that have the largest number of surface-core switchbacks. A combination of physical, mathematical and biological reasons that causes the phenomenon is given. By comparing the most foldable model peptides with protein sequences in the Protein Data Bank, it is shown that whereas different models may yield similar designabilities, predicted foldable peptides will simulate natural proteins only when the model incorporates the correct physics and biology, in this case if the main folding force arises from the differing hydrophobicity of the residues, but does not originate, say, from the steric hindrance effect caused by the differing sizes of the residues.Comment: 12 pages, 10 figure

    Mean-Field HP Model, Designability and Alpha-Helices in Protein Structures

    Full text link
    Analysis of the geometric properties of a mean-field HP model on a square lattice for protein structure shows that structures with large number of switch backs between surface and core sites are chosen favorably by peptides as unique ground states. Global comparison of model (binary) peptide sequences with concatenated (binary) protein sequences listed in the Protein Data Bank and the Dali Domain Dictionary indicates that the highest correlation occurs between model peptides choosing the favored structures and those portions of protein sequences containing alpha-helices.Comment: 4 pages, 2 figure

    Prediction of backbone dihedral angles and protein secondary structure using support vector machines

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The prediction of the secondary structure of a protein is a critical step in the prediction of its tertiary structure and, potentially, its function. Moreover, the backbone dihedral angles, highly correlated with secondary structures, provide crucial information about the local three-dimensional structure.</p> <p>Results</p> <p>We predict independently both the secondary structure and the backbone dihedral angles and combine the results in a loop to enhance each prediction reciprocally. Support vector machines, a state-of-the-art supervised classification technique, achieve secondary structure predictive accuracy of 80% on a non-redundant set of 513 proteins, significantly higher than other methods on the same dataset. The dihedral angle space is divided into a number of regions using two unsupervised clustering techniques in order to predict the region in which a new residue belongs. The performance of our method is comparable to, and in some cases more accurate than, other multi-class dihedral prediction methods.</p> <p>Conclusions</p> <p>We have created an accurate predictor of backbone dihedral angles and secondary structure. Our method, called DISSPred, is available online at <url>http://comp.chem.nottingham.ac.uk/disspred/</url>.</p

    Chaperones convert the energy from ATP into the nonequilibrium stabilization of native proteins.

    Get PDF
    During and after protein translation, molecular chaperones require ATP hydrolysis to favor the native folding of their substrates and, under stress, to avoid aggregation and revert misfolding. Why do some chaperones need ATP, and what are the consequences of the energy contributed by the ATPase cycle? Here, we used biochemical assays and physical modeling to show that the bacterial chaperones GroEL (Hsp60) and DnaK (Hsp70) both use part of the energy from ATP hydrolysis to restore the native state of their substrates, even under denaturing conditions in which the native state is thermodynamically unstable. Consistently with thermodynamics, upon exhaustion of ATP, the metastable native chaperone products spontaneously revert to their equilibrium non-native states. In the presence of ATPase chaperones, some proteins may thus behave as open ATP-driven, nonequilibrium systems whose fate is only partially determined by equilibrium thermodynamics

    The Proteomic Code: a molecular recognition code for proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Proteomic Code is a set of rules by which information in genetic material is transferred into the physico-chemical properties of amino acids. It determines how individual amino acids interact with each other during folding and in specific protein-protein interactions. The Proteomic Code is part of the redundant Genetic Code.</p> <p>Review</p> <p>The 25-year-old history of this concept is reviewed from the first independent suggestions by Biro and Mekler, through the works of Blalock, Root-Bernstein, Siemion, Miller and others, followed by the discovery of a Common Periodic Table of Codons and Nucleic Acids in 2003 and culminating in the recent conceptualization of partial complementary coding of interacting amino acids as well as the theory of the nucleic acid-assisted protein folding.</p> <p>Methods and conclusions</p> <p>A novel cloning method for the design and production of specific, high-affinity-reacting proteins (SHARP) is presented. This method is based on the concept of proteomic codes and is suitable for large-scale, industrial production of specifically interacting peptides.</p

    Investigating Homology between Proteins using Energetic Profiles

    Get PDF
    Accumulated experimental observations demonstrate that protein stability is often preserved upon conservative point mutation. In contrast, less is known about the effects of large sequence or structure changes on the stability of a particular fold. Almost completely unknown is the degree to which stability of different regions of a protein is generally preserved throughout evolution. In this work, these questions are addressed through thermodynamic analysis of a large representative sample of protein fold space based on remote, yet accepted, homology. More than 3,000 proteins were computationally analyzed using the structural-thermodynamic algorithm COREX/BEST. Estimated position-specific stability (i.e., local Gibbs free energy of folding) and its component enthalpy and entropy were quantitatively compared between all proteins in the sample according to all-vs.-all pairwise structural alignment. It was discovered that the local stabilities of homologous pairs were significantly more correlated than those of non-homologous pairs, indicating that local stability was indeed generally conserved throughout evolution. However, the position-specific enthalpy and entropy underlying stability were less correlated, suggesting that the overall regional stability of a protein was more important than the thermodynamic mechanism utilized to achieve that stability. Finally, two different types of statistically exceptional evolutionary structure-thermodynamic relationships were noted. First, many homologous proteins contained regions of similar thermodynamics despite localized structure change, suggesting a thermodynamic mechanism enabling evolutionary fold change. Second, some homologous proteins with extremely similar structures nonetheless exhibited different local stabilities, a phenomenon previously observed experimentally in this laboratory. These two observations, in conjunction with the principal conclusion that homologous proteins generally conserved local stability, may provide guidance for a future thermodynamically informed classification of protein homology

    Experimental Microbial Evolution of Extremophiles

    Get PDF
    Experimental microbial evolutions (EME) involves studying closely a microbial population after it has been through a large number of generations under controlled conditions (Kussell 2013). Adaptive laboratory evolution (ALE) selects for fitness under experimentally imposed conditions (Bennett and Hughes 2009; Dragosits and Mattanovich 2013). However, experimental evolution studies focusing on the contributions of genetic drift and natural mutation rates to evolution are conducted under non-selective conditions to avoid changes imposed by selection (Hindré et al. 2012). To understand the application of experimental evolutionary methods to extremophiles it is essential to consider the recent growth in this field over the last decade using model non-extremophilic microorganisms. This growth reflects both a greater appreciation of the power of experimental evolution for testing evolutionary hypotheses and, especially recently, the new power of genomic methods for analyzing changes in experimentally evolved lineages. Since many crucial processes are driven by microorganisms in nature, it is essential to understand and appreciate how microbial communities function, particularly with relevance to selection. However, many theories developed to understand microbial ecological patterns focus on the distribution and the structure of diversity within a microbial population comprised of single species (Prosser et al. 2007). Therefore an understanding of the concept of species is needed. A common definition of species using a genetic concept is a group of interbreeding individuals that is isolated from other such groups by barriers of recombination (Prosser et al. 2007). An alternative ecological species concept defines a species as set of individuals that can be considered identical in all relevant ecological traits (Cohan 2001). This is particularly important because of the abundance and deep phylogenetic complexity of microbial communities. Cohan postulated that “bacteria occupy discrete niches and that periodic selection will purge genetic variation within each niche without preventing divergence between the inhabitants of different niches”. The importance of gene exchange mechanisms likely in bacteria and archaea and therefore extremophiles, arises from the fact that their genomes are divided into two distinct parts, the core genome and the accessory genome (Cohan 2001). The core genome consists of genes that are crucial for the functioning of an organism and the accessory genome consists of genes that are capable of adapting to the changing ecosystem through gain and loss of function. Strains that belong to the same species can differ in the composition of accessory genes and therefore their capability to adapt to changing ecosystems (Cohan 2001; Tettelin et al. 2005; Gill et al. 2005). Additional ecological diversity exists in plasmids, transposons and pathogenicity islands as they can be easily shared in a favorable environment but still be absent in the same species found elsewhere (Wertz et al. 2003). This poses a major challenge for studying ALE and community microbial ecology indicating a continued need to develop a fitting theory that connects the fluid nature of microbial communities to their ecology (Wertz et al. 2003; Coleman et al. 2006). Understanding the nature and contribution of different processes that determine the frequencies of genes in any population is the biggest concern in population and evolutionary genetics (Prosser et al. 2007) and it is critical for an understanding of experimental evolution

    From protein sequences to 3D-structures and beyond: the example of the UniProt Knowledgebase

    Get PDF
    With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website (http://www.uniprot.org/). It also evokes precautions that are necessary for successful predictions and extrapolations
    corecore