Editorial: hypotheses about protein folding - the proteomic code and wonderfolds
Theoretical biology journals can contribute in many ways to the progress of knowledge. They are particularly well placed to encourage dialogue and debate about hypotheses addressing problematic areas of research. An online journal provides an especially useful forum for such debate because comments can be posted within days of the publication of a contentious article.
How the other half lives: CRISPR-Cas's influence on bacteriophages
CRISPR-Cas is a genetic adaptive immune system unique to prokaryotic cells
used to combat phage and plasmid threats. The host cell adapts by incorporating
DNA sequences from invading phages or plasmids into its CRISPR locus as
spacers. These spacers are expressed as mobile surveillance RNAs that direct
CRISPR-associated (Cas) proteins to protect against subsequent attack by the
same phages or plasmids. The threat from mobile genetic elements inevitably
shapes the CRISPR loci of archaea and bacteria, and simultaneously the
CRISPR-Cas immune system drives evolution of these invaders. Here we highlight
our recent work, as well as that of others, that seeks to understand phage
mechanisms of CRISPR-Cas evasion and conditions for population coexistence of
phages with CRISPR-protected prokaryotes.
Comparative analysis of thermophilic and mesophilic proteins using Protein Energy Networks
Background: Thermophilic proteins sustain themselves and function at higher temperatures. Despite their structural and functional similarities with their mesophilic homologues, they show enhanced stability. Various comparative studies at genomic, protein sequence and structure levels, and experimental works highlight the different factors and dominant interacting forces contributing to this increased stability. Methods: In this comparative structure-based study, we have used interaction energies between amino acids to generate structure networks called Protein Energy Networks (PENs). These PENs are used to compute network, sub-graph, and node-specific parameters. These parameters are then compared between the thermophile-mesophile homologues. Results: The results show an increased number of clusters and low-energy cliques in thermophiles as the main contributing factors for their enhanced stability. Furthermore, we see an increase in the number of hubs in thermophiles. We also observe no community of electrostatic cliques forming in PENs. Conclusion: In this study we were able to take an energy-based network approach to identify the factors responsible for the enhanced stability of thermophiles by comparative analysis. We were able to point out that the sub-graph parameters are the prominent contributing factors. The thermophiles have a better-packed hydrophobic core. We have also discussed, from a cliques and communities perspective, how thermophiles, although increasing stability through higher connectivity, retain conformational flexibility.
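The network construction described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the residue labels, the toy pairwise energies, the energy cutoff of -10, and the degree threshold used to call a node a hub are hypothetical and are not taken from the study. The clique search is a plain Bron-Kerbosch enumeration, not necessarily the method the authors used.

```python
# Toy Protein Energy Network: nodes are residues, edges join residue pairs
# whose interaction energy is at or below a cutoff (more negative = stronger).

def build_network(pair_energies, cutoff):
    """Return adjacency sets for all residues; keep edges with energy <= cutoff."""
    adj = {}
    for (a, b), energy in pair_energies.items():
        adj.setdefault(a, set())
        adj.setdefault(b, set())
        if energy <= cutoff:
            adj[a].add(b)
            adj[b].add(a)
    return adj

def hubs(adj, min_degree):
    """Residues with at least min_degree neighbours (assumed hub definition)."""
    return sorted(node for node, nbrs in adj.items() if len(nbrs) >= min_degree)

def maximal_cliques(adj):
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion."""
    def expand(r, p, x):
        if not p and not x:
            yield frozenset(r)
            return
        for v in list(p):
            yield from expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}
    yield from expand(set(), set(adj), set())

# Hypothetical pairwise interaction energies (arbitrary units).
TOY_ENERGIES = {
    ("A", "B"): -12.0, ("A", "C"): -11.0, ("B", "C"): -15.0,
    ("C", "D"): -10.5, ("D", "E"): -3.0, ("A", "D"): -2.0,
}

network = build_network(TOY_ENERGIES, cutoff=-10.0)
cliques = [c for c in maximal_cliques(network) if len(c) >= 3]
print("hubs:", hubs(network, min_degree=3))
print("cliques of size >= 3:", [sorted(c) for c in cliques])
```

Comparing such hub and clique counts between homologous structures at the same cutoff is the kind of sub-graph comparison the abstract refers to.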
Use of machine learning algorithms to classify binary protein sequences as highly-designable or poorly-designable
Background: By using a standard Support Vector Machine (SVM) with a Sequential Minimal Optimization (SMO) method of training, Naïve Bayes, and other machine learning algorithms, we are able to distinguish between two classes of protein sequences: those folding to highly-designable conformations, and those folding to poorly- or non-designable conformations. Results: First, we generate all possible compact lattice conformations for the specified shape (a hexagon or a triangle) on the 2D triangular lattice. Then we generate all possible binary hydrophobic/polar (H/P) sequences and, by using a specified energy function, thread them through all of these compact conformations. If for a given sequence the lowest energy is obtained for a particular lattice conformation, we assume that this sequence folds to that conformation. Highly-designable conformations have many H/P sequences folding to them, while poorly-designable conformations have few or no H/P sequences. We classify sequences as folding to either highly- or poorly-designable conformations. We have randomly selected subsets of the sequences belonging to highly-designable and poorly-designable conformations and used them to train several different standard machine learning algorithms. Conclusion: By using these machine learning algorithms with ten-fold cross-validation, we are able to classify the two classes of sequences with high accuracy, in some cases exceeding 95%.
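The enumerate-and-thread procedure in this abstract can be sketched on a very small case. The following is an illustration under stated assumptions, not the authors' code: the shape is a triangle with three sites per side (six sites), the energy function is assumed to be -1 per non-bonded H-H contact (the abstract only says "a specified energy function"), and walks that share the same contact map are treated as one structure so that symmetry-related copies do not count as competing ground states.

```python
from collections import Counter
from itertools import product

# Six neighbour directions on a triangular lattice, in axial coordinates.
NEIGHBOURS = [(1, 0), (-1, 0), (0, 1), (0, -1), (1, -1), (-1, 1)]

def triangle_sites(side):
    """Lattice sites of a filled triangle with `side` sites per edge."""
    return {(q, r) for q in range(side) for r in range(side) if q + r < side}

def adjacent(a, b):
    return (b[0] - a[0], b[1] - a[1]) in NEIGHBOURS

def compact_walks(sites):
    """Yield every self-avoiding walk that visits each site exactly once."""
    def extend(walk, visited):
        if len(walk) == len(sites):
            yield tuple(walk)
            return
        q, r = walk[-1]
        for dq, dr in NEIGHBOURS:
            nxt = (q + dq, r + dr)
            if nxt in sites and nxt not in visited:
                walk.append(nxt)
                visited.add(nxt)
                yield from extend(walk, visited)
                walk.pop()
                visited.remove(nxt)
    for start in sorted(sites):
        yield from extend([start], {start})

def contact_map(walk):
    """Pairs of chain positions that touch on the lattice but are not bonded."""
    n = len(walk)
    return frozenset((i, j) for i in range(n) for j in range(i + 2, n)
                     if adjacent(walk[i], walk[j]))

def energy(seq, contacts):
    """Assumed H/P energy: -1 for every H-H contact."""
    return -sum(1 for i, j in contacts if seq[i] == "H" and seq[j] == "H")

def designability(side=3):
    """Count, per structure, the sequences that have it as unique ground state."""
    sites = triangle_sites(side)
    structures = {contact_map(w) for w in compact_walks(sites)}
    counts = Counter()
    for seq in product("HP", repeat=len(sites)):
        scored = [(energy(seq, c), c) for c in structures]
        best = min(e for e, _ in scored)
        ground = [c for e, c in scored if e == best]
        if len(ground) == 1:
            counts[ground[0]] += 1
    return structures, counts

structures, counts = designability(side=3)
print(len(structures), "distinct contact maps;",
      sum(counts.values()), "sequences with a unique ground state")
```

Structures with large counts would be labelled highly designable, and the sequences mapped to them would form one of the two classes passed to a classifier.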
Interplay between pleiotropy and secondary selection determines rise and fall of mutators in stress response
A dramatic rise of mutators has been found to accompany adaptation of bacteria
in response to many kinds of stress. Two views on the evolutionary origin of
this phenomenon have emerged: the pleiotropic hypothesis, which posits that it is a
byproduct of environmental stress or other specific stress response mechanisms,
and the second-order selection hypothesis, which states that mutators hitchhike to fixation
with unrelated beneficial alleles. Conventional population genetics models
could not fully resolve this controversy because they are based on certain
assumptions about the fitness landscape. Here we address this problem using a
microscopic multiscale model, which couples physically realistic molecular
descriptions of proteins and their interactions with population genetics of
carrier organisms without assuming any a priori fitness landscape. We found
that both pleiotropy and second-order selection play a crucial role at
different stages of adaptation: the supply of mutators is provided through
destabilization of error correction complexes or fluctuations of production
levels of prototypic mismatch repair proteins (pleiotropic effects), while rise
and fixation of mutators occur when there is a sufficient supply of beneficial
mutations in replication-controlling genes. This general mechanism assures a
robust and reliable adaptation of organisms to unforeseen challenges. This
study highlights physical principles underlying the biological mechanisms
of stress response and adaptation.
Allostery in Its Many Disguises: From Theory to Applications.
Allosteric regulation plays an important role in many biological processes, such as signal transduction, transcriptional regulation, and metabolism. Allostery is rooted in the fundamental physical properties of macromolecular systems, but its underlying mechanisms are still poorly understood. A collection of contributions to a recent interdisciplinary CECAM (Centre Européen de Calcul Atomique et Moléculaire) workshop is used here to provide an overview of the progress and remaining limitations in the understanding of the mechanistic foundations of allostery gained from computational and experimental analyses of real protein systems and model systems. The main conceptual frameworks instrumental in driving the field are discussed. We illustrate the role of these frameworks in illuminating molecular mechanisms and explaining cellular processes, and describe some of their promising practical applications in engineering molecular sensors and informing drug design efforts.
Random Amino Acid Mutations and Protein Misfolding Lead to Shannon Limit in Sequence-Structure Communication
The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close-to-native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions) and in structure (structural defects) trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol, at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a) sensitive to random errors and (b) restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials.
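The effect of random substitutions on information transmission can be illustrated with a small calculation. This sketch computes the mutual information between a sequence symbol and a structural state for a fixed input distribution, which is the quantity that channel capacity maximizes; it does not reproduce the study's channel or its 0.010-bit limit. The 3x3 count table and the uniform-substitution noise model are hypothetical choices made for illustration.

```python
import math

def mutual_information(joint):
    """Mutual information in bits for a joint table of counts or probabilities."""
    total = sum(sum(row) for row in joint)
    px = [sum(row) / total for row in joint]
    py = [sum(row[j] for row in joint) / total for j in range(len(joint[0]))]
    mi = 0.0
    for i, row in enumerate(joint):
        for j, value in enumerate(row):
            if value > 0:
                p = value / total
                mi += p * math.log2(p / (px[i] * py[j]))
    return mi

def with_substitution_noise(joint, eps):
    """Joint distribution after each input symbol is replaced, with
    probability eps, by a symbol drawn uniformly at random (assumed model)."""
    n_x = len(joint)
    total = sum(sum(row) for row in joint)
    col = [sum(row[j] for row in joint) / total for j in range(len(joint[0]))]
    return [[(1 - eps) * joint[i][j] / total + eps * col[j] / n_x
             for j in range(len(col))] for i in range(n_x)]

# Hypothetical counts: rows are sequence symbols, columns are structural states.
TOY_JOINT = [[40, 5, 5],
             [5, 30, 5],
             [5, 5, 20]]

for eps in (0.0, 0.25, 0.5, 1.0):
    noisy = with_substitution_noise(TOY_JOINT, eps)
    print(f"eps={eps:.2f}  I(X;Y)={mutual_information(noisy):.4f} bits")
```

As the substitution rate rises, the transmitted information falls and reaches zero when every symbol is random, which is the qualitative behaviour the abstract describes.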
Long-Range Intra-Protein Communication Can Be Transmitted by Correlated Side-Chain Fluctuations Alone
Allosteric regulation is a key component of cellular communication, but the way in which information is passed from one site to another within a folded protein is often unclear. While backbone motions have long been considered essential for long-range information conveyance, side-chain motions have rarely been considered. In this work, we demonstrate their potential utility using Monte Carlo sampling of side-chain torsional angles on a fixed backbone to quantify correlations amongst side-chain inter-rotameric motions. Results indicate that long-range correlations of side-chain fluctuations can arise independently from several different types of interactions: steric repulsions, implicit solvent interactions, or hydrogen bonding and salt-bridge interactions. These robust correlations persist across the entire protein (up to 60 Å in the case of calmodulin) and can propagate long-range changes in side-chain variability in response to single residue perturbations.
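The sampling-and-correlation idea can be sketched with a toy model. This is not the authors' force field or protocol: it assumes a linear chain of residues, each with three discrete rotamer states, a hypothetical nearest-neighbour coupling (in units of kT) that favours matching states, and Metropolis Monte Carlo moves. Correlation between two residues is measured as the mutual information of their sampled rotamer states.

```python
import math
import random
from collections import Counter

def sample_rotamers(n_res=8, n_states=3, coupling=1.5, sweeps=2000, seed=0):
    """Metropolis sampling of discrete rotamer states on a fixed toy chain."""
    rng = random.Random(seed)
    state = [rng.randrange(n_states) for _ in range(n_res)]

    def local_energy(i, s):
        # Assumed interaction: -coupling for each neighbour in the same state.
        e = 0.0
        for j in (i - 1, i + 1):
            if 0 <= j < n_res and state[j] == s:
                e -= coupling
        return e

    samples = []
    for _ in range(sweeps):
        for i in range(n_res):
            proposal = rng.randrange(n_states)
            delta = local_energy(i, proposal) - local_energy(i, state[i])
            # Metropolis criterion with energies expressed in units of kT.
            if delta <= 0 or rng.random() < math.exp(-delta):
                state[i] = proposal
        samples.append(tuple(state))
    return samples

def mutual_information(samples, i, j):
    """Plug-in estimate (bits) of the mutual information between residues i, j."""
    n = len(samples)
    pair = Counter((s[i], s[j]) for s in samples)
    pi = Counter(s[i] for s in samples)
    pj = Counter(s[j] for s in samples)
    return sum((c / n) * math.log2(c * n / (pi[a] * pj[b]))
               for (a, b), c in pair.items())

samples = sample_rotamers()
for j in range(1, 8):
    print(f"residues 0 and {j}: {mutual_information(samples, 0, j):.4f} bits")
```

In this toy chain the only route for correlation between distant residues is through the intervening side chains, since the backbone is never moved.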
The evolution of cyclodextrin glucanotransferase product specificity
Cyclodextrin glucanotransferases (CGTases) have attracted major interest from industry due to their unique capacity of forming large quantities of cyclic α-(1,4)-linked oligosaccharides (cyclodextrins) from starch. CGTases produce a mixture of cyclodextrins from starch consisting of 6 (α), 7 (β) and 8 (γ) glucose units. In an effort to identify the structural factors contributing to the evolutionary diversification of product specificity amongst this group of enzymes, we selected nine CGTases from mesophilic, thermophilic and hyperthermophilic organisms for comparative product analysis. These enzymes displayed considerable variation regarding thermostability, initial rates, percentage of substrate conversion and ratio of α-, β- and γ-cyclodextrins formed from starch. Sequence comparison of these CGTases revealed that specific incorporation and/or substitution of amino acids at the substrate binding sites, during the evolutionary progression of these enzymes, resulted in diversification of cyclodextrin product specificity.
Reduction in Structural Disorder and Functional Complexity in the Thermal Adaptation of Prokaryotes
Genomic correlates of evolutionary adaptation to very low or very high optimal growth temperature (OGT) values have been the subject of many studies. Whereas these provided a protein-structural rationale of the activity and stability of globular proteins/enzymes, the point has been neglected that adaptation to extreme temperatures could also have resulted from an increased use of intrinsically disordered proteins (IDPs), which are resistant to these conditions in vitro. Contrary to these expectations, we found a conspicuously low level of structural disorder in bacteria of very high (and very low) OGT values. This paucity of disorder does not reflect phylogenetic relatedness, i.e. it is a result of genuine adaptation to extreme conditions. Because intrinsic disorder correlates with important regulatory functions, we asked how these bacteria could exist without IDPs by studying transcription factors, known to harbor a lot of function-related intrinsic disorder. Hyperthermophiles have far fewer transcription factors, which have reduced disorder compared to their mesophilic counterparts. On the other hand, we found by systematic categorization of proteins with long disordered regions that there are certain functions, such as translation and ribosome biogenesis, that depend on structural disorder even in hyperthermophiles. In all, our observations suggest that adaptation to extreme conditions is achieved by a significant functional simplification, apparent at both the level of the genome and individual genes/proteins.