
    Exploring the Universe of Protein Structures beyond the Protein Data Bank

    It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ in contact order, which is on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Besides their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds.
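
The contact order referred to in the abstract above can be illustrated with a minimal sketch. It assumes the commonly used relative contact order (the mean sequence separation of contacting residue pairs, divided by chain length), one coordinate per residue, and an 8 Å distance cutoff. The function name, the cutoff, and the minimum sequence separation are illustrative assumptions, not the authors' implementation.

```python
from math import dist


def relative_contact_order(coords, cutoff=8.0, min_separation=2):
    """Relative contact order of a chain (illustrative sketch).

    coords: list of (x, y, z) tuples, one per residue (e.g. C-alpha atoms).
    cutoff: distance in angstroms below which two residues count as a
            contact (assumed value).
    min_separation: smallest sequence separation considered, so that
            directly bonded neighbours are ignored (assumed value).
    """
    n_res = len(coords)
    total_separation = 0
    n_contacts = 0
    for i in range(n_res):
        for j in range(i + min_separation, n_res):
            if dist(coords[i], coords[j]) <= cutoff:
                total_separation += j - i
                n_contacts += 1
    if n_contacts == 0:
        return 0.0
    # Mean sequence separation of contacts, normalised by chain length.
    return total_separation / (n_contacts * n_res)


# Hypothetical toy chains with 3.8 angstrom spacing between residues.
extended = [(3.8 * k, 0.0, 0.0) for k in range(5)]
turn = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0), (0.0, 3.8, 0.0)]
print(relative_contact_order(extended), relative_contact_order(turn))
```

In this toy case the chain that turns back on itself brings sequence-distant residues into contact and so scores higher than the extended chain; lower values correspond to shorter loops between contacting residues.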

    Effects of Clinically Relevant MPL Mutations in the Transmembrane Domain Revealed at the Atomic Level through Computational Modeling

    BACKGROUND: Mutations in the thrombopoietin receptor (MPL) may activate relevant pathways and lead to chronic myeloproliferative neoplasms (MPNs). The mechanisms of MPL activation remain elusive because of a lack of experimental structures. Modern computational biology techniques were utilized to explore the mechanisms of MPL protein activation due to various mutations. RESULTS: Transmembrane (TM) domain predictions, homology modeling, ab initio protein structure prediction, and molecular dynamics (MD) simulations were used to build structural dynamic models of wild-type and four clinically observed mutants of MPL. The simulation results suggest that S505 and W515 are important in keeping the TM domain in its correct position within the membrane. Mutations at either of these two positions cause movement of the TM domain, altering the conformation of the nearby intracellular domain in unexpected ways, and may cause the unwanted constitutive activation of MPL's kinase partner, JAK2. CONCLUSIONS: Our findings represent the first full-scale molecular dynamics simulations of the wild-type and clinically observed mutants of the MPL protein, a critical element of the MPL-JAK2-STAT signaling pathway. In contrast to usual explanations for the activation mechanism that are based on the relative translational movement between rigid domains of MPL, our results suggest that mutations within the TM region could result in conformational changes including tilt and rotation (azimuthal) angles along the membrane axis. Such changes may significantly alter the conformation of the adjacent and intrinsically flexible intracellular domain. Hence, caution should be exercised when interpreting experimental evidence based on rigid models of cytokine receptors or similar systems

    Matrix-assisted laser desorption ionization hydrogen/deuterium exchange studies to probe peptide conformational changes

    Hydrogen/deuterium (H/D) exchange chemistry monitored by matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometry is used to study solution phase conformational changes of bradykinin, α-melanocyte stimulating hormone, and melittin as water is added to methanol-d4, acetonitrile, and isopropanol-d8 solutions. The results are interpreted in terms of a preference for the peptides to acquire more compact conformations in organic solvents as compared to the random conformations. Our interpretation is supported by circular dichroism spectra of the peptides in the same solvent systems and by previously published structural data for the peptides. These results demonstrate the utility of MALDI-TOF as a method to monitor the H/D exchange chemistry of peptides and investigations of solution-phase conformations of biomolecules

    Fully automated high-quality NMR structure determination of small 2H-enriched proteins

    Determination of high-quality small protein structures by nuclear magnetic resonance (NMR) methods generally requires acquisition and analysis of an extensive set of structural constraints. The process generally demands extensive backbone and sidechain resonance assignments, and weeks or even months of data collection and interpretation. Here we demonstrate rapid and high-quality protein NMR structure generation using CS-Rosetta with a perdeuterated protein sample made at a significantly reduced cost using new bacterial culture condensation methods. Our strategy provides the basis for a high-throughput approach for routine, rapid, high-quality structure determination of small proteins. As an example, we demonstrate the determination of a high-quality 3D structure of a small 8 kDa protein, E. coli cold shock protein A (CspA), using <4 days of data collection and fully automated data analysis methods together with CS-Rosetta. The resulting CspA structure is highly converged and in excellent agreement with the published crystal structure, with a backbone RMSD value of 0.5 Å, an all atom RMSD value of 1.2 Å to the crystal structure for well-defined regions, and RMSD value of 1.1 Å to crystal structure for core, non-solvent exposed sidechain atoms. Cross validation of the structure with 15N- and 13C-edited NOESY data obtained with a perdeuterated 15N, 13C-enriched 13CH3 methyl protonated CspA sample confirms that essentially all of these independently-interpreted NOE-based constraints are already satisfied in each of the 10 CS-Rosetta structures. By these criteria, the CS-Rosetta structure generated by fully automated analysis of data for a perdeuterated sample provides an accurate structure of CspA. This represents a general approach for rapid, automated structure determination of small proteins by NMR

    βα-Hairpin Clamps Brace βαβ Modules and Can Make Substantive Contributions to the Stability of TIM Barrel Proteins

    Non-local hydrogen bonding interactions between main chain amide hydrogen atoms and polar side chain acceptors that bracket consecutive βα or αβ elements of secondary structure in αTS from E. coli, a TIM barrel protein, have previously been found to contribute 4–6 kcal mol⁻¹ to the stability of the native conformation. Experimental analysis of similar βα-hairpin clamps in a homologous pair of TIM barrel proteins of low sequence identity, IGPS from S. solfataricus and E. coli, reveals that this dramatic enhancement of stability is not unique to αTS. A survey of 71 TIM barrel proteins demonstrates a 4-fold symmetry for the placement of βα-hairpin clamps, bracing the fundamental βαβ building block and defining its register in the (βα)₈ motif. The preferred sequences and locations of βα-hairpin clamps will enhance structure prediction algorithms and provide a strategy for engineering stability in TIM barrel proteins

    Solvent accessible surface area approximations for rapid and accurate protein structure prediction

    The burial of hydrophobic amino acids in the protein core is a driving force in protein folding. The extent to which an amino acid interacts with the solvent and the protein core is naturally proportional to the surface area exposed to these environments. However, an accurate calculation of the solvent-accessible surface area (SASA), a geometric measure of this exposure, is numerically demanding as it is not pair-wise decomposable. Furthermore, it depends on a full-atom representation of the molecule. This manuscript introduces a series of four SASA approximations of increasing computational complexity and accuracy as well as knowledge-based environment free energy potentials based on these SASA approximations. Their ability to distinguish correctly from incorrectly folded protein models is assessed to balance speed and accuracy for protein structure prediction. We find that the newly developed “Neighbor Vector” algorithm provides the best balance of accurate yet rapid exposure measures

    Near-Native Protein Loop Sampling Using Nonparametric Density Estimation Accommodating Sparcity

    Unlike the core structural elements of a protein like regular secondary structure, template based modeling (TBM) has difficulty with loop regions due to their variability in sequence and structure as well as the sparse sampling from a limited number of homologous templates. We present a novel, knowledge-based method for loop sampling that leverages homologous torsion angle information to estimate a continuous joint backbone dihedral angle density at each loop position. The φ,ψ distributions are estimated via a Dirichlet process mixture of hidden Markov models (DPM-HMM). Models are quickly generated based on samples from these distributions and are enriched using an end-to-end distance filter. The performance of the DPM-HMM method was evaluated against a diverse test set in a leave-one-out approach. Candidates as low as 0.45 Å RMSD and with a worst case of 3.66 Å were produced. For the canonical loops like the immunoglobulin complementarity-determining regions (mean RMSD <2.0 Å), the DPM-HMM method performs as well as or better than the best templates, demonstrating that our automated method recaptures these canonical loops without inclusion of any IgG specific terms or manual intervention. In cases with poor or few good templates (mean RMSD >7.0 Å), this sampling method produces a population of loop structures to around 3.66 Å for loops up to 17 residues. In a direct test of sampling against the Loopy algorithm, our method demonstrates the ability to sample nearer native structures for both the canonical CDRH1 and non-canonical CDRH3 loops. Lastly, in the realistic test conditions of the CASP9 experiment, successful application of DPM-HMM for 90 loops from 45 TBM targets shows the general applicability of our sampling method to the loop modeling problem. These results demonstrate that our DPM-HMM produces an advantage by consistently sampling near native loop structure. The software used in this analysis is available for download at http://www.stat.tamu.edu/~dahl/software/cortorgles/

    Intrinsic Order and Disorder in the Bcl-2 Member Harakiri: Insights into Its Proapoptotic Activity

    Harakiri is a BH3-only member of the Bcl-2 family that localizes in membranes and induces cell death by binding to prosurvival Bcl-xL and Bcl-2. The cytosolic domain of Harakiri is largely disordered with residual α-helical conformation according to previous structural studies. As these helical structures could play an important role in Harakiri's function, we have used NMR and circular dichroism to fully characterize them at the residue-atomic level. In addition, we report structural studies on a peptide fragment spanning Harakiri's C-terminal hydrophobic sequence, which potentially operates as a transmembrane domain. We initially checked by enzyme immunoassays and NMR that peptides encompassing different lengths of the cytosolic domain are functional as they bind Bcl-xL and Bcl-2. The structural data in water indicate that the α-helical conformation is restricted to a 25-residue segment comprising the BH3 domain. However, structure calculation was precluded because of insufficient NMR restraints. To bypass this problem we used an alcohol-water mixture to increase structure population and confirmed by NMR that the conformation in both milieus is equivalent. The resulting three-dimensional structure closely resembles that of peptides encompassing the BH3 domain of BH3-only members in complex with their prosurvival partners, suggesting that preformed structural elements in the disordered protein are central to binding. In contrast, the transmembrane domain forms in micelles a monomeric α-helix with a population close to 100%. Its three-dimensional structure, reported here, reveals features that explain its function as a membrane anchor. Altogether these results are used to propose a tentative structural model of how Harakiri works