Article thumbnail
Location of Repository

Early-Stage Folding in Proteins (In Silico) Sequence-to-Structure Relation

By Michał Brylinski, Leszek Konieczny, Patryk Czerwonko, Wiktor Jurkowski and Irena Roterman


A sequence-to-structure library has been created based on the complete PDB database. The tetrapeptide was selected as a unit representing a well-defined structural motif. Seven structural forms were introduced for structure classification. The early-stage folding conformations were used as the objects for structure analysis and classification. The degree of determinability was estimated for the sequence-to-structure and structure-to-sequence relations. Probability calculus and informational entropy were applied for quantitative estimation of the mutual relation between them. The structural motifs representing different forms of loops and bends were found to favor particular sequences in structure-to-sequence analysis

Topics: Research Article
Publisher: Hindawi Publishing Corporation
Year: 2005
DOI identifier: 10.1155/JBB.2005.65
OAI identifier:
Provided by: PubMed Central

Suggested articles


  1. A hidden Markov model for local sequence-structure correlations in proteins.
  2. A mathematical theory of communication.
  3. (2005). A method for optimizing potential-energy functions by a hierarchical design of the potential-energy landscape:
  4. (2004). A native-like ar-2005:2 (2005) Early-Stage Folding—Sequence-to-Structure Relation 79 tificial protein from antisense DNA. Protein Eng Des Sel.
  5. A novel fingerprint for the characterization of protein folds.
  6. A novel super-secondary structure of proteins and the relation between the structure and theaminoacidsequence.FEBSLett.1984;166(1):33– 38.
  7. (1994). A revised set of potentials for beta-turn formation in proteins. Protein Science.
  8. A segment-based approach to protein secondary structure prediction.
  9. (2005). Algorithms for prediction of alpha-helical and beta-structural regions in globular proteins.
  10. Alpha helix capping in synthetic model peptides by reciprocal side chain-main chain interactions: evidence for an N terminal “capping box”.
  11. (1986). Amino acid sequence homology applied to the prediction of protein secondary structures, and joint prediction with existing methods. Biochim Biophys Acta.
  12. An algorithm for secondary structure determination in proteins based on sequence similarity.
  13. Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins.
  14. (2003). and flexibility: prediction from protein sequence. Structure (Camb).
  15. Automatic identification of secondary structure in globular proteins.
  16. (2001). Coarse semiempirical solution to the protein folding problem. Physica A.
  17. Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins.
  18. Conformational subspace in simulation of early-stage protein folding.
  19. Conservation analysis and structure prediction of the SH2 family of phosphotyrosine binding domains.
  20. Cummulation-based expression for the multibody terms for the correction between local and electrostatic interaction in the united residue force field.
  21. De novo prediction of three-dimensional structures for major protein families.
  22. Describing protein structure: a general algorithm yielding complete helicoidal parameters and a unique overall axis.
  23. Dictionary of protein secondary structure: pattern recognition of hydrogenbonded and geometrical features.
  24. Distinguishing foldable proteins from nonfolders: when and how do they differ?
  25. DysonHJ,WrightPE.Coupling offolding andbinding for unstructured proteins.
  26. Fasman GD. Prediction of protein conformation.
  27. (2002). Fully automated ab initio protein structure prediction using I-SITES, HMMSTR
  28. (1996). Global properties of the mapping between local amino acid sequence and local structure in proteins.
  29. GlobPlot: exploring protein sequences for globularity and disorder.
  30. Helix stop signals in proteins and peptides: the capping box.
  31. Identification of structuralmotifsfromproteincoordinatedata:secondary structure and first-level supersecondary structure.
  32. Improvements in the prediction of protein backbone topography by reduction of statistical errors.
  33. Intrinsic disorder and protein function.
  34. (2001). Intrinsically disordered protein. JM o lG r a p hM o d e l .
  35. (1999). Intrinsically unconstructed proteins: re-assessing the protein structure-function paradigm.
  36. Limitation of conformational space for proteins— early-stage folding simulation of human α and β hemoglobin chains.
  37. Limitedconformationalspaceforearly-stageprotein folding simulation.
  38. Loops in globular proteins: a novel category of secondary structure.
  39. (2004). Lysozyme folded in silico according to the limited conformational sub-space. JB i o m o lS t r u c tD y n .
  40. Modelling the optimal simulation path in the peptide chain folding—studies based on geometry of alanine heptapeptide.
  41. (1998). Molecular dynamics simulations of hydrophobic collapse of ubiquitin. Protein Sci.
  42. (2003). NORSp: predictions of long regions without regular secondary structure. Nucleic Acids Res.
  43. On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations.
  44. Optimally informative backbone structural propensities in proteins.
  45. (2000). PappuRV,SrinivasanR,RoseGD.Thefloryisolatedpair hypothesis is not valid for polypeptide chains: implications for protein folding.
  46. (1989). Patterns of divergence in homologous proteins as indicators of tertiary and quaternary structure. Adv Enzyme Regul.
  47. Predicted secondary structure for the Src homology 3 domain.
  48. Predicting the secondary structure of globular proteins using neural network models.
  49. Prediction of local structure in proteins using a library of sequence-structure motifs.
  50. Prediction of protein secondary structure and active sites using the alignment of homologous sequences.
  51. Prediction of protein secondary structure at better than 70% accuracy.
  52. Prediction of protein secondary structure by the hidden Markov model.
  53. Prediction of protein structure by simulating coarse-grained folding pathways: a preliminary report. JB i o m o lS t r u c tD y n .2004;21(5):625– 638.
  54. Prediction of protein structure.
  55. Prediction of secondary structure by evolutionary comparison: application to the alpha subunit of tryptophan synthase.
  56. Predictions without templates: new folds, secondary structure, and contacts
  57. Protein secondary structure prediction based on position-specific scoring matrices.
  58. Protein secondary structure prediction using local alignments.
  59. Protein secondary structure prediction using nearest-neighbor methods.
  60. (1989). Protein secondary structure prediction with a neural network.
  61. (1993). Quantification of secondary structure prediction improvement using multiple alignments. Protein Eng.
  62. (1994). Redefining the goals ofproteinsecondarystructureprediction.JMolBiol.
  63. Relationships between amino acid sequence and backbone torsion angle preferences.
  64. Role of connections in the formation of protein structures, containing 4-helical segments.
  65. Rules for alpha-helix termination by glycine.
  66. Secondary structure prediction: combination of three different methods. Protein Eng.
  67. Single-body residue-level knowledge-based energy score combined with sequence-profile and secondary structure information for fold recognition.
  68. SPI— structure predictability index for proteins.
  69. (1993). Structural analysis based on state-space modeling. Protein Sci.
  70. structures, and amino acid frequencies in structural building blocks, a protein secondary structure classification scheme.
  71. Taxonomy and conformational analysis of loops in proteins.
  72. The geometrical analysis of peptide backbone structure and its local deformations.
  73. The GOR method for predicting secondary structures in proteins. In: Fasman GD,ed.PredictionofProteinStructureandthePrinciples of Protein Conformation.
  74. The protein data bank.
  75. The use of amino acid patterns of classified helices and strands in secondary structure prediction.
  76. (1998). Thousands of proteins likely to have long disordered regions. Pac Symp Biocomput.
  77. UverskyVN,GillespieJR,FinkAL.Whyare“natively unfolded” proteins unstructured under physiologic conditions?
  78. V u c e t i cS ,B r o w nC J ,D u n k e rA K ,O b r a d o v i cZ .F l a -vors of protein disorder.
  79. (2004). WieY,HechtMH.Enzyme-likeproteinsfromanunselected library of designed amino acid sequences. Protein Eng Des Sel.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.