
Protein Domain Boundary Predictions: A Structural Biology Perspective

By Svetlana Kirillova, Suresh Kumar and Oliviero Carugo


Structural biology is one of the important fields in which computational tools for domain boundary prediction can be applied. Such tools can be used to design protein constructs that must be expressed in a stable and functional form and must yield diffraction-quality crystals. However, predicting protein domain boundaries from amino acid sequences is still very problematic. In the present study, the performance of several computational approaches is compared. It is observed that the statistical significance of most of the predictions is rather poor. Nevertheless, when the number of domains is correctly predicted, the domain boundaries are predicted within very few residues of their real location. It can be concluded that prediction methods cannot yet be used as routine tools in structural biology, though some of them are rather promising.

Topics: Article
Publisher: Bentham Open
Provided by: PubMed Central

