Article thumbnail
Location of Repository

An automatic method for assessing structural importance of amino acid positions

By M.I. Sadowski and D.T. Jones

Abstract

Background: A great deal is known about the qualitative aspects of the sequence-structure relationship, for example that buried residues are usually more conserved between structurally similar homologues, but no attempts have been made to quantitate the relationship between evolutionary conservation at a sequence position and change to global tertiary structure. In this paper we demonstrate that the Spearman correlation between sequence and structural change is suitable for this purpose. Results: Buried residues, bends, cysteines, prolines and leucines were significantly more likely to occupy positions highly correlated with structural change than expected by chance. Some buried residues were found to be less informative than expected, particularly residues involved in active sites and the binding of small molecules. Conclusion: The correlation-based method generates predictions of structural importance for superfamily positions which agree well with previous results of manual analyses, and may be of use in automated residue annotation piplines. A PERL script which implements the method is provided

Year: 2009
OAI identifier: oai:eprints.ucl.ac.uk.OAI2:20124
Provided by: UCL Discovery

Suggested articles

Citations

  1. (2003). A: Automatic methods for predicting functionally important residues.
  2. (1992). Anatomy and evolution of proteins displaying the viral capsid jellyroll topology.
  3. (2007). Chothia C: The selection of acceptable protein mutations.
  4. (1983). Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers
  5. (2007). DT: Benchmarking template selection and model quality assessment for high-resolution comparative modeling. Proteins: Struct Func Bioinf
  6. (2003). G: The European Bioinformatics Institute's data resources. Nucl Acid Res
  7. (1992). Henikoff JG: Amino-acid substitution matrices from protein blocks.
  8. (1980). How different amino-acid-sequences determine similar protein structures – structure and evolutionary dynamics of the globins.
  9. (2005). Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design. Nucl Acid Res
  10. (1987). Lesk AM: Determinants of a protein fold – unique features of the globin amino-acid-sequences.
  11. (1985). Lesk AM: Helix movements and the reconstruction of the heme pocket during the evolution of the cytochromeC family.
  12. (2004). Notredame C: 3DCoffee: Combining Protein Sequences and Structures within Multiple Sequence Alignments.
  13. (2007). On the relationship between sequence and structure similarities in proteomics. Bioinformatics
  14. Orengo C: The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucl Acid Res 33:D247-D251.
  15. (1984). Principles that determine the structures of proteins. Ann Rev Biochem
  16. (1999). Protein structure comparison using iterated double dynamic programming. Protein Sci
  17. Scoring amino acid conservation. Proteins: Struct Func Genet 2002, 43:227-241. Additional file 2 Sets of quality values, multiple sequence alignment (aln format) and CC values for each of the six datasets. Click here for file
  18. (1994). Structural features can be unconserved in proteins with similar folds: An analysis of side chain to side-chain contacts, secondary structure and accessibility.
  19. (1992). The rapid generation of mutation data matrices from protein sequences. CABIOS
  20. (1999). The structural and functional analysis of the hemoglobin D component from chicken.
  21. (2003). Thorne JL: Protein evolution with dependence among codons due to tertiary structure. Mol Biol Evol
  22. (2007). TL: A structural bioinformatics approach to the analysis of nonsynonymous single nucleotide polymorphisms (nsSNPs) and their relation to disease.
  23. (2005). TM-align: a protein structure alignment algorithm based on the TM-Score. Nucl Acid Res
  24. (1999). TP: Modulation of Phospholipase A2 activity generated by molecular evolution. Cell Mol Life Sci
  25. (2008). ZD: An improved prediction of catalytic residues in enzyme structures. PEDS

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.