Article thumbnail

Predicting conserved protein motifs with Sub-HMMs

By Kevin Horan, Christian R Shelton and Thomas Girke
Topics: Research article
Publisher: BioMed Central
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2005). A new approach for HMM based protein sequence family modeling and its application to remote homology classification. Statistical Signal Processing,
  2. (2008). A probabilistic model of local sequence alignment that simplifies statistical significance estimation. PLoS Comput Biol
  3. A tutorial on hidden Markov models and selected applications in speech recognition.
  4. A: Pfam: clans, web tools and services.
  5. A: The Pfam protein families database.
  6. AJ: MEROPS: the peptidase database.
  7. (1995). Automated construction and graphical presentation of protein blocks from unaligned sequences. Gene
  8. (2002). Bairoch A: ScanProsite: a reference implementation of a PROSITE scanning tool. Appl Bioinformatics
  9. Ben-Tal N: ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures.
  10. (1998). Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Cambridge
  11. C: New developments in the InterPro database.
  12. (2008). C: The 20 years of PROSITE.
  13. (2005). Calibrating E-values for hidden Markov models using reverse-sequence null models. Bioinformatics
  14. CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues.
  15. (2008). Characterization and prediction of residues determining protein functional specificity. Bioinformatics
  16. (2007). Comparing clusterings--an information based distance.
  17. (2009). Designing Patterns and Profiles for Faster HMM Search.
  18. (1995). Elkan C: Unsupervised Learning of Multiple Motifs in Biopolymers Using Expectation Maximization. Machine Learning
  19. (2004). Feature extraction for improved Profile HMM based biological sequence analysis.
  20. (1994). Haussler D: Hidden Markov Models in Computational Biology: Applications to Protein Modeling.
  21. (1998). Hidden Markov models for detecting remote protein homologies. Bioinformatics
  22. (1994). Hidden Markov Models of Biological Primary Sequence Information.
  23. (1996). Hidden Markov models.
  24. (2008). HMMEditor: a visual editing tool for profile hidden Markov model.
  25. (2007). Identification of amino acid residues involved in substrate specificity of plant acyl-ACP thioesterases using a bioinformatics-guided approach.
  26. (2004). JM: Searching for functional sites in protein structures. Curr Opin Chem Biol
  27. (1973). Jr: The Viterbi algorithm.
  28. Kolchanov NA: PDBSite: a database of the 3D structure of protein functional sites.
  29. (2005). Lengauer T: ROCR: visualizing classifier performance in R. Bioinformatics
  30. (1997). Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res
  31. (1977). Maximum likelihood from incomplete data via the EM algorithm.
  32. (1997). Meta-MEME: motif-based hidden Markov models of biological sequences.
  33. (1995). Multiple alignment using hidden Markov models.
  34. (1951). On information and sufficiency.
  35. (2007). Orengo CA: CATHEDRAL: a fast and effective algorithm to predict folds and domain boundaries from multidomain protein structures. PLoS Comput Biol
  36. (1997). Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins Structure Function and Genetics
  37. (2007). Predicting active site residue annotations in the Pfam database.
  38. (2007). Predicting functionally important residues from sequence conservation. Bioinformatics
  39. (2008). Prediction of protein functional residues from sequence by probability density estimation. Bioinformatics
  40. (1963). Probability Inequalities for Sums of Bounded Random Variables.
  41. (2008). Profile Comparer: a program for scoring and aligning profile hidden Markov models. Bioinformatics
  42. (2006). Protein binding site prediction using an empirical scoring function. Nucleic Acids Res
  43. (2005). Regan L: Sequence variation in ligand binding sites in proteins.
  44. (2009). ResBoost: characterizing and predicting catalytic residues in enzymes.
  45. Sigrist CJ: The PROSITE database.
  46. (2008). Sjölander K: INTREPID-INformation-theoretic TREe traversal for Protein functional site IDentification. Bioinformatics
  47. SR: The Pfam protein families database.
  48. (2003). Taylor WR: Protein fold comparison by the alignment of topological strings. Protein Eng
  49. (2004). The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Nucl Acids Res
  50. (1994). TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res
  51. Tress ML: Firestar-prediction of functionally important residues using structural templates and alignment reliability.