STRING 7—recent developments in the integration and prediction of protein interactions

By Christian von Mering, Lars J. Jensen, Michael Kuhn, Samuel Chaffron, Tobias Doerks, Beate Krüger, Berend Snel and Peer Bork


Information on protein–protein interactions is still mostly limited to a small number of model organisms, and originates from a wide variety of experimental and computational techniques. The database and online resource STRING generalizes access to protein interaction data, by integrating known and predicted interactions from a variety of sources. The underlying infrastructure includes a consistent body of completely sequenced genomes and exhaustive orthology classifications, based on which interaction evidence is transferred between organisms. Although primarily developed for protein interaction analysis, the resource has also been successfully applied to comparative genomics, phylogenetics and network studies, which are all facilitated by programmatic access to the database backend and the availability of compact download files. As of release 7, STRING has almost doubled to 373 distinct organisms, and contains more than 1.5 million proteins for which associations have been pre-computed. Novel features include AJAX-based web-navigation, inclusion of additional resources such as BioGRID, and detailed protein domain annotation. STRING is available a

Publisher: Oxford University Press
Provided by: PubMed Central