
GS2: an efficiently computable measure of GO-based similarity of gene sets

By Troy Ruths, Derek Ruths and Luay Nakhleh


Motivation: The growing availability of genome-scale datasets has attracted increasing attention to the development of computational methods for automated inference of functional similarities among genes and their products. One class of such methods measures the functional similarity of genes based on their distance in the Gene Ontology (GO). To measure the functional relatedness of a gene set, these measures consider every pair of genes in the set and calculate the average of all pairwise distances. However, as more data become available and the gene sets used for analysis grow larger, such pair-based calculation becomes prohibitive.
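To make the cost of the pair-based approach concrete, the following is a minimal sketch of averaging a distance over every pair of genes in a set. It is not the GS2 measure and not any specific published GO distance: the function names, the toy annotation IDs, and the Jaccard distance on annotation sets are all assumptions chosen only for illustration. A set of n genes yields n(n-1)/2 pairs, which is why the computation grows quadratically.

```python
from itertools import combinations
from typing import Callable, Dict, FrozenSet, Sequence

Annotations = Dict[str, FrozenSet[str]]


def jaccard_distance(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Placeholder distance between two annotation sets (an assumption,
    standing in for a real GO-based distance)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def average_pairwise_distance(
    genes: Sequence[str],
    annotations: Annotations,
    distance: Callable[[FrozenSet[str], FrozenSet[str]], float] = jaccard_distance,
) -> float:
    """Average the distance over all unordered gene pairs.

    The number of pairs is n * (n - 1) / 2, so the work is quadratic
    in the size of the gene set.
    """
    pairs = list(combinations(genes, 2))
    if not pairs:
        return 0.0
    total = sum(distance(annotations[g], annotations[h]) for g, h in pairs)
    return total / len(pairs)


# Hypothetical gene names and term IDs, for illustration only.
toy_annotations: Annotations = {
    "g1": frozenset({"T1", "T2"}),
    "g2": frozenset({"T2", "T3"}),
    "g3": frozenset({"T1", "T2"}),
}
result = average_pairwise_distance(["g1", "g2", "g3"], toy_annotations)
```

For a set of 1,000 genes this loop evaluates 499,500 pairs, and for 10,000 genes it evaluates 49,995,000, which illustrates the scaling problem the abstract describes.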

Topics: Original Papers
Publisher: Oxford University Press
Provided by: PubMed Central
