Skip to main content
Article thumbnail
Location of Repository

Systematic analysis of experimental phenotype data reveals gene functions

By Robert Hoehndorf, Nigel William Hardy, David Osumi-Sutherland, Susan Tweedie, Paul N Schofield and Georgios Gkoutos


High-throughput phenotyping projects in model organisms have the potential to improve our understanding of gene functions and their role in living organisms. We have developed a computational, knowledge-based approach to automatically infer gene functions from phenotypic manifestations and applied this approach to yeast (Saccharomyces cerevisiae), nematode worm , (Caenorhabditis elegans), zebrafish (Danio rerio:), fruitfly (Drosophila melanogaster) and mouse (Mus musculus:) phenotypes. Our approach is based on the assumption that, if a mutation in a gene G leads to a phenotypic abnormality in a process P, then G must have been involved in P, either directly or indirectly. We systematically analyze recorded phenotypes in animal models using the formal denfiitions created for phenotype ontologies. We evaluate the validity of the inferred functions manually and by demonstrating a significant improvement in predicting genetic interactions and protein-protein interactions based on functional similarity. Our knowledge-based approach is generally applicable to phenotypes recorded in model organism databases, including phenotypes cataloged by consortia that are not recorded in the literature.publishersversionPeer reviewe

Year: 2013
DOI identifier: 10.1371/journal.pone.0060847
OAI identifier:

Suggested articles


  1. (1960). A coefficient of agreement for nominal scales. doi
  2. (2010). A guide to web tools to prioritize candidate genes. doi
  3. (2005). An evaluation of GO annotation retrieval for BioCreAtIvE and GOA. doi
  4. (2006). An introduction to ROC analysis. doi
  5. (2009). An orthogonal proteomicgenomic screen identifies aim2 as a cytoplasmic dna sensor for the inammasome.
  6. (2013). Analysis of Phenotypes Reveals Gene Functions
  7. (2004). Bidirectional signaling mediated by ephrin-b2 and ephb2 controls urorectal development. doi
  8. (2006). BioGRID: a general repository for interaction datasets. doi
  9. (2006). Combining evidence, biomedical literature and statistical dependence: new insights for functional annotation of gene sets.
  10. (2008). Combining evidence, specificity, and proximity towards the normalization of gene ontology terms in text. EURASIP journal on bioinformatics & systems biology. doi
  11. (2012). CvManGO, a method for leveraging computational predictions to improve literature-based Gene Ontology annotations. Database doi
  12. (2008). Evaluation of GO-based functional similarity measures using s. cerevisiae protein interaction and expression profile data. doi
  13. (2008). FlyBase Consortium doi
  14. (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. doi
  15. (2003). Investigating semantic similarity measures across the gene ontology: the relationship between sequence and annotation. doi
  16. (2012). Ontology-based cross-species integration and analysis of saccharomyces cerevisiae phenotypes. doi
  17. (2011). PREDICT: a method for inferring novel drug indications with application to personalized medicine. doi
  18. (2006). Predicting gene functions from text using a crossspecies approach. In: doi
  19. (2009). QuickGO: a web-based tool for gene ontology searching. doi
  20. (2011). Semantic similarity analysis of protein data: assessment with biological features and issues. doi
  21. (2009). Semantic similarity in biomedical ontologies. doi
  22. (2011). The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics. doi
  23. (2011). The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. doi
  24. (2011). Worm phenotype ontology: integrating phenotype data within and beyond the C. elegans community. doi
  25. (2010). WormBase: a comprehensive resource for nematode research. doi
  26. (2011). ZFIN: enhancements and updates to the zebrafish model organism database. doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.