
A Gene-Phenotype Network for the Laboratory Mouse and Its Implications for Systematic Phenotyping

By Octavio Espinosa and John M. Hancock


The laboratory mouse is the pre-eminent model organism for the dissection of human disease pathways. With the advent of a comprehensive panel of gene knockouts, projects to characterise the phenotypes of all knockout lines are being initiated. The range of genotype-phenotype associations can be represented using the Mammalian Phenotype ontology. Using publicly available data annotated with this ontology, we have constructed gene and phenotype networks representing these associations. These networks show a scale-free, hierarchical and modular character and community structure. They also exhibit enrichment for gene coexpression, protein-protein interactions and Gene Ontology annotation similarity. Close association between gene communities and some high-level ontology terms suggests that systematic phenotyping can provide a direct insight into underlying pathways. However, some phenotypes are distributed more diffusely across gene networks, likely reflecting the pleiotropic roles of many genes. Phenotype communities show a many-to-many relationship to human disease communities, but stronger overlap at more granular levels of description. This may suggest that systematic phenotyping projects should aim for high-granularity annotations to maximise their relevance to human disease.
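As a rough illustration of how gene and phenotype networks might be derived from gene-to-phenotype annotations, the sketch below projects a small bipartite annotation table onto each side. The abstract does not describe the construction, so the linking rule here is an assumption: two genes are linked when they share at least one phenotype term, and two phenotype terms are linked when they share at least one gene. The gene names, the annotations, and the helper functions `project` and `invert` are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy annotations (gene -> phenotype terms); not data from the paper.
annotations = {
    "GeneA": {"abnormal heart morphology", "decreased body weight"},
    "GeneB": {"abnormal heart morphology"},
    "GeneC": {"decreased body weight", "abnormal gait"},
    "GeneD": {"abnormal gait"},
}


def project(memberships):
    """Link two keys when their member sets overlap.

    Returns a dict mapping each linked pair (in sorted order) to the
    number of members the two keys share, used here as an edge weight.
    """
    edges = {}
    for a, b in combinations(sorted(memberships), 2):
        shared = memberships[a] & memberships[b]
        if shared:
            edges[(a, b)] = len(shared)
    return edges


def invert(memberships):
    """Turn gene -> phenotypes into phenotype -> genes."""
    inverted = defaultdict(set)
    for key, members in memberships.items():
        for member in members:
            inverted[member].add(key)
    return dict(inverted)


# Assumed rule: genes are linked by shared phenotype terms ...
gene_network = project(annotations)
# ... and phenotype terms are linked by shared genes.
phenotype_network = project(invert(annotations))

for (a, b), weight in gene_network.items():
    print(f"{a} -- {b} (shared phenotypes: {weight})")
for (a, b), weight in phenotype_network.items():
    print(f"{a} -- {b} (shared genes: {weight})")
```

Edge lists of this kind could then be passed to a graph library for degree-distribution or community analysis. Whether the authors weighted or filtered edges in this way is not stated in the abstract.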

Topics: Research Article
Publisher: Public Library of Science
OAI identifier:
Provided by: PubMed Central

