Article thumbnail

ToppCluster: a multiple gene list feature analyzer for comparative enrichment clustering and network-based dissection of biological systems

By Vivek Kaimal, Eric E. Bardes, Scott C. Tabar, Anil G. Jegga and Bruce J. Aronow

Abstract

ToppCluster is a web server application that leverages a powerful enrichment analysis and underlying data environment for comparative analyses of multiple gene lists. It generates heatmaps or connectivity networks that reveal functional features shared or specific to multiple gene lists. ToppCluster uses hypergeometric tests to obtain list-specific feature enrichment P-values for currently 17 categories of annotations of human-ortholog genes, and provides user-selectable cutoffs and multiple testing correction methods to control false discovery. Each nameable gene list represents a column input to a resulting matrix whose rows are overrepresented features, and individual cells per-list P-values and corresponding genes per feature. ToppCluster provides users with choices of tabular outputs, hierarchical clustering and heatmap generation, or the ability to interactively select features from the functional enrichment matrix to be transformed into XGMML or GEXF network format documents for use in Cytoscape or Gephi applications, respectively. Here, as example, we demonstrate the ability of ToppCluster to enable identification of list-specific phenotypic and regulatory element features (both cis-elements and 3′UTR microRNA binding sites) among tissue-specific gene lists. ToppCluster’s functionalities enable the identification of specialized biological functions and regulatory networks and systems biology-based dissection of biological states. ToppCluster can be accessed freely at http://toppcluster.cchmc.org

Topics: Articles
Publisher: Oxford University Press
OAI identifier: oai:pubmedcentral.nih.gov:2896202
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles

Citations

  1. (2009). Bioinformatics enrichment tools: paths toward the comprehensive
  2. (2003). DAVID: Database for annotation, visualization, and integrated discovery.
  3. (2004). FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes.
  4. (2007). g:Profiler–a web-based toolset for functional profiling of gene lists from large-scale experiments.
  5. (2005). Ontological analysis of gene expression data: current tools, limitations, and open problems.