2010 IEEE International Conference on Bioinformatics and Biomedicine

A Supervised Learning Approach to the Unsupervised Clustering of Genes

By Andrew Rider, Scott Emrich, Michael Ferdig and Nitesh V. Chawla

Abstract

Clustering is a common step in the analysis of microarray data. Microarrays enable simultaneous high-throughput measurement of the expression levels of genes. These data can be used to explore relationships between genes and can guide drug development and further research. A typical first step in the analysis of these data is to apply an agglomerative hierarchical clustering algorithm to the correlations between all gene pairs. While this simple approach has been successful, it fails to identify many genetic interactions that may be important for drug design and other applications. We present an approach to the clustering of expression data that uses known gene-gene interaction data to improve the results of commonly used clustering techniques. The approach creates an ensemble similarity measure that can be used as input to common clustering techniques and provides results with increased biological significance without altering the clustering approach itself.

Keywords: clustering; ensemble; random subspaces; classifier; microarray
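
The typical first step mentioned in the abstract, agglomerative hierarchical clustering on the correlation between all gene pairs, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`pearson`, `correlation_distances`, `average_linkage`), the choice of average linkage, the distance `1 - correlation`, and the toy expression values are all assumptions made for illustration.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_distances(profiles):
    """Map each gene pair (i, j), i < j, to the distance 1 - correlation."""
    return {(i, j): 1.0 - pearson(profiles[i], profiles[j])
            for i, j in combinations(range(len(profiles)), 2)}


def average_linkage(dist, n_items, n_clusters):
    """Agglomerative clustering: repeatedly merge the two closest clusters."""
    clusters = [[i] for i in range(n_items)]

    def cluster_distance(a, b):
        # Average pairwise distance between members of the two clusters.
        pairs = [(min(i, j), max(i, j)) for i in a for j in b]
        return sum(dist[p] for p in pairs) / len(pairs)

    while len(clusters) > n_clusters:
        a, b = min(combinations(range(len(clusters)), 2),
                   key=lambda ab: cluster_distance(clusters[ab[0]],
                                                   clusters[ab[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return [sorted(c) for c in clusters]


# Toy expression matrix (made-up values): rows are genes, columns are samples.
profiles = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.1, 6.0, 8.2],
    [4.0, 3.0, 2.0, 1.0],
    [8.0, 6.1, 4.0, 2.1],
]
dist = correlation_distances(profiles)
clusters = average_linkage(dist, len(profiles), n_clusters=2)
print(clusters)
```

Because `average_linkage` only consumes a table of pairwise distances, a different pairwise measure, such as a distance derived from the ensemble similarity the abstract describes, could in principle be passed as `dist` without changing the clustering routine.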

Year: 2013
OAI identifier: oai:CiteSeerX.psu:10.1.1.363.4241
Provided by: CiteSeerX
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://www.cse.nd.edu/~nchawla... (external link)