
A temporal precedence based clustering method for gene expression microarray data

By Ritesh V. Krishna, Chang-Tsun Li and Vicky Buchanan-Wollaston


Background: Time-course microarray experiments produce data that can help in understanding the underlying dynamics of a biological system. Clustering is an important stage in microarray data analysis, in which the data are grouped according to shared characteristics. The majority of clustering techniques are based on distance or visual similarity measures, which may not be suitable for temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique for clustering temporal microarray gene expression data. It measures the interdependence between two time series by statistically testing whether one time series can be used to forecast the other.

Results: A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and on real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results against the existing biological literature. We also report interesting structural properties of the association network that are commonly desired in any biological system.

Conclusions: Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple to implement and is statistically traceable at each step. The method can produce sets of functionally related genes, which can be further used for reverse-engineering of gene circuits.
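The pipeline outlined in the abstract (pairwise Granger causality tests, an association matrix, then a graph analysis) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names, the single-lag model, the fixed F-statistic cutoff (a real analysis would normally convert the statistic to a p-value from the F distribution and correct for multiple testing), and the use of plain connected components as a simple stand-in for the paper's graph-theoretic step, which the abstract does not specify.

```python
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def rss(X, y):
    """Residual sum of squares of the ordinary least-squares fit of y on X."""
    p = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    Xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    beta = solve(XtX, Xty)
    return sum((yi - sum(b * v for b, v in zip(beta, row))) ** 2
               for row, yi in zip(X, y))

def granger_f(x, y, lag=1):
    """F statistic for the hypothesis that lagged x helps forecast y."""
    times = range(lag, len(y))
    target = [y[t] for t in times]
    # Restricted model: y explained by its own past only.
    restricted = [[1.0] + [y[t - k] for k in range(1, lag + 1)] for t in times]
    # Full model: the past of x is added as extra regressors.
    full = [r + [x[t - k] for k in range(1, lag + 1)]
            for r, t in zip(restricted, times)]
    rss_r, rss_f = rss(restricted, target), rss(full, target)
    dof = len(target) - (2 * lag + 1)
    return ((rss_r - rss_f) / lag) / (rss_f / dof)

def association_matrix(series, cutoff, lag=1):
    """adj[i][j] = 1 when series i passes the test as a predictor of series j."""
    g = len(series)
    return [[1 if i != j and granger_f(series[i], series[j], lag) > cutoff else 0
             for j in range(g)] for i in range(g)]

def connected_components(adj):
    """Group nodes linked in either direction (simple stand-in for module detection)."""
    g, seen, comps = len(adj), set(), []
    for start in range(g):
        if start in seen:
            continue
        seen.add(start)
        stack, comp = [start], []
        while stack:
            u = stack.pop()
            comp.append(u)
            for v in range(g):
                if v not in seen and (adj[u][v] or adj[v][u]):
                    seen.add(v)
                    stack.append(v)
        comps.append(sorted(comp))
    return comps

# Toy data (hypothetical): g1 follows g0 with a one-step delay, g3 follows g2.
rng = random.Random(0)
n = 60
g0 = [rng.gauss(0, 1) for _ in range(n)]
g2 = [rng.gauss(0, 1) for _ in range(n)]
g1 = [0.0] + [0.9 * g0[t - 1] + 0.1 * rng.gauss(0, 1) for t in range(1, n)]
g3 = [0.0] + [0.9 * g2[t - 1] + 0.1 * rng.gauss(0, 1) for t in range(1, n)]

adj = association_matrix([g0, g1, g2, g3], cutoff=30.0)  # cutoff is an illustrative choice
clusters = connected_components(adj)
print(clusters)
```

The test is directional, so the association matrix is generally asymmetric. This sketch ignores direction only when grouping genes into components.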

Topics: QK, QH426
Publisher: BioMed Central Ltd.
Year: 2010
OAI identifier: oai:wrap.warwick.ac.uk:2993

