Skip to main content
Article thumbnail
Location of Repository

Temporal clustering by affinity propagation reveals transcriptional modules in Arabidopsis thaliana

By Steven J. Kiddle, Oliver P. Windram, Stuart McHattie, A. Mead, Jim Beynon, Vicky Buchanan-Wollaston, Katherine J. Denby and Sach Mukherjee

Abstract

Motivation: Identifying regulatory modules is an important task in the exploratory analysis of gene expression time series data. Clustering algorithms are often used for this purpose. However, gene regulatory events may induce complex temporal features in a gene expression profile, including time delays, inversions and transient correlations, which are not well accounted for by current clustering methods. As the cost of microarray experiments continues to fall, the temporal resolution of time course studies is increasing. This has led to a need to take account of detailed temporal features of this kind. Thus, while standard clustering methods are both widely used and much studied, their shared shortcomings with respect to such temporal features motivates the work presented here. \ud \ud Results: Here, we introduce a temporal clustering approach for high-dimensional gene expression data which takes account of time delays, inversions and transient correlations. We do so by exploiting a recently introduced, message-passing-based algorithm called Affinity Propagation (AP). We take account of temporal features of interest following an approximate but efficient dynamic programming approach due to Qian et al. The resulting approach is demonstrably effective in its ability to discern non-obvious temporal features, yet efficient and robust enough for routine use as an exploratory tool. We show results on validated transcription factor–target pairs in yeast and on gene expression data from a study of Arabidopsis thaliana under pathogen infection. The latter reveals a number of biologically striking findings. \ud \ud Availability: Matlab code for our method is available at http://www.wsbc.warwick.ac.uk/stevenkiddle/tcap.html. \ud \u

Topics: QK, QH426
Publisher: Oxford University Press
Year: 2009
OAI identifier: oai:wrap.warwick.ac.uk:3231

Suggested articles

Citations

  1. (2006). Expression profiling and mutant analysis reveals complex regulatory networks involved in Arabidopsis response to Botrytis infection. doi
  2. (2004). Clustering of gene expression data using a local shape-based similarity measure. doi
  3. (2003). Computational discovery of gene modules and regulatory networks. doi
  4. (1998). Profile hidden Markov models. doi
  5. (2007). Clustering by Passing Messages Between Data Points. doi
  6. (2000). Genomic expression programs in the response of yeast cells to environmental changes. doi
  7. (2007). Mixture modelling of gene expression data from microarray experiments. doi
  8. (2005). Systems biology for the virtual plant.
  9. (1972). Direct clustering of a data matrix. doi
  10. (2001). The Elements of Statistical Learning. doi
  11. (2005). Bayesian coclustering of Anopholes gene expression time series: Study of immune defense response to multiple experimental challenges. doi
  12. (2002). Plaid models for gene expression data.
  13. (2006). Experimental validation of a predicted feedback loop in the multi-oscillator clock of Arabidopsis thaliana. Molecular Systems Biology, doi
  14. (2005). A linear time biclustering algorithm for time series gene expression data. doi
  15. (2009). Enrichment constrained time-dependent clustering analysis for finding meaningful temporal transcription modules. doi
  16. (2002). On spectral clustering: analysis and an algorithm. In
  17. (2008). The AP2/ERF domain transcription factor ORA59 integrates jasmonic acid and ethylene signals in plant defense. doi
  18. (2001). Beyond synnexpression relationships: local clustering of timeshifted and inverted gene expression profiles identifies new biologically relevant interactions. doi
  19. (2003). Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data. doi
  20. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. doi
  21. (2004). Elucidation of gene interaction networks through time-lagged correlation analysis of transcriptional data. Genome Res., doi
  22. (2003). Module networks: identifying regulatory networks and their condition specific regulators from gene expression data. doi
  23. (2000). Normalized cuts and image segmentation. doi
  24. (2009). Clustered alignments of gene-expression data. doi
  25. (1998). Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiaeby microarray hybridization. doi
  26. (2008). The Arabidopsis information resource (TAIR): gene structure and function annotation. doi
  27. (2006). Evaluation and comparison of gene clustering methods in microarray analysis. doi
  28. (2007). Mechanical stress induces biotic and abiotic stress responses via a novel cis-element. doi
  29. (2006). Effective similarity measures for expression profiles. doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.