Location of Repository

Statistical modelling of transcript profiles of differentially regulated genes

By Daniel C. Eastwood, A. (Andrew) Mead, Martin J. Sergeant and Kerry S. Burton

Abstract

Background: The vast quantities of gene expression profiling data produced in microarray studies, and\ud the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous\ud studies have summarised gene expression profiles using simple descriptive statistics, basic analysis of\ud variance (ANOVA) and the clustering of genes based on simple models fitted to their expression profiles\ud over time. We report the novel application of statistical non-linear regression modelling techniques to\ud describe the shapes of expression profiles for the fungus Agaricus bisporus, quantified by PCR, and for E.\ud coli and Rattus norvegicus, using microarray technology. The use of parametric non-linear regression models\ud provides a more precise description of expression profiles, reducing the "noise" of the raw data to\ud produce a clear "signal" given by the fitted curve, and describing each profile with a small number of\ud biologically interpretable parameters. This approach then allows the direct comparison and clustering of\ud the shapes of response patterns between genes and potentially enables a greater exploration and\ud interpretation of the biological processes driving gene expression.\ud Results: Quantitative reverse transcriptase PCR-derived time-course data of genes were modelled. "Splitline"\ud or "broken-stick" regression identified the initial time of gene up-regulation, enabling the classification\ud of genes into those with primary and secondary responses. Five-day profiles were modelled using the\ud biologically-oriented, critical exponential curve, y(t) = A + (B + Ct)Rt + ε. This non-linear regression\ud approach allowed the expression patterns for different genes to be compared in terms of curve shape,\ud time of maximal transcript level and the decline and asymptotic response levels. Three distinct regulatory\ud patterns were identified for the five genes studied. Applying the regression modelling approach to\ud microarray-derived time course data allowed 11% of the Escherichia coli features to be fitted by an\ud exponential function, and 25% of the Rattus norvegicus features could be described by the critical\ud exponential model, all with statistical significance of p < 0.05.\ud Conclusion: The statistical non-linear regression approaches presented in this study provide detailed\ud biologically oriented descriptions of individual gene expression profiles, using biologically variable data to\ud generate a set of defining parameters. These approaches have application to the modelling and greater\ud interpretation of profiles obtained across a wide range of platforms, such as microarrays. Through careful\ud choice of appropriate model forms, such statistical regression approaches allow an improved comparison\ud of gene expression profiles, and may provide an approach for the greater understanding of common\ud regulatory mechanisms between genes

Topics: QH426
Publisher: Biomed central
Year: 2008
OAI identifier: oai:wrap.warwick.ac.uk:204

Suggested articles

Preview

Citations

  1. (2002). A: Bioinformatics methods for the analysis of expression arrays: data clustering and information extraction. doi
  2. (2002). A: HMGB1 and HMGB2 cell-specifically down-regulate the p53-and p73-dependant sequence-specific transactivation from the human Bax gene promoter. doi
  3. (1998). Aelist AC: Morphometric analysis of ageing mushrooms (Agaricus bisporus) during post-harvest development. Postharvest Biol Technol doi
  4. (2002). AFM: Sensitivity and accuracy of quantitative real-time polymerase chain reaction using SYBR green I depends on cDNA synthesis conditions. Anal Biochem doi
  5. (1975). Changes in respiration and carbohydrates during the post-harvest storage of mushrooms (Agaricus bisporus). J Sci Food Agric doi
  6. (2006). DA: A quantitative study of gene regulation involved in the immune response of Anopheline mosquitoes: An application of Bayesian hierarchical clustering of curves. doi
  7. (2005). DL: A Bayesian approach to reconstructing genetic regulatory networks with hiddenfactors. Bioinformatics doi
  8. (2001). DW: Molecular cloning: a laboratory manual 3rd edition.
  9. (2001). F: Methods and approaches in the analysis of gene expression data. doi
  10. (2004). Falciani F: Modelling T-cell activation using gene expression profiling and state-space models. Bioinformatics doi
  11. (2007). Griensven LJLD, Op Den Camp HJM: Argininosuccinate synthetase and argininosuccinate lyase: two ornithine cycle enzymes from Agaricus bisporus. Mycol Res doi
  12. (2001). Habitat patchiness and plant species richness. Ecol Lett doi
  13. (2002). Improved quantitative real-time RT-PCR for expression profiling of individual cells. doi
  14. (2004). Inferring quantitative models of regulatory networks from expression data. Bioinformatics doi
  15. (2006). Isolation and analysis of genes specifically expressed during fruiting body development in the basidiomycete Flammulina velutipes by fluorescence differential display. FEMS Microbiol Lett doi
  16. (1997). Isolation of developmentally regulated genes from the edible mushroom Agaricus bisporus. doi
  17. (2001). KS: Cloning and post-harvest expression of serine proteinase transcripts in the cultivated mushroom Agaricus bisporus. Fungal Gen Biol doi
  18. (2006). KS: Differential expression of a putative riboflavin-aldehydeforming enzyme (raf) gene during development and postharvest storage and in different tissue of the sporophore in Agaricus bisporus. Appl Microbiol Biotechnol doi
  19. (2001). KS: Genes with increased transcript levels following harvest of the sporophore of Agaricus bisporus have multiple physiological roles. Mycol Res doi
  20. (2000). Life history and developmental processes in the basidiomycete Coprinus cinereus. Microbiol Mol Biol Rev doi
  21. (2003). Microarray Bioinformatics Cambridge doi
  22. (2003). Modelling of corticosteroid pharmacogenomics in rat liver using gene microarrays. J Pharmacol Exp Ther
  23. (2005). Molecular cloning of developmentally specific genes by representational difference analysis during fruiting body formation in the basidiomycete Lentinula edodes. Fungal Gen Biol doi
  24. (2002). Molecular genetics of sexual development in the mushroom Coprinus cinereus. BioEssays doi
  25. (1997). Morphological studies on the life span, development stages, senescence and death of fruiting bodies of Agaricus bisporus. Mycol Res doi
  26. (2007). Pomposiello PJ: Rapid changes in gene expression dynamics in response to superoxide reveal SoxRS-dependent and independent transcriptional networks. PLoS ONE doi
  27. (1994). RG: The high-mobility group protein HMG1 can reversibly inhibit class-II gene-transcription by interaction with the TATA-binding protein.
  28. (2000). Ribosomal 18S RNA prevails over Glyceraldehyde-3-phosphate dehydrogenase and β-actin genes as internal standards for quantitative comparison of mRNA levels in invasive and noninvasive human melanoma cell subpopulations. Anal Biochem doi
  29. (2003). RL: Impact of the analysis of a bone density reference range on determination of the T-score. J Clin Densitom doi
  30. (2005). Shiozaki K: Yeast signalling pathways in oxidative stress response. Mutat Res-Fund Mol Mech Mutagen
  31. (2006). Speed TP: A multivariate empirical Bayes statistic for replicated microarray time course data. Ann Stat doi
  32. (2005). Statistical analysis of microarray data. Addict Biol doi
  33. (2007). Statistical reconstruction of transcription factor activity using Michaelis-Menten kinetics. Biometrics doi
  34. (2006). Talön M: maSigPro: A method to identify significantly differential expression profiles in timecourse microarray experiments. Bioinformatics doi
  35. (1993). Villa-Komaroff L: An alternative method for the visualization of RNA in formaldehyde agarose gels. Focus
  36. (2006). Westhead DR: Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis. Nucleic Acids Res doi
  37. (2003). Wilcox WF: Effects of humidity on the development of grapevine powdery mildew. Phytopathology doi
  38. (2002). Yoo Y-B: Comparative analysis of sequences expressed during liquid-cultured mycelia and fruit body stages of Pleurotus ostreatus. Fungal Gen Biol doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.