Article thumbnail

Impact of gene expression data pre-processing on expression quantitative trait locus mapping

By Aurelie Labbe, Marie-Paule Roth, Pierre-Hugues Carmichael and Maria Martinez


We evaluate the impact of three pre-processing methods for Affymetrix microarray data on expression quantitative trait locus (eQTL) mapping, using 14 CEPH Utah families (GAW Problem 1 data). Different sets of expression traits were chosen according to different selection criteria: expression level, variance, and heritability. For each gene, three expression phenotypes were obtained by different pre-processing methods. Each quantitative phenotype was then submitted to a whole-genome scan, using multipoint variance component LODs. Pre-processing methods were compared with respect to their linkage outcomes (number of linkage signals with LODs greater than 3, consistencies in the location of the trait-specific linkage signals, and type of cis/trans-regulating loci). Overall, we found little agreement between linkage results from the different pre-processing methods: most of the linkage signals were specific to one pre-processing method. However, agreement rates varied according to the criteria used to select the traits. For instance, these rates were higher in the set of the most heritable traits. On the other hand, the pre-processing method had little impact on the relative proportion of detected cis and trans-regulating loci. Interestingly, although the number of detected cis-regulating loci was relatively small, pre-processing methods agreed much better in this set of linkage signals than in the trans-regulating loci. Several potential factors explaining the discordance observed between the methods are discussed

Topics: Proceedings
Publisher: BioMed Central
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. Affymetrix: Affymetrix Microarray Suite User Guide, version 4
  2. (2002). Affymetrix: Statistical Algorithms Description Document
  3. (2005). Burdick JT: Mapping determinants of human gene expression by regional and whole genome association. Nature
  4. (2004). Cheung VG: Genetic analysis of genome-wide variation in human gene expression. Nature
  5. (2006). Comparison of Affymetrix GeneChip expression measures. Bioinformatics
  6. (2004). F: A model based background adjustment for oligonucleotide expression arrays.
  7. (2005). GP: Sources of variation in Affymetrix microarray experiments.
  8. (2006). Little PFR: Normalization procedures and detection of linkage signal in genetical-genomic experiments. Nat Genet
  9. (2002). LR: Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet
  10. (2006). N: Reply to "Normalization procedures and detection of linkage signal in genetical-genomics experiments".
  11. (2006). RW: Reply to "Normalization procedures and detection of linkage signal in genetical-genomics experiments".
  12. (2003). Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics
  13. (2003). Spielman RS: Natural variation in human gene expression assessed in lymphoblastoid cells. Nat Genet