1,209 research outputs found

Empirical Bayes models for multiple probe type microarrays at the probe level

Author: A Hess
A Sjögren
A Spira
AM Hein
AP Dempster
B Efron
BP Durbin
D Gaile
D Holder
DM Rocke
E Kristiansson
GK Smyth
I Lönnstedt
IA Eaves
J Comander
J Hu
JW Tukey
LM Cope
M Åstrand
MA Sartor
Magnus Åstrand
Mats Rudemo
N Jain
P Baldi
P Munson
Petter Mostad
R Opgen-Rhein
RA Irizarry
RS Stearman
S Choe
SC Geller
T Hastie
VG Tusher
W Huber
W Lemon
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: When analyzing microarray data, a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests, the sample variances are adjusted towards a global estimate, producing more stable results than ordinary t-tests. However, for Affymetrix-type data a clear dependency between variability and intensity level generally exists, even for logged intensities; it is most pronounced for data at the probe level but is also present for probe-set summaries such as the MAS5 expression index. As a consequence, adjustment towards a global estimate results in a false positive rate that depends on intensity level. Results: We propose two new methods for finding differentially expressed genes, Probe level Locally moderated Weighted median-t (PLW) and Locally Moderated Weighted-t (LMW). Both methods use an empirical Bayes model that takes the dependency between variability and intensity level into account. A global covariance matrix is also used, allowing for differing variances between arrays as well as array-to-array correlations. PLW is designed specifically for Affymetrix-type arrays (or other multiple-probe arrays). Instead of making inference on probe-set summaries, comparisons are made separately for each perfect-match probe and are then summarized into one score for the probe-set. Conclusion: The proposed methods are compared to 14 existing methods using five spike-in data sets. For RMA and GCRMA processed data, PLW has the most accurate ranking of regulated genes in four out of the five data sets, and LMW consistently performs better than all examined moderated t-tests when used on RMA, GCRMA, and MAS5 expression indexes.
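The variance adjustment mentioned in the background can be illustrated with a minimal sketch. It shows only the generic idea of shrinking a gene's sample variance towards a global prior value, as in the moderated t-tests the abstract compares against; it is not an implementation of PLW or LMW, which additionally let the prior depend on intensity level and use a global covariance matrix. The function names, the one-sample setting, the data, and the hyperparameters `prior_var` and `prior_df` are all assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, variance


def ordinary_t(x):
    """One-sample t-statistic for the log-ratios of a single gene."""
    n = len(x)
    return mean(x) / sqrt(variance(x) / n)


def moderated_t(x, prior_var, prior_df):
    """t-statistic with the sample variance shrunk towards a global prior.

    prior_var and prior_df are assumed to be given; in practice they would
    be estimated from all genes on the arrays.
    """
    n = len(x)
    d = n - 1  # degrees of freedom of the sample variance
    shrunk_var = (prior_df * prior_var + d * variance(x)) / (prior_df + d)
    return mean(x) / sqrt(shrunk_var / n)


# Hypothetical log-ratios for one gene whose sample variance happens to be tiny.
log_ratios = [0.10, 0.11, 0.09]

t_ord = ordinary_t(log_ratios)
t_mod = moderated_t(log_ratios, prior_var=0.04, prior_df=4.0)
print(f"ordinary t = {t_ord:.2f}, moderated t = {t_mod:.2f}")
```

With only three replicates, the ordinary statistic is inflated by the accidentally small variance, while the moderated statistic is pulled back towards what the prior suggests. If the true variance changes with intensity, a single global `prior_var` would be too large for some genes and too small for others, which is the problem the abstract describes.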

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Chalmers Research

Chalmers Publication Library

A Population Proportion approach for ranking differentially expressed genes

Author: A Ben-Dor
A Wille
AA Alizadeh
B Wu
C Ding
CS Cooper
D Lambrechts
D Rajagopalan
D Singh
DA Notterman
DJ Duggan
DM Rocke
E Segal
G Ramsay
G Seth
GA Churchill
H Lian
IB Jeffery
J Lyons-Weiler
J Valls
JG Zhang
JL DeRisi
JR Nevins
JT McClave
KM Carr
LM Staudt
M Dettling
M Schena
M West
MF Oleksiak
Mugdha Gadgil
N Jain
OG Troyanskaya
P Baldi
P Hegde
PS Mischel
R Bijlani
R Simon
R Tibshirani
RD Wolfinger
S Draghici
S Dudoit
SA Tomlins
TJ Belbin
TR Golub
TS Furey
U Alon
VG Tusher
W Huber
W Pan
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: DNA microarrays are used to investigate differences in gene expression between two or more classes of samples. Most currently used approaches compare mean expression levels between classes and are not geared to find genes whose expression is significantly different in only a subset of samples in a class. However, biological variability can lead to situations where key genes are differentially expressed in only a subset of samples. To facilitate the identification of such genes, a new method is reported. Methods: The key difference between the Population Proportion Ranking Method (PPRM) presented here and almost all other methods currently used is in the quantification of variability. PPRM quantifies variability in terms of inter-sample ratios. It can be used to calculate the relative merit of differentially expressed genes that show a specified difference in expression level between at least some samples in the two classes and, at the same time, have lower than a specified variability within each class. Results: PPRM is tested on simulated data and on three publicly available cancer data sets. It is compared to the t-test, PPST, COPA, OS, ORT and MOST using the simulated data. Under the conditions tested, it performs as well as or better than the other methods under low intra-class variability, and better than the t-test, PPST, COPA and OS when a gene is differentially expressed in only a subset of samples. It performs better than ORT and MOST in recognizing non-differentially expressed genes with high variability in expression levels across all samples. For biological data, the success of the identified predictor genes in correctly classifying an independent sample is reported.
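The motivation in the background, that a comparison of class means can miss a gene changed in only a subset of samples, can be made concrete with a small sketch. The data are invented, and `fraction_above` is a hypothetical, deliberately crude proportion-style score used only for contrast; it is not PPRM, whose inter-sample-ratio calculation is not specified in the abstract.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch two-sample t-statistic comparing the mean of b to the mean of a."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(b) - mean(a)) / se


def fraction_above(a, b, fold=2.0):
    """Share of class-b samples exceeding fold times the largest class-a value.

    Hypothetical illustration only; not the PPRM score.
    """
    cutoff = fold * max(a)
    return sum(v > cutoff for v in b) / len(b)


# Invented expression levels: 2 of the 8 class-b samples are strongly elevated.
control = [1.0, 1.1, 0.9, 1.0, 1.0, 1.1, 0.9, 1.0]
subset_up = [1.0, 1.1, 0.9, 1.0, 1.0, 1.1, 5.0, 5.2]

t_stat = welch_t(control, subset_up)
share = fraction_above(control, subset_up)
print(f"Welch t = {t_stat:.2f}, share of elevated samples = {share:.2f}")
```

The two elevated samples raise the class mean but raise the class variance even more, so the t-statistic stays modest even though a quarter of the samples show a roughly five-fold change. A score that looks at individual samples rather than class means picks up this pattern directly.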

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central