Search CORE

5 research outputs found

A probabilistic generative model for GO enrichment analysis

Author: Alexa
Bader
Bar-Joseph
Cheung
Davis
Deutscher
Eisen
Ernst
Ernst
Ewing
Gasch
Gerard J. Nau
Giot
Grassme
Grossmann
Harbison
Ihmels
Itamar Simon
Jones
Kellis
Leem
Mewes
Mukherjee
Nasmyth
Natarajan
Nau
Navarre
Palomero
Park
Ren
Rojas
Roni Rosenfeld
Spellman
The ENCODE Project Consortium.
The Gene Ontology Consortium.
The Toxicogenomics Research Consortium.
Thomas
Yong Lu
Ziv Bar-Joseph
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The Gene Ontology (GO) is extensively used to analyze all types of high-throughput experiments. However, researchers still face several challenges when using GO and other functional annotation databases. One problem is the large number of multiple hypotheses that are being tested for each study. In addition, categories often overlap with both direct parents/descendents and other distant categories in the hierarchical structure. This makes it hard to determine if the identified significant categories represent different functional outcomes or rather a redundant view of the same biological processes. To overcome these problems we developed a generative probabilistic model which identifies a (small) subset of categories that, together, explain the selected gene set. Our model accommodates noise and errors in the selected gene set and GO. Using controlled GO data our method correctly recovered most of the selected categories, leading to dramatic improvements over current methods for GO analysis. When used with microarray expression data and ChIP-chip data from yeast and human our method was able to correctly identify both general and specific enriched categories which were overlooked by other methods

Crossref

PubMed Central

Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data

Author: Chen-An Tsai
Chien-Ju Lin
E Marshall
Huey-Miin Hsueh
James J Chen
JE Larkin
JM Perket
JP Ioannidis
KK Dobbin
L Guo
L Klebanov
L Shi
L Shi
MAQC Consortium
Members of the Toxicogenomics Research Consortium
P Liang
PK Tan
RA Irizarry
RD Canales
Robert R Delongchamp
S Draghici
S Frantz
SC Chow
TA Patterson
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Many researchers are concerned with the comparability and reliability of microarray gene expression data. Recent completion of the MicroArray Quality Control (MAQC) project provides a unique opportunity to assess reproducibility across multiple sites and the comparability across multiple platforms. The MAQC analysis presented for the conclusion of inter- and intra-platform comparability/reproducibility of microarray gene expression measurements is inadequate. We evaluate the reproducibility/comparability of the MAQC data for 12901 common genes in four titration samples generated from five high-density one-color microarray platforms and the TaqMan technology. We discuss some of the problems with the use of correlation coefficient as metric to evaluate the inter- and intra-platform reproducibility and the percent of overlapping genes (POG) as a measure for evaluation of a gene selection procedure by MAQC. Results A total of 293 arrays were used in the intra- and inter-platform analysis. A hierarchical cluster analysis shows distinct differences in the measured intensities among the five platforms. A number of genes show a small fold-change in one platform and a large fold-change in another platform, even though the correlations between platforms are high. An analysis of variance shows thirty percent of gene expressions of the samples show inconsistent patterns across the five platforms. We illustrated that POG does not reflect the accuracy of a selected gene list. A non-overlapping gene can be truly differentially expressed with a stringent cut, and an overlapping gene can be non-differentially expressed with non-stringent cutoff. In addition, POG is an unusable selection criterion. POG can increase or decrease irregularly as cutoff changes; there is no criterion to determine a cutoff so that POG is optimized. Conclusion Using various statistical methods we demonstrate that there are differences in the intensities measured by different platforms and different sites within platform. Within each platform, the patterns of expression are generally consistent, but there is site-by-site variability. Evaluation of data analysis methods for use in regulatory decision should take no treatment effect into consideration, when there is no treatment effect, "a fold-change cutoff with a non-stringent p-value cutoff" could result in 100% false positive error selection.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A robust measure of correlation between two genes on a microarray

Author: A Beaton
Aya Mitani
B Zhang
Brian VanKoten
C Brown
C Glasbey
D Jiang
D Rocke
DM Rocke
E Hubbel
E Marshall
E Schadt
F Mosteller
G Davidson
H Lopuhaä
HP Lopuhaa
I Gat-Vilks
J Ioannidis
J Qin
J Tukey
Johanna Hardin
K Kafadar
K Kafadar
K Yeung
L Dodd
L Heyer
Leanne Hicks
M Eisen
P Huber
P Rousseeuw
P Rousseeuw
P Spellman
R Wilcox
S Bergmann
S Carter
S Chu
S Datta
S Dudoit
T Golub
Toxicogenomics Research Consortium
X Wang
X Wang
Y Yang
Y Yang
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The underlying goal of microarray experiments is to identify gene expression patterns across different experimental conditions. Genes that are contained in a particular pathway or that respond similarly to experimental conditions could be co-expressed and show similar patterns of expression on a microarray. Using any of a variety of clustering methods or gene network analyses we can partition genes of interest into groups, clusters, or modules based on measures of similarity. Typically, Pearson correlation is used to measure distance (or similarity) before implementing a clustering algorithm. Pearson correlation is quite susceptible to outliers, however, an unfortunate characteristic when dealing with microarray data (well known to be typically quite noisy.) Results We propose a resistant similarity metric based on Tukey's biweight estimate of multivariate scale and location. The resistant metric is simply the correlation obtained from a resistant covariance matrix of scale. We give results which demonstrate that our correlation metric is much more resistant than the Pearson correlation while being more efficient than other nonparametric measures of correlation (e.g., Spearman correlation.) Additionally, our method gives a systematic gene flagging procedure which is useful when dealing with large amounts of noisy data. Conclusion When dealing with microarray data, which are known to be quite noisy, robust methods should be used. Specifically, robust distances, including the biweight correlation, should be used in clustering and gene network analysis.</p

Scholarship@Claremont

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Standardized Solutions for Quantitative and Real-Time RT-PCR to Accelerate Biopharmaceutical Development

Author: Aviv
Bas
Bernard
Blomberg
Boom
Bustin
Chirgwin
Cusi
Gibson
Heid
Henegariu
Hill
Huggett
Irizarry
Kutyavin
Larkin
Livak
Members of the Toxicogenomics Research Consortium
Nazarenko
Nazarenko
Pfaffl
Piechaczyk
Ross
Sellner
Stallcup
Tanaka
Tyagi
Ullmann
Vogelstein
Whitcombe
Wilfinger
Wittwer
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Evolving strategies for global gene expression analysis of cancer

Author: Allinen
Bertone
Bild
Boyle
Butler
Calzone
Caretti
Chen
Coe
Costa
Dalmasso
Davis
de Kok
Dobbin
Ein-Dor
Eisen
Esteller
Gardina
Giangreco
Goidin
Gounaris
Haller
Heidenblad
Higdon
Hyman
Jhavar
Johnson
Jung
Kan
Kim
Klinck
Lakshmipathy
Larkin
Lau
Lee
Li
Liang
Lockwood
Mani
MAQC Consortium
Mass
Members of the Toxicogenomics Research Consortium
Meyer
Michiels
Nagalakshmi
Nevins
Paddison
Penland
Perou
Pollack
Pusztai
Qian
Quackenbush
Quintas-Cardama
Raz
Rosenwald
Schena
Shipp
Sorlie
Storey
Subkhankulova
Subramanian
Sultan
Tong
Velculescu
Vogel
Wang
Wang
Weber
Weinstein
Wolf
Xu
Xu
Zhang
Publication venue: 'Wiley'
Publication date
Field of study

Crossref