5 research outputs found

Gene selection with multiple ordering criteria

Author: BA Rosenzweig
C Ambroise
CA Tsai
Chen-An Tsai
Chun-Houh Chen
G Fleury
GS Akerman
H Liu
I Guyon
James J Chen
JH Cho
JM Perket
L Breiman
L Li
M de Berg
M Dettling
MAQC Consortium
O Barndorff-Nielsen
S Michiels
SE Choe
SH Jung
ShengLi Tzeng
U Alon
V Tusher
W Jin
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2007

BACKGROUND: A microarray study may yield different sets of differentially expressed genes depending on the selection criterion used. For example, fold-change and p-value are two commonly used criteria for selecting differentially expressed genes under two experimental conditions, and they often produce incompatible gene sets. Also, in a two-factor experiment, say treatment by time, the investigator may be interested in a single gene list that responds to both treatment and time effects.
RESULTS: We propose three layer ranking algorithms, point-admissible, line-admissible (convex), and Pareto, to provide a preference gene list from multiple gene lists generated by different ranking criteria. Using the public colon data as an example, the layer ranking algorithms are applied to three univariate ranking criteria: fold-change, p-value, and frequency of selection by the SVM-RFE classifier. A simulation experiment shows that, for experiments with small or moderate sample sizes (fewer than 20 per group) that aim to detect a 4-fold change or less, the two-dimensional (p-value and fold-change) convex layer ranking selects differentially expressed genes with generally lower FDR and higher power than the standard p-value ranking. Three applications are presented. The first illustrates the use of the layer rankings to potentially improve predictive accuracy. The second applies the layer rankings to a two-factor experiment involving two dose levels and two time points, selecting differentially expressed genes related to the dose and time effects. In the third, the layer rankings are applied to a benchmark data set consisting of three dilution concentrations to rank a long list of differentially expressed genes generated from those concentrations.
CONCLUSION: The layer ranking algorithms help investigators select the most promising genes from multiple gene lists generated by different filter, normalization, or analysis methods for various objectives.
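
The abstract above names a Pareto layer ranking but does not spell out the procedure. As a rough illustration only, the sketch below uses generic Pareto-front peeling (non-dominated sorting), which may differ from the authors' algorithm. It assumes every criterion is finite and oriented so that larger is better, for example absolute log2 fold-change and -log10 p-value. The function names `dominates` and `pareto_layers` and the sample scores are hypothetical.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b on every criterion and strictly
    better on at least one (assumption: larger values are better)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_layers(scores: Sequence[Sequence[float]]) -> List[List[int]]:
    """Peel off successive non-dominated fronts.

    Layer 1 holds the genes no other gene dominates; those are removed and
    the process repeats on the remainder until every gene has a layer.
    Returns a list of layers, each a sorted list of gene indices.
    """
    remaining = set(range(len(scores)))
    layers: List[List[int]] = []
    while remaining:
        front = sorted(
            i for i in remaining
            if not any(dominates(scores[j], scores[i]) for j in remaining if j != i)
        )
        layers.append(front)
        remaining -= set(front)
    return layers


# Hypothetical scores: (absolute log2 fold-change, -log10 p-value) per gene.
genes = [(2.0, 3.0), (1.0, 4.0), (1.5, 2.0), (0.5, 0.5)]
print(pareto_layers(genes))  # [[0, 1], [2], [3]]
```

Genes 0 and 1 share the first layer because each beats the other on one criterion, which is exactly the case where a single-criterion ranking would force an arbitrary choice.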

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data

Author: Chen-An Tsai
Chien-Ju Lin
E Marshall
Huey-Miin Hsueh
James J Chen
JE Larkin
JM Perket
JP Ioannidis
KK Dobbin
L Guo
L Klebanov
L Shi
MAQC Consortium
Members of the Toxicogenomics Research Consortium
P Liang
PK Tan
RA Irizarry
RD Canales
Robert R Delongchamp
S Draghici
S Frantz
SC Chow
TA Patterson
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2007

BACKGROUND: Many researchers are concerned with the comparability and reliability of microarray gene expression data. Recent completion of the MicroArray Quality Control (MAQC) project provides a unique opportunity to assess reproducibility across multiple sites and comparability across multiple platforms. The MAQC analysis presented in support of the conclusion of inter- and intra-platform comparability and reproducibility of microarray gene expression measurements is inadequate. We evaluate the reproducibility and comparability of the MAQC data for 12901 common genes in four titration samples generated from five high-density one-color microarray platforms and the TaqMan technology. We discuss some of the problems with MAQC's use of the correlation coefficient as a metric for inter- and intra-platform reproducibility and of the percent of overlapping genes (POG) as a measure for evaluating a gene selection procedure.
RESULTS: A total of 293 arrays were used in the intra- and inter-platform analysis. A hierarchical cluster analysis shows distinct differences in the measured intensities among the five platforms. A number of genes show a small fold-change in one platform and a large fold-change in another, even though the correlations between platforms are high. An analysis of variance shows that thirty percent of the gene expressions in the samples have inconsistent patterns across the five platforms. We illustrate that POG does not reflect the accuracy of a selected gene list. A non-overlapping gene can be truly differentially expressed under a stringent cutoff, and an overlapping gene can be non-differentially expressed under a non-stringent cutoff. In addition, POG is an unusable selection criterion: it can increase or decrease irregularly as the cutoff changes, and there is no criterion for determining a cutoff that optimizes POG.
CONCLUSION: Using various statistical methods, we demonstrate that there are differences in the intensities measured by different platforms and by different sites within a platform. Within each platform, the patterns of expression are generally consistent, but there is site-by-site variability. Evaluation of data analysis methods for use in regulatory decisions should take the case of no treatment effect into consideration: when there is no treatment effect, "a fold-change cutoff with a non-stringent p-value cutoff" could result in 100% false-positive selections.
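
The abstract above does not define POG. As a rough illustration only, the sketch below assumes the common convention: take the top k genes from each of two ranked lists and report the shared genes as a percentage of k. The function name `percent_overlapping_genes` and the gene identifiers are hypothetical, and the toy lists are made up to show how the value can move up and down as the cutoff k changes.

```python
from typing import Sequence


def percent_overlapping_genes(ranked_a: Sequence[str],
                              ranked_b: Sequence[str],
                              k: int) -> float:
    """Percent of genes shared by the top-k entries of two ranked lists.

    Assumed definition: 100 * |top_k(A) & top_k(B)| / k.
    """
    if k <= 0 or k > min(len(ranked_a), len(ranked_b)):
        raise ValueError("k must be between 1 and the shorter list length")
    shared = set(ranked_a[:k]) & set(ranked_b[:k])
    return 100.0 * len(shared) / k


# Hypothetical rankings of the same experiment from two platforms.
platform_a = ["g1", "g2", "g3", "g4"]
platform_b = ["g2", "g5", "g1", "g6"]

for k in range(1, 5):
    print(k, percent_overlapping_genes(platform_a, platform_b, k))
```

With these toy lists the overlap is 0% at k = 1, 50% at k = 2, about 66.7% at k = 3, and 50% again at k = 4, so the measure is not monotone in the cutoff.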

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A comprehensive functional analysis of tissue specificity of human gene expression

Author: A Broder
A Subramanian
AD Smith
AL Barabasi
Alexey Guryanov
Andrej Bugrim
AS Adler
Damir Dosymbekov
E Eisenberg
Eugene Rakhmatulin
Evgeny Sviridov
F Chalmel
GK Smyth
H Kitano
J Zhu
JA Warrington
JD Watson
JM Perket
Julie Blake
K Kadota
KE Kouadjo
Kelly Li
L Klebanov
LL Hsiao
M Csete
M Schena
M Shipitsin
MAQC Consortium
MB Eisen
MJ Callow
N Schultz
Raymond R Samaha
Richard J Brennan
S Ekins
S Lee
Tatiana Nikolskaya
Tatiana Serebriyskaya
W Feller
Weiwei Shi
Y Nikolsky
Yuri Nikolsky
Z Tu
Zoltán Dezső
Publication venue: BioMed Central
Publication date: 01/01/2008

BACKGROUND: In recent years, the maturation of microarray technology has allowed the genome-wide analysis of gene expression patterns to identify tissue-specific and ubiquitously expressed ('housekeeping') genes. We have performed a functional and topological analysis of housekeeping and tissue-specific networks to identify universally necessary biological processes, and those unique to or characteristic of particular tissues.
RESULTS: We measured whole genome expression in 31 human tissues, identifying 2374 housekeeping genes expressed in all tissues, and genes uniquely expressed in each tissue. Comprehensive functional analysis showed that the housekeeping set is substantially larger than previously thought, and is enriched with vital processes such as oxidative phosphorylation, ubiquitin-dependent proteolysis, translation and energy metabolism. Network topology of the housekeeping network was characterized by higher connectivity and shorter paths between the proteins than the global network. Ontology enrichment scoring and network topology of tissue-specific genes were consistent with each tissue's function, and expression patterns clustered together in accordance with tissue origin. Tissue-specific genes were twice as likely as housekeeping genes to be drug targets, allowing the identification of tissue 'signature networks' that will facilitate the discovery of new therapeutic targets and biomarkers of tissue-targeted diseases.
CONCLUSION: A comprehensive functional analysis of housekeeping and tissue-specific genes showed that the biological function of housekeeping and tissue-specific genes was consistent with tissue origin. Network analysis revealed that tissue-specific networks have distinct network properties related to each tissue's function. Tissue 'signature networks' promise to be a rich source of targets and biomarkers for disease treatment and diagnosis.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SLEPR: A Sample-Level Enrichment-Based Pathway Ranking Method — Seeking Biological Themes through Pathway-Level Consistency

Author: A Bild
A Boorsma
A Subramanian
A Sweet-Cordero
AI Su
B Wu
BR Zeeberg
D Damian
DA Hosack
DR Rhodes
E Huang
E Segal
F Al-Shahrour
GK Smyth
H Lian
H Tao
I Ulitsky
J Lyons-Weiler
J Tomfohr
JA Hartigan
JB Welsh
JD Storey
Ji Zhu
JJ Goeman
JM Perket
L Lamb
L Peltonen
L Tian
LT Reiter
M Yi
MB Eisen
Ming Yi
MK Kerr
N Cordes
N Jain
P Pavlidis
P Shannon
P Tamayo
R Nadon
R Tibshirani
Robert M. Stephens
S Burke
S Draghici
S Döhr
S Efroni
S Seo
SA Tomlins
SY Kim
T Manoli
TA Baudino
VG Tusher
VK Mootha
WK Scott
WP Hsieh
X Lu
Publication venue: Public Library of Science
Publication date: 01/01/2008

Analysis of microarray and other high-throughput data often involves identification of genes consistently up- or down-regulated across samples as the first step in extracting biological meaning. This gene-level paradigm can be limited by valid sample fluctuations and biological complexities. In this report, we describe a novel method, SLEPR, which eliminates this limitation by relying on pathway-level consistencies. Our method first selects the sample-level differentiated genes from each individual sample, capturing genes missed by other analysis methods; it then ascertains the enrichment levels of associated pathways from each of those lists, and finally ranks annotated pathways based on the consistency of enrichment levels of individual samples from both sample classes. As a proof of concept, we have used this method to analyze three public microarray datasets, with a direct comparison to GSEA, one of the most popular pathway-level analysis methods in the field. We found that our method not only reproduced the earlier observations with significant improvements in depth of coverage for validated or expected biological themes, but also produced additional insights that make biological sense. This new method extends existing analysis approaches and facilitates integration of different types of HTP data.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Leaching of Major and Trace Elements from Coal Ash

Author: A Läuchli
A Wadge
ACM Bourg
AH Welch
AK Furr
ASTM
BM Chapman
CA Cowan
CC Ainsworth
CJ Hostetler
CJ Warren
CL Carlson
CL Perket
D Langmuir
DA Grisafe
DA Kopsick
DC Adriano
DC Dusing
DC Mangold
DG Kinniburgh
DJ Swaine
DJR Dodd
DK Nordstrom
DM Miller
DR Dreesen
DR Jackson
DR Jones
DT Merrill
EA Crecelius
EA Jenne
EJ Reardon
EJ Wu
FA Cotton
GA Cutter
GJ de Groot
HA van der Sloot
HJ Meyer
HM Johnston
HR White
HT Evans Jr.
IP Murarka
J Frigge
J Gulens
J Vuceta
JA Cox
JA Davis
JD Allison
JF Hollis
JF Villaume
JH Howard III
JH Moore
JM Brannon
JM McNeal
JM Zachara
JO Leckie
JO Nriagu
JR Fowle III
JR Herring
JS Fruchter
JW Doran
K Furuya
K Sato
KD Ferguson
LB Clarke
LD Hansen
LD Hulett
LD Hulett Jr.
LE Eary
LJ Evans
ME Essington
MJ Dudas
ML Pierce
MM Benjamin
N Kaufherr
O Weres
PH Masscheleyn
R Hermann
R Wagemann
RG Robins
RJ Bartlett
RL Davison
RM McKenzie
RR Turner
RW Talbot
S Goldberg
SPN Singh
SS Sorini
SV Mattigod
TH Brown
TJ Chu
TL Theis
USEPA
W Stumm
WA Sack
WR Cullen
WR Roy
YA Kim
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1995

Crossref