Search CORE

39 research outputs found

Different effects of the probe summarization algorithms PLIER and RMA on high-level analysis of Affymetrix exon arrays

Author: Chen Yuchen
He Fei
Qu Yi
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Alternative splicing is an important mechanism that increases protein diversity and functionality in higher eukaryotes. Affymetrix exon arrays are a commercialized platform used to detect alternative splicing on a genome-wide scale. Two probe summarization algorithms, PLIER (Probe Logarithmic Intensity Error) and RMA (Robust Multichip Average), are commonly used to compute gene-level and exon-level expression values. However, a systematic comparison of these two algorithms on their effects on high-level analysis of the arrays has not yet been reported. Results In this study, we showed that PLIER summarization led to over-estimation of gene-level expression changes, relative to exon-level expression changes, in two-group comparisons. Consequently, it led to detection of substantially more skipped exons on up-regulated genes, as well as substantially more included (i.e., non-skipped) exons on down-regulated genes. In contrast, this bias was not observed for RMA-summarized data. By using a published human tissue dataset, we compared the tissue-specific expression and splicing detected by Affymetrix exon arrays with those detected based on expressed sequence databases. We found the tendency of PLIER was not supported by the expressed sequence data. Conclusion We showed that the tendency of PLIER in detection of alternative splicing is likely caused by a technical bias in the approach, rather than a biological bias. Moreover, we observed abnormal summarization results when using the PLIER algorithm, indicating that mathematical problems, such as numerical instability, may affect PLIER performance.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Analysis of Affymetrix Exon Arrays

Author: Leser Ulf
Zimmermann Karin
Publication venue: Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät II, Institut für Informatik
Publication date: 01/01/2010
Field of study

Exon arrays enable the monitoring of expression on a more fine-grained level than conventional 3’ arrays. By targeting single exons alternative splicing events can be detected. However, the increased amount of data resulting from the denser coverage of the transcribed regions gives rise to new challenges in data analysis compared to 3’ arrays. One must carefully decide which probes are considered for the final analysis to avoid measurements that are not reflecting biological reality. The most outstanding difference between gene level and exon level analysis emerges in the detection of differential expression. To decide whether an exon is differentially expressed between two conditions it must be set in relation to its corresponding gene. Therefore, completely new algorithms need to be applied. This work gives an overview on the analysis of Affymetrix exon arrays. Technical Design, Preprocessing and the detection of alternative splicing are dicussed and finally, a complete workflow is proposed

CiteSeerX

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin

Dissecting an alternative splicing analysis workflow for GeneChip® Exon 1.0 ST Affymetrix arrays

Author: Cristina Della Beffa
Della Beffa Cristina
Francesca Cordero
Raffaele A Calogero
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer

Springer - Publisher Connector

PubMed Central

Evaluation of Microarray Preprocessing Algorithms Based on Concordance with RT-PCR in Clinical Samples

Author: Aron C. Eklund
Balazs Gyorffy
Bela Molnar
Chad Creighton
Hermann Lage
Zoltan Szallasi
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

BACKGROUND Several preprocessing algorithms for Affymetrix gene expression microarrays have been developed, and their performance on spike-in data sets has been evaluated previously. However, a comprehensive comparison of preprocessing algorithms on samples taken under research conditions has not been performed. METHODOLOGY/PRINCIPAL FINDINGS We used TaqMan RT-PCR arrays as a reference to evaluate the accuracy of expression values from Affymetrix microarrays in two experimental data sets: one comprising 84 genes in 36 colon biopsies, and the other comprising 75 genes in 29 cancer cell lines. We evaluated consistency using the Pearson correlation between measurements obtained on the two platforms. Also, we introduce the log-ratio discrepancy as a more relevant measure of discordance between gene expression platforms. Of nine preprocessing algorithms tested, PLIER+16 produced expression values that were most consistent with RT-PCR measurements, although the difference in performance between most of the algorithms was not statistically significant. CONCLUSIONS/SIGNIFICANCE Our results support the choice of PLIER+16 for the preprocessing of clinical Affymetrix microarray data. However, other algorithms performed similarly and are probably also good choices

Crossref

Directory of Open Access Journals

PubMed Central

Repository of the Academy's Library

Semmelweis Repository

Online Research Database In Technology

Comprehensive Analysis of Affymetrix Exon Arrays Using BioConductor

Author: B Modrek
Crispin J Miller
Dalma-Weiszhausz
Fran Lewitter
J Lu
JM Johnson
JP Venables
K Kapur
M Dai
Michał J Okoniewski
MJ Okoniewski
MJ Okoniewski
NA Faustino
P Gardina
R Bender
R Gentleman
RA Irizarry
RC Gentleman
SD Pepper
T Clark
TJ Hubbard
WM Liu
WN Venables
Publication venue: Public Library of Science
Publication date: 01/02/2008
Field of study

ISSN:1553-734XISSN:1553-735

Repository for Publications and Research Data

Crossref

Directory of Open Access Journals

PubMed Central

Cross-hybridization modeling on Affymetrix exon arrays

Author: Affymetrix
Affymetrix
Boutz
Casneuf
Clark
Eklund
Gresham
Hubbard
Hui Jiang
Irizarry
Jiang
Johnson
Kapur
Karen Kapur
Li
Li
Mortazavi
Okoniewski
Smith
Srinivasan
Stoughton
Wing Hung Wong
Wu
Xing
Xing
Yeo
Yi Xing
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: Microarray designs have become increasingly probe-rich, enabling targeting of specific features, such as individual exons or single nucleotide polymorphisms. These arrays have the potential to achieve quantitative high-throughput estimates of transcript abundances, but currently these estimates are affected by biases due to cross-hybridization, in which probes hybridize to off-target transcripts

Crossref

PubMed Central

Seq-ing improved gene expression estimates from microarrays using machine learning

Author: Geeleher Paul
Korir Paul K.
Seoighe Cathal
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/09/2015
Field of study

BACKGROUND: Quantifying gene expression by RNA-Seq has several advantages over microarrays, including greater dynamic range and gene expression estimates on an absolute, rather than a relative scale. Nevertheless, microarrays remain in widespread use, demonstrated by the ever-growing numbers of samples deposited in public repositories. RESULTS: We propose a novel approach to microarray analysis that attains many of the advantages of RNA-Seq. This method, called Machine Learning of Transcript Expression (MaLTE), leverages samples for which both microarray and RNA-Seq data are available, using a Random Forest to learn the relationship between the fluorescence intensity of sets of microarray probes and RNA-Seq transcript expression estimates. We trained MaLTE on data from the Genotype-Tissue Expression (GTEx) project, consisting of Affymetrix gene arrays and RNA-Seq from over 700 samples across a broad range of human tissues. CONCLUSION: This approach can be used to accurately estimate absolute expression levels from microarray data, at both gene and transcript level, which has not previously been possible. This methodology will facilitate re-analysis of archived microarray data and broaden the utility of the vast quantities of data still being generated

Springer - Publisher Connector

Irish Universities

PubMed Central

Cork Open Research Archive

Access to Research at National University of Ireland, Galway

Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays

Author: Cam Margaret C
Lee Joseph C
Lu Jun
Salit Marc L
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous solutions for routine data analysis. RESULTS: Using AceView, a comprehensive human transcript database, we have reannotated the probes by matching them to RNA transcripts instead of genes. Based on this transcript-level annotation, a new probe set definition was created in which every probe in a probe set maps to a common set of AceView gene transcripts. In addition, using artificial data sets we identified that a minimal probe set size of 4 is necessary for reliable statistical summarization. We further demonstrate that applying the new probe set definition can detect specific transcript variants contributing to differential expression and it also improves cross-platform concordance. CONCLUSION: We conclude that our transcript-level reannotation and redefinition of probe sets complement the original Affymetrix design. Redefinitions introduce probe sets whose sizes may not support reliable statistical summarization; therefore, we advocate using our transcript-level mapping redefinition in a secondary analysis step rather than as a replacement. Knowing which specific transcripts are differentially expressed is important to properly design probe/primer pairs for validation purposes. For convenience, we have created custom chip-description-files (CDFs) and annotation files for our new probe set definitions that are compatible with Bioconductor, Affymetrix Expression Console or third party software

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Microarray data analysis and mining approaches

Author: Botta Marco
Calogero Raffaele Adolfo
Cordero Francesca
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2007
Field of study

Institutional Research Information System University of Turin

Statistical analysis of an RNA titration series evaluates microarray precision and sensitivity on a whole-array basis

Author: Bowtell David DL
Diyagama Dileepa S
Holloway Andrew J
Oshlack Alicia
Smyth Gordon K
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Concerns are often raised about the accuracy of microarray technologies and the degree of cross-platform agreement, but there are yet no methods which can unambiguously evaluate precision and sensitivity for these technologies on a whole-array basis. RESULTS: A methodology is described for evaluating the precision and sensitivity of whole-genome gene expression technologies such as microarrays. The method consists of an easy-to-construct titration series of RNA samples and an associated statistical analysis using non-linear regression. The method evaluates the precision and responsiveness of each microarray platform on a whole-array basis, i.e., using all the probes, without the need to match probes across platforms. An experiment is conducted to assess and compare four widely used microarray platforms. All four platforms are shown to have satisfactory precision but the commercial platforms are superior for resolving differential expression for genes at lower expression levels. The effective precision of the two-color platforms is improved by allowing for probe-specific dye-effects in the statistical model. The methodology is used to compare three data extraction algorithms for the Affymetrix platforms, demonstrating poor performance for the commonly used proprietary algorithm relative to the other algorithms. For probes which can be matched across platforms, the cross-platform variability is decomposed into within-platform and between-platform components, showing that platform disagreement is almost entirely systematic rather than due to measurement variability. CONCLUSION: The results demonstrate good precision and sensitivity for all the platforms, but highlight the need for improved probe annotation. They quantify the extent to which cross-platform measures can be expected to be less accurate than within-platform comparisons for predicting disease progression or outcome

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository