Search CORE

36 research outputs found

Learning from microarray interlaboratory studies: measures of precision for gene expression

Author: Duewer David L
Jones Wendell D
Reid Laura H
Salit Marc
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The ability to demonstrate the reproducibility of gene expression microarray results is a critical consideration for the use of microarray technology in clinical applications. While studies have asserted that microarray data can be "highly reproducible" under given conditions, there is little ability to quantitatively compare amongst the various metrics and terminology used to characterize and express measurement performance. Use of standardized conceptual tools can greatly facilitate communication among the user, developer, and regulator stakeholders of the microarray community. While shaped by less highly multiplexed systems, measurement science (metrology) is devoted to establishing a coherent and internationally recognized vocabulary and quantitative practice for the characterization of measurement processes. Results The two independent aspects of the metrological concept of "accuracy" are "trueness" (closeness of a measurement to an accepted reference value) and "precision" (the closeness of measurement results to each other). A carefully designed collaborative study enables estimation of a variety of gene expression measurement precision metrics: repeatability, several flavors of intermediate precision, and reproducibility. The three 2004 Expression Analysis Pilot Proficiency Test collaborative studies, each with 13 to 16 participants, provide triplicate microarray measurements on each of two reference RNA pools. Using and modestly extending the consensus ISO 5725 documentary standard, we evaluate the metrological precision figures of merit for individual microarray signal measurement, building from calculations appropriate to single measurement processes, such as technical replicate expression values for individual probes on a microarray, to the estimation and display of precision functions representing all of the probes in a given platform. Conclusion With only modest extensions, the established metrological framework can be fruitfully used to characterize the measurement performance of microarray and other highly multiplexed systems. Precision functions, summarizing routine precision metrics estimated from appropriately repeated measurements of one or more reference materials as functions of signal level, are demonstrated and merit further development for characterizing measurement platforms, monitoring changes in measurement system performance, and comparing performance among laboratories or analysts.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays

Author: Cam Margaret C
Lee Joseph C
Lu Jun
Salit Marc L
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous solutions for routine data analysis. RESULTS: Using AceView, a comprehensive human transcript database, we have reannotated the probes by matching them to RNA transcripts instead of genes. Based on this transcript-level annotation, a new probe set definition was created in which every probe in a probe set maps to a common set of AceView gene transcripts. In addition, using artificial data sets we identified that a minimal probe set size of 4 is necessary for reliable statistical summarization. We further demonstrate that applying the new probe set definition can detect specific transcript variants contributing to differential expression and it also improves cross-platform concordance. CONCLUSION: We conclude that our transcript-level reannotation and redefinition of probe sets complement the original Affymetrix design. Redefinitions introduce probe sets whose sizes may not support reliable statistical summarization; therefore, we advocate using our transcript-level mapping redefinition in a secondary analysis step rather than as a replacement. Knowing which specific transcripts are differentially expressed is important to properly design probe/primer pairs for validation purposes. For convenience, we have created custom chip-description-files (CDFs) and annotation files for our new probe set definitions that are compatible with Bioconductor, Affymetrix Expression Console or third party software

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

PEPR: pipelines for evaluating prokaryotic references

Author: Daniel V. Samarov
Justin M. Zook
Marc L. Salit
Nathan D. Olson
Scott A. Jackson
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A framework for assessing 16S rRNA marker-gene survey data analysis methods using mixtures.

Author: Braccia Domenick J.
Corrada Bravo Hector
Hao Stephanie
Kumar M. Senthil
Li Shan
Olson Nathan D.
Salit Marc L.
Stine O. Colin
Timp Winston
Publication venue: Springer Nature
Publication date: 13/03/2020
Field of study

There are a variety of bioinformatic pipelines and downstream analysis methods for analyzing 16S rRNA marker-gene surveys. However, appropriate assessment datasets and metrics are needed as there is limited guidance to decide between available analysis methods. Mixtures of environmental samples are useful for assessing analysis methods as one can evaluate methods based on calculated expected values using unmixed sample measurements and the mixture design. Previous studies have used mixtures of environmental samples to assess other sequencing methods such as RNAseq. But no studies have used mixtures of environmental to assess 16S rRNA sequencing. We developed a framework for assessing 16S rRNA sequencing analysis methods which utilizes a novel two-sample titration mixture dataset and metrics to evaluate qualitative and quantitative characteristics of count tables. Our qualitative assessment evaluates feature presence/absence exploiting features only present in unmixed samples or titrations by testing if random sampling can account for their observed relative abundance. Our quantitative assessment evaluates feature relative and differential abundance by comparing observed and expected values. We demonstrated the framework by evaluating count tables generated with three commonly used bioinformatic pipelines: (i) DADA2 a sequence inference method, (ii) Mothur a de novo clustering method, and (iii) QIIME an open-reference clustering method. The qualitative assessment results indicated that the majority of Mothur and QIIME features only present in unmixed samples or titrations were accounted for by random sampling alone, but this was not the case for DADA2 features. Combined with count table sparsity (proportion of zero-valued cells in a count table), these results indicate DADA2 has a higher false-negative rate whereas Mothur and QIIME have higher false-positive rates. The quantitative assessment results indicated the observed relative abundance and differential abundance values were consistent with expected values for all three pipelines. We developed a novel framework for assessing 16S rRNA marker-gene survey methods and demonstrated the framework by evaluating count tables generated with three bioinformatic pipelines. This framework is a valuable community resource for assessing 16S rRNA marker-gene survey bioinformatic methods and will help scientists identify appropriate analysis methods for their marker-gene surveys.https://doi.org/10.1186/s40168-020-00812-

Digital Repository at the University of Maryland

Exploring the use of internal and externalcontrols for assessing microarray technical performance

Author: A Kauffmann
A Kauffmann
A Kauffmann
AA Ellington
Affymetrix Inc
AJ Holloway
BM Bolstad
C Tomlinson
CA Ball
CL Yauk
David L Duewer
DL Duewer
EF Petricoin
FCP Holstege
GA Held
H van Bakel
Helen C Causton
IV Yang
J Brettschneider
Katrice A Lippa
L Gautier
Laurence Game
LH Reid
LJ Kricka
LM Shi
M Navarange
Marc L Salit
ML Salit
MN McCall
R Mansourian
RA Irizarry
RA Irizarry
SC Baker
SE Choe
TR Hughes
W Liggett
W Zhang
WD Tong
XH Fan
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The maturing of gene expression microarray technology and interest in the use of microarray-based applications for clinical and diagnostic applications calls for quantitative measures of quality. This manuscript presents a retrospective study characterizing several approaches to assess technical performance of microarray data measured on the Affymetrix GeneChip platform, including whole-array metrics and information from a standard mixture of external spike-in and endogenous internal controls. Spike-in controls were found to carry the same information about technical performance as whole-array metrics and endogenous "housekeeping" genes. These results support the use of spike-in controls as general tools for performance assessment across time, experimenters and array batches, suggesting that they have potential for comparison of microarray data generated across species using different technologies. Results A layered PCA modeling methodology that uses data from a number of classes of controls (spike-in hybridization, spike-in polyA+, internal RNA degradation, endogenous or "housekeeping genes") was used for the assessment of microarray data quality. The controls provide information on multiple stages of the experimental protocol (e.g., hybridization, RNA amplification). External spike-in, hybridization and RNA labeling controls provide information related to both assay and hybridization performance whereas internal endogenous controls provide quality information on the biological sample. We find that the variance of the data generated from the external and internal controls carries critical information about technical performance; the PCA dissection of this variance is consistent with whole-array quality assessment based on a number of quality assurance/quality control (QA/QC) metrics. Conclusions These results provide support for the use of both external and internal RNA control data to assess the technical quality of microarray experiments. The observed consistency amongst the information carried by internal and external controls and whole-array quality measures offers promise for rationally-designed control standards for routine performance monitoring of multiplexed measurement platforms.</p

Crossref

Springer - Publisher Connector

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

Summarizing performance for genome scale measurement of miRNA: reference samples and metrics

Author: Pine P. Scott
Lund Steven P.
Parsons Jerod R.
Vang Lindsay K.
Mahabal Ashish A.
Cinquini Luca
Kelly Sean C.
Kincaid Heather
Crichton Daniel J.
Spira Avrum
Liu Gang
Gower Adam C.
Pass Harvey I.
Goparaju Chandra
Dubinett Steven M.
Krysan Kostyantyn
Stass Sanford A.
Kukuruga Debra
Van Keuren-Jensen Kendall
Courtright-Lim Amanda
Thompson Karol L.
Rosenzweig Barry A.
Sorbara Lynn
Srivastava Sudhir
Salit Marc L.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/08/2017
Field of study

Background: The potential utility of microRNA as biomarkers for early detection of cancer and other diseases is being investigated with genome-scale profiling of differentially expressed microRNA. Processes for measurement assurance are critical components of genome-scale measurements. Here, we evaluated the utility of a set of total RNA samples, designed with between-sample differences in the relative abundance of miRNAs, as process controls. Results: Three pure total human RNA samples (brain, liver, and placenta) and two different mixtures of these components were evaluated as measurement assurance control samples on multiple measurement systems at multiple sites and over multiple rounds. In silico modeling of mixtures provided benchmark values for comparison with physical mixtures. Biomarker development laboratories using next-generation sequencing (NGS) or genome-scale hybridization assays participated in the study and returned data from the samples using their routine workflows. Multiplexed and single assay reverse-transcription PCR (RT-PCR) was used to confirm in silico predicted sample differences. Data visualizations and summary metrics for genome-scale miRNA profiling assessment were developed using this dataset, and a range of performance was observed. These metrics have been incorporated into an online data analysis pipeline and provide a convenient dashboard view of results from experiments following the described design. The website also serves as a repository for the accumulation of performance values providing new participants in the project an opportunity to learn what may be achievable with similar measurement processes. Conclusions: The set of reference samples used in this study provides benchmark values suitable for assessing genome-scale miRNA profiling processes. Incorporation of these metrics into an online resource allows laboratories to periodically evaluate their performance and assess any changes introduced into their measurement process

Hunter Library Digital Collections (Western Carolina University)

Caltech Authors

Summarizing performance for genome scale measurement of miRNA: reference samples and metrics

Author: Cinquini Luca
Courtright-Lim Amanda
Crichton Daniel J.
Dubinett Steven M.
Goparaju Chandra
Gower Adam C.
Kelly Sean C.
Kincaid Heather
Krysan Kostyantyn
Kukuruga Debra
Liu Gang
Lund Steven P.
Mahabal Ashish A.
Parsons Jerod R.
Pass Harvey I.
Pine P. Scott
Rosenzweig Barry A.
Salit Marc L.
Sorbara Lynn
Spira Avrum
Srivastava Sudhir
Stass Sanford A.
Thompson Karol L.
Van Keuren-Jensen Kendall
Vang Lindsay K.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/03/2018
Field of study

Using mixtures of biological samples as process controls for RNA-sequencing experiments

Author: A Kuhn
AL Durham
B Li
C Bock
C Trapnell
D Thierry-Mieg
DL Duewer
EL Van Dijk
External RNA Controls Consortium
G Quon
G Quon
GK Fu
GP Wagner
J Harrow
Jennifer McDaniel
Jerod Parsons
K Yoshihara
KD Hansen
KD Pruitt
KL Thompson
KR Rosenbloom
L Jiang
MAQC Consortium
Marc Salit
Michele Mehaffey
NF Lahens
NK Jue
P Brennecke
P. Scott Pine
PS Pine
R Gaujoux
R Shippy
Sarah Munro
SC Baker
SS Shen-Orr
SS Shen-Orr
T Gong
T Gong
T Hart
T Raz
W Liu
Y Li
Y Yuan
Y Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref