Search CORE

56 research outputs found

SpliceMiner: a high-throughput database implementation of the NCBI Evidence Viewer for microarray splice variant analysis

Author: Jamison D Curtis
Kahn Ari B
Liu Hongfang
Ryan Michael C
Weinstein John N
Zeeberg Barry R
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: There are many fewer genes in the human genome than there are expressed transcripts. Alternative splicing is the reason. Alternatively spliced transcripts are often specific to tissue type, developmental stage, environmental condition, or disease state. Accurate analysis of microarray expression data and design of new arrays for alternative splicing require assessment of probes at the sequence and exon levels. DESCRIPTION: SpliceMiner is a web interface for querying Evidence Viewer Database (EVDB). EVDB is a comprehensive, non-redundant compendium of splice variant data for human genes. We constructed EVDB as a queryable implementation of the NCBI Evidence Viewer (EV). EVDB is based on data obtained from NCBI Entrez Gene and EV. The automated EVDB build process uses only complete coding sequences, which may or may not include partial or complete 5' and 3' UTRs, and filters redundant splice variants. Unlike EV, which supports only one-at-a-time queries, SpliceMiner supports high-throughput batch queries and provides results in an easily parsable format. SpliceMiner maps probes to splice variants, effectively delineating the variants identified by a probe. CONCLUSION: EVDB can be queried by gene symbol, genomic coordinates, or probe sequence via a user-friendly web-based tool we call SpliceMiner (). The EVDB/SpliceMiner combination provides an interface with human splice variant information and, going beyond the very valuable NCBI Evidence Viewer, supports fluent, high-throughput analysis. Integration of EVDB information into microarray analysis and design pipelines has the potential to improve the analysis and bioinformatic interpretation of gene expression data, for both batch and interactive processing. For example, whenever a gene expression value is recognized as important or appears anomalous in a microarray experiment, the interactive mode of SpliceMiner can be used quickly and easily to check for possible splice variant issues

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SpliceCenter: A suite of web-based bioinformatic applications for evaluating the impact of alternative splicing on RT-PCR, RNAi, microarray, and peptide-based studies

Author: Caplen Natasha J
Cleland James A
Kahn Ari B
Liu Hongfang
Ryan Michael C
Weinstein John N
Zeeberg Barry R
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Over 60% of protein-coding genes in vertebrates express mRNAs that undergo alternative splicing. The resulting collection of transcript isoforms poses significant challenges for contemporary biological assays. For example, RT-PCR validation of gene expression microarray results may be unsuccessful if the two technologies target different splice variants. Effective use of sequence-based technologies requires knowledge of the specific splice variant(s) that are targeted. In addition, the critical roles of alternative splice forms in biological function and in disease suggest that assay results may be more informative if analyzed in the context of the targeted splice variant. Results A number of contemporary technologies are used for analyzing transcripts or proteins. To enable investigation of the impact of splice variation on the interpretation of data derived from those technologies, we have developed SpliceCenter. SpliceCenter is a suite of user-friendly, web-based applications that includes programs for analysis of RT-PCR primer/probe sets, effectors of RNAi, microarrays, and protein-targeting technologies. Both interactive and high-throughput implementations of the tools are provided. The interactive versions of SpliceCenter tools provide visualizations of a gene's alternative transcripts and probe target positions, enabling the user to identify which splice variants are or are not targeted. The high-throughput batch versions accept user query files and provide results in tabular form. When, for example, we used SpliceCenter's batch siRNA-Check to process the Cancer Genome Anatomy Project's large-scale shRNA library, we found that only 59% of the 50,766 shRNAs in the library target all known splice variants of the target gene, 32% target some but not all, and 9% do not target any currently annotated transcript. Conclusion SpliceCenter <url>http://discover.nci.nih.gov/splicecenter</url> provides unique, user-friendly applications for assessing the impact of transcript variation on the design and interpretation of RT-PCR, RNAi, gene expression microarrays, antibody-based detection, and mass spectrometry proteomics. The tools are intended for use by bench biologists as well as bioinformaticists.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

VennMaster: Area-proportional Euler diagrams for functional GO analysis of microarrays

Author: A Saeed
André Müller
Barry R Zeeberg
BR Zeeberg
BR Zeeberg
D Hosack
David W Kane
G Allwein
HA Kestler
Hans A Kestler
Hongfang Liu
J Gil
J Kennedy
J O'Rourke
J Venn
JC Oliveros
Johann M Kraus
John N Weinstein
KH Buetow
M Buchholz
Malte Buchholz
S Chow
S Chow
SS Skiena
T Bäck
TH Cormen
The Gene Ontology Consortium
Thomas M Gress
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines

Author: A Sturn
Barry R. Zeeberg
BR Zeeberg
BR Zeeberg
C Prieto
G Zoppoli
Gerhard G. Thallinger
H Liu
H Liu
Ilya Ulasov
JK Choi
JN Weinstein
John N. Weinstein
Kurt W. Kohn
LA Garraway
M Ashburner
MC Ryan
René Snajder
RH Shoemaker
RJ Larsen
S Holbeck
SB Cho
TA Eyre
U Scherf
UT Shankavaram
UT Shankavaram
WC Reinhold
William Reinhold
Y Lai
Yves Pommier
Z Wu
Publication venue: Public Library of Science
Publication date: 24/01/2012
Field of study

The NCI-60 is a panel of 60 diverse human cancer cell lines used by the U.S. National Cancer Institute to screen compounds for anticancer activity. In the current study, gene expression levels from five platforms were integrated to yield a single composite transcriptome profile. The comprehensive and reliable nature of that dataset allows us to study gene co-expression across cancer cell lines.Hierarchical clustering revealed numerous clusters of genes in which the genes co-vary across the NCI-60. To determine functional categorization associated with each cluster, we used the Gene Ontology (GO) Consortium database and the GoMiner tool. GO maps genes to hierarchically-organized biological process categories. GoMiner can leverage GO to perform ontological analyses of gene expression studies, generating a list of significant functional categories.GoMiner analysis revealed many clusters of coregulated genes that are associated with functional groupings of GO biological process categories. Notably, those categories arising from coherent co-expression groupings reflect cancer-related themes such as adhesion, cell migration, RNA splicing, immune response and signal transduction. Thus, these clusters demonstrate transcriptional coregulation of functionally-related genes

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Nonlinear gene cluster analysis with labeling for microarray gene expression data in organ development

Author: A Sturn
B Zeeberg
B Zeeberg
Barry R Zeeberg
Brian P Brooks
CA Suàrez-Quian
CL Sigulinsky
DJ Mordantameron
Gene Ontology Consortium
Jacob Brown
JD Brown
JN Weinstein
K Pearson
L Kaufman
M Ashburner
M Belkin
M Belkin
Martin Ehler
P Langfelder
RF Bonner
Robert F Bonner
S Reichman
SP Lloyd
SR Goldstein
T Hastie
T Hestilow
Vinodh N Rajapakse
W Czaja
Wojciech Czaja
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

RedundancyMiner: De-replication of redundant GO categories in microarray and proteomics analysis

Author: A Alexa
A Sturn
AJ Richards
Ari B Kahn
Barry R Zeeberg
BR Zeeberg
BR Zeeberg
Brian P Brooks
C Herrmann
Hongfang Liu
J Wang
Jacob D Brown
JN Weinstein
JN Weinstein
John N Weinstein
K Prufer
M Ashburner
Martin Ehler
P Pehkonen
Robert F Bonner
S Bauer
S Grossmann
T Xu
Vinodh N Rajapakse
Vladimir L Larionov
William Reinhold
Y Lu
Yves G Pommier
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The Gene Ontology (GO) Consortium organizes genes into hierarchical categories based on biological process, molecular function and subcellular localization. Tools such as GoMiner can leverage GO to perform ontological analysis of microarray and proteomics studies, typically generating a list of significant functional categories. Two or more of the categories are often redundant, in the sense that identical or nearly-identical sets of genes map to the categories. The redundancy might typically inflate the report of significant categories by a factor of three-fold, create an illusion of an overly long list of significant categories, and obscure the relevant biological interpretation. Results We now introduce a new resource, RedundancyMiner, that de-replicates the redundant and nearly-redundant GO categories that had been determined by first running GoMiner. The main algorithm of RedundancyMiner, MultiClust, performs a novel form of cluster analysis in which a GO category might belong to several category clusters. Each category cluster follows a "complete linkage" paradigm. The metric is a similarity measure that captures the overlap in gene mapping between pairs of categories. Conclusions RedundancyMiner effectively eliminated redundancies from a set of GO categories. For illustration, we have applied it to the clarification of the results arising from two current studies: (1) assessment of the gene expression profiles obtained by laser capture microdissection (LCM) of serial cryosections of the retina at the site of final optic fissure closure in the mouse embryos at specific embryonic stages, and (2) analysis of a conceptual data set obtained by examining a list of genes deemed to be "kinetochore" genes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

GoMiner: a resource for biological interpretation of genomic and proteomic data

Author: Barrett J Carl
Bussey Kimberly J
Feng Weimin
Fojo Anthony T
Kane David W
Lababidi Samir
Narasimhan Sudarshan
Reinhold William C
Riss Joseph
Sunshine Margot
Wang Geoffrey
Wang May D
Weinstein John N
Zeeberg Barry R
Publication venue: BioMed Central
Publication date: 01/01/2003
Field of study

We have developed GoMiner, a program package that organizes lists of 'interesting' genes (for example, under- and overexpressed genes from a microarray experiment) for biological interpretation in the context of the Gene Ontology. GoMiner provides quantitative and statistical output files and two useful visualizations. The first is a tree-like structure analogous to that in the AmiGO browser and the second is a compact, dynamically interactive 'directed acyclic graph'. Genes displayed in GoMiner are linked to major public bioinformatics resources

Springer - Publisher Connector

PubMed Central

High-Throughput GoMiner, an 'industrial-strength' integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID)

Author: Bryant David
Burt Stanley K
Cao Hong
Cunningham-Rundles Charlotte
Elnekave Eldad
Hari Danielle M
Kane David W
Narasimhan Sudarshan
Nelson David
Qin Haiying
Reimers Mark
Stephens Robert M
Stewart Donn M
Sunshine Margot
Weinstein John N
Wynn Thomas A
Zeeberg Barry R
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: We previously developed GoMiner, an application that organizes lists of 'interesting' genes (for example, under-and overexpressed genes from a microarray experiment) for biological interpretation in the context of the Gene Ontology. The original version of GoMiner was oriented toward visualization and interpretation of the results from a single microarray (or other high-throughput experimental platform), using a graphical user interface. Although that version can be used to examine the results from a number of microarrays one at a time, that is a rather tedious task, and original GoMiner includes no apparatus for obtaining a global picture of results from an experiment that consists of multiple microarrays. We wanted to provide a computational resource that automates the analysis of multiple microarrays and then integrates the results across all of them in useful exportable output files and visualizations. RESULTS: We now introduce a new tool, High-Throughput GoMiner, that has those capabilities and a number of others: It (i) efficiently performs the computationally-intensive task of automated batch processing of an arbitrary number of microarrays, (ii) produces a human-or computer-readable report that rank-orders the multiple microarray results according to the number of significant GO categories, (iii) integrates the multiple microarray results by providing organized, global clustered image map visualizations of the relationships of significant GO categories, (iv) provides a fast form of 'false discovery rate' multiple comparisons calculation, and (v) provides annotations and visualizations for relating transcription factor binding sites to genes and GO categories. CONCLUSION: High-Throughput GoMiner achieves the desired goal of providing a computational resource that automates the analysis of multiple microarrays and integrates results across all of the microarrays. For illustration, we show an application of this new tool to the interpretation of altered gene expression patterns in Common Variable Immune Deficiency (CVID). High-Throughput GoMiner will be useful in a wide range of applications, including the study of time-courses, evaluation of multiple drug treatments, comparison of multiple gene knock-outs or knock-downs, and screening of large numbers of chemical derivatives generated from a promising lead compound

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gene expression AffyProbeMiner: a web resource for computing or retrieving accurately redefined Affymetrix probe sets

Author: A Gunes Koru
Alessandro Ferrucci
Antej Nuhanovic
Ari Kahn
Barry R Zeeberg
David W Kane
Gang Qu
Hongfang Liu
John N Weinstein
Michael C Ryan
Peter J Munson
William C Reinhold
Publication venue
Publication date: 01/01/2007
Field of study

CiteSeerX

Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets

Author: A Subramanian
AL Boulesteix
Andrew B Nobel
B Efron
BR Zeeberg
D Montaner
D Sean
Daniel M Gatti
DB Allison
dW Huang
Fred A Wright
H Ogata
HK Lee
I Dinu
I Dinu
Ivan Rusyn
J Shi
JJ Goeman
JJ Goeman
JM Fostel
JT Leek
K Virtaneva
L Klebanov
L Tian
M Ackermann
M Ashburner
M Hummel
MB Eisen
P Kaposi-Novak
R Development Core Team
RC Fry
RC Gentleman
S Song
SW Kong
SY Kim
T Barrett
T Breslin
T Sorlie
VK Mootha
William T Barry
WT Barry
WT Barry
X Lu
X Qiu
Y Zhu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Analysis of microarray experiments often involves testing for the overrepresentation of pre-defined sets of genes among lists of genes deemed individually significant. Most popular gene set testing methods assume the independence of genes within each set, an assumption that is seriously violated, as extensive correlation between genes is a well-documented phenomenon. Results We conducted a meta-analysis of over 200 datasets from the Gene Expression Omnibus in order to demonstrate the practical impact of strong gene correlation patterns that are highly consistent across experiments. We show that a common independence assumption-based gene set testing procedure produces very high false positive rates when applied to data sets for which treatment groups have been randomized, and that gene sets with high internal correlation are more likely to be declared significant. A reanalysis of the same datasets using an array resampling approach properly controls false positive rates, leading to more parsimonious and high-confidence gene set findings, which should facilitate pathway-based interpretation of the microarray data. Conclusions These findings call into question many of the gene set testing results in the literature and argue strongly for the adoption of resampling based gene set testing criteria in the peer reviewed biomedical literature.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DukeSpace

Carolina Digital Repository