Search CORE

207 research outputs found

KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data

Author: Arno Velds
C Klijn
Christiaan Klijn
D Hanahan
ES Venkatraman
H Fiegler
H Holstege
Henne Holstege
Jorma J de Ronde
Jos Jonkers
K Chin
Lodewyk FA Wessels
Marcel JT Reinders
VG Tusher
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Most approaches used to find recurrent or differential DNA Copy Number Alterations (CNA) in array Comparative Genomic Hybridization (aCGH) data from groups of tumour samples depend on the discretization of the aCGH data to gain, loss or no-change states. This causes loss of valuable biological information in tumour samples, which are frequently heterogeneous. We have previously developed an algorithm, KC-SMART, that bases its estimate of the magnitude of the CNA at a given genomic location on kernel convolution (Klijn et al., 2008). This accounts for the intensity of the probe signal, its local genomic environment and the signal distribution across multiple samples. Results: Here we extend the approach to allow comparative analyses of two groups of samples and introduce the R implementation of these two approaches. The comparative module allows for a supervised analysis to be performed, to enable the identification of regions that are differentially aberrated between two user-defined classes. We analyzed data from a series of B- and T-cell lymphomas and were able to retrieve all positive control regions (VDJ regions) in addition to a number of new regions. A t-test employing segmented data, that we implemented, was also able to locate all the positive control regions and a number of new regions but these regions were highly fragmented. Conclusions: KC-SMARTR offers recurrent CNA and class specific CNA detection, at different genomic scales, in a single package without the need for additional segmentation. It is memory efficient and runs on a wide range of machines. Most importantly, it does not rely on data discretization and therefore maximally exploits the biological information in the aCGH data.MediamaticsElectrical Engineering, Mathematics and Computer Scienc

Crossref

TU Delft Repository

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A statistical framework to evaluate virtual screening

Author: AC Davison
AN Jain
AN Jain
AN Jain
B Efron
BK Shoichet
C Bissantz
C Konstantinou-Kirtay
CE Metz
CN Cavasotto
D Bamber
EO Cannon
ER DeLong
ES Venkatraman
ES Venkatraman
G Casella
G Klebe
GL Warren
H Chen
J Kirchmair
J Kirchmair
JA Hanley
JA Hanley
James M Boyett
JC Cole
JF Truchon
JL Simon
K Hevener
Kirk E Hevener
MH Zweig
N Triballeau
PF Griner
R Brenk
RD Clark
Richard E Lee
RP Sheridan
Stephen W White
TA Pham
TN Doman
Wei Zhao
XH Zhou
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Receiver operating characteristic (ROC) curve is widely used to evaluate virtual screening (VS) studies. However, the method fails to address the "early recognition" problem specific to VS. Although many other metrics, such as RIE, BEDROC, and pROC that emphasize "early recognition" have been proposed, there are no rigorous statistical guidelines for determining the thresholds and performing significance tests. Also no comparisons have been made between these metrics under a statistical framework to better understand their performances. Results We have proposed a statistical framework to evaluate VS studies by which the threshold to determine whether a ranking method is better than random ranking can be derived by bootstrap simulations and 2 ranking methods can be compared by permutation test. We found that different metrics emphasize "early recognition" differently. BEDROC and RIE are 2 statistically equivalent metrics. Our newly proposed metric SLR is superior to pROC. Through extensive simulations, we observed a "seesaw effect" – overemphasizing early recognition reduces the statistical power of a metric to detect true early recognitions. Conclusion The statistical framework developed and tested by us is applicable to any other metric as well, even if their exact distribution is unknown. Under this framework, a threshold can be easily selected according to a pre-specified type I error rate and statistical comparisons between 2 ranking methods becomes possible. The theoretical null distribution of SLR metric is available so that the threshold of SLR can be exactly determined without resorting to bootstrap simulations, which makes it easy to use in practical virtual screening studies.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

On the Adaptive Partition Approach to the Detection of Multiple Change-Points

Author: AB Olshen
B Efron
D Siegmund
ES Venkatraman
F Picard
F Picard
J Liu
JV Braun
JW Smith
Mike B. Gravenor
MJ Schell
MJ Silvapulle
P Good
P Groeneboom
R Autio
R Autio
S Dudoit
T Hastie
T Robertson
TH Cormen
TS Price
WB Wu
WS Cleveland
Yinglei Lai
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

With an adaptive partition procedure, we can partition a “time course” into consecutive non-overlapped intervals such that the population means/proportions of the observations in two adjacent intervals are significantly different at a given level . However, the widely used recursive combination or partition procedures do not guarantee a global optimization. We propose a modified dynamic programming algorithm to achieve a global optimization. Our method can provide consistent estimation results. In a comprehensive simulation study, our method shows an improved performance when it is compared to the recursive combination/partition procedures. In practice, can be determined based on a cross-validation procedure. As an application, we consider the well-known Pima Indian Diabetes data. We explore the relationship among the diabetes risk and several important variables including the plasma glucose concentration, body mass index and age

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ReadDepth: A Parallel R Package for Detecting Copy Number Alterations from Short Sequencing Reads

Author: AJ Iafrate
Aleksandar Milosavljevic
AM Snijders
C Alkan
C Coarfa
C Xie
Christopher A. Miller
Cristian Coarfa
D Pinkel
DR Bentley
DY Chiang
ES Venkatraman
F Mitelman
H Li
J Castle
J Wang
JL Freeman
K Inoue
L Shayesteh
M Frommer
MD Robinson
Oliver Hampton
P Cohen
R Lister
S Ahn
S Yoon
SA McCarroll
Stein Aerts
Y Ji
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Copy number alterations are important contributors to many genetic diseases, including cancer. We present the readDepth package for R, which can detect these aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. In addition to achieving higher accuracy than existing packages, our tool runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets. In contrast to other published methods, readDepth does not require the sequencing of a reference sample, and uses a robust statistical model that accounts for overdispersed data. It includes a method for effectively increasing the resolution obtained from low-coverage experiments by utilizing breakpoint information from paired end sequencing to do positional refinement. We also demonstrate a method for inferring copy number using reads generated by whole-genome bisulfite sequencing, thus enabling integrative study of epigenomic and copy number alterations. Finally, we apply this tool to two genomes, showing that it performs well on genomes sequenced to both low and high coverage. The readDepth package runs on Linux and MacOSX, is released under the Apache 2.0 license, and is available at http://code.google.com/p/readdepth/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A novel SNP analysis method to detect copy number alterations with an unbiased reference signal directly from tumor samples

Author: Alex Lisovich
Ashley R Brown
C Vonesch
C Vonesch
C Yau
E Ben-Yaacov
ES Venkatraman
G Yamamoto
H Bengtsson
H Bengtsson
Ian F Pollack
K Wang
M Lyons-Weiler
Maureen A Lyons-Weiler
P Freire
R Pique-Regi
Regina I Jakacki
Robert W Sobol
SJ Diskin
Uma R Chandran
William A LaFramboise
X Gai
X Zhao
Y Nannya
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) as a mechanism underlying tumorigenesis. Using microarrays and other technologies, tumor CNA are detected by comparing tumor sample CN to normal reference sample CN. While advances in microarray technology have improved detection of copy number alterations, the increase in the number of measured signals, noise from array probes, variations in signal-to-noise ratio across batches and disparity across laboratories leads to significant limitations for the accurate identification of CNA regions when comparing tumor and normal samples. Methods To address these limitations, we designed a novel "Virtual Normal" algorithm (VN), which allowed for construction of an unbiased reference signal directly from test samples within an experiment using any publicly available normal reference set as a baseline thus eliminating the need for an in-lab normal reference set. Results The algorithm was tested using an optimal, paired tumor/normal data set as well as previously uncharacterized pediatric malignant gliomas for which a normal reference set was not available. Using Affymetrix 250K Sty microarrays, we demonstrated improved signal-to-noise ratio and detected significant copy number alterations using the VN algorithm that were validated by independent PCR analysis of the target CNA regions. Conclusions We developed and validated an algorithm to provide a virtual normal reference signal directly from tumor samples and minimize noise in the derivation of the raw CN signal. The algorithm reduces the variability of assays performed across different reagent and array batches, methods of sample preservation, multiple personnel, and among different laboratories. This approach may be valuable when matched normal samples are unavailable or the paired normal specimens have been subjected to variations in methods of preservation.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt

One statistical test is sufficient for assessing new predictive markers

Author: A Gallina
AJ Vickers
AJ Vickers
AJ Vickers
Andrew J Vickers
Angel M Cronin
AR Folsom
AS Adabag
CE Metz
Colin B Begg
DG Mitchell
E Steyerberg
ER DeLong
ES Venkatraman
EW Steyerberg
FE Harrell Jr
FH Jansen
HS Wieand
JA Hanley
JA Swets
MA Hlatky
MJ Pencina
MS Pepe
MW Kattan
NR Cook
NR Cook
P Pierorazio
PA McCullough
S Kaptoge
SG Baker
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

SeqGene: a comprehensive software solution for mining exome- and transcriptome- sequencing data

Author: A Mortazavi
AL Dixon
B Langmead
BE Stranger
BT Wilhelm
C Trapnell
DB Johnson
DC Koboldt
ER Mardis
ES Venkatraman
GA Heap
GK Smyth
H Li
H Li
J Wang
JC Marioni
JI Kim
JK Pickrell
JT Robinson
KA Frazer
L Wang
M Kanehisa
MF Moffatt
N Cloonan
PA Fujita
Q Zhao
R Goya
R Li
R Lister
RM Durbin
S Sherry
SB Montgomery
SB Ng
SD Nimer
TJP Hubbard
V Ramensky
W Cookson
X Yi
XJ Yan
Xutao Deng
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The popularity of massively parallel exome and transcriptome sequencing projects demands new data mining tools with a comprehensive set of features to support a wide range of analysis tasks. Results SeqGene, a new data mining tool, supports mutation detection and annotation, dbSNP and 1000 Genome data integration, RNA-Seq expression quantification, mutation and coverage visualization, allele specific expression (ASE), differentially expressed genes (DEGs) identification, copy number variation (CNV) analysis, and gene expression quantitative trait loci (eQTLs) detection. We also developed novel methods for testing the association between SNP and expression and identifying genotype-controlled DEGs. We showed that the results generated from SeqGene compares favourably to other existing methods in our case studies. Conclusion SeqGene is designed as a general-purpose software package. It supports both paired-end reads and single reads generated on most sequencing platforms; it runs on all major types of computers; it supports arbitrary genome assemblies for arbitrary organisms; and it scales well to support both large and small scale sequencing projects. The software homepage is http://seqgene.sourceforge.net.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

pROC: an open-source package for R and S+ to analyze and compare ROC curves

Author: A Moise
AI Bandos
AI Bandos
Alexandre Hainard
B Hanczar
C Stephan
CE Metz
DK McClish
DL Streiner
ER DeLong
ES Venkatraman
ES Venkatraman
Frédérique Lisacek
G Campbell
J Carpenter
JA Hanley
JA Hanley
JA Swets
Jean-Charles Sanchez
KH Zou
M Pepe
Markus Müller
MS Pepe
N Turck
Natacha Turck
Natalia Tiberti
P Sonego
R Development Core Team
T Fawcett
T Sing
TM Braun
WJ Ewens
WN Venables
X Robin
Xavier Robin
Y Jiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Integrated genomics of ovarian xenograft tumor progression and chemotherapy response

Author: A Jemal
A Subramanian
AA Secord
AK Sood
Alexander S Brodsky
AN Gordon
Andrew Fischer
Anna Ritz
Ashley Stuckey
Benjamin J Raphael
Daniel H Miller
ES Venkatraman
F Kikkawa
GD Girnun
GR Bignell
HZ Chen
J Andrews
JJ Manfredi
JR Pollack
K Garson
K Robison
KJ Bussey
KK Deeb
KL Gorringe
Kyu K Kim
L Brard
L Brard
L Chin
Laurent Brard
M Payton
M Shackleton
M Wiench
MW Bronson
P Mishra
R Beroukhim
Rakesh K Singh
S Majumder
Sara Hillenmeyer
TG Oliver
TS Lange
TS Lange
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Ovarian cancer is the most deadly gynecological cancer with a very poor prognosis. Xenograft mouse models have proven to be one very useful tool in testing candidate therapeutic agents and gene function <it>in vivo</it>. In this study we identify genes and gene networks important for the efficacy of a pre-clinical anti-tumor therapeutic, MT19c. Methods In order to understand how ovarian xenograft tumors may be growing and responding to anti-tumor therapeutics, we used genome-wide mRNA expression and DNA copy number measurements to identify key genes and pathways that may be critical for SKOV-3 xenograft tumor progression. We compared SKOV-3 xenografts treated with the ergocalciferol derived, MT19c, to untreated tumors collected at multiple time points. Cell viability assays were used to test the function of the PPARγ agonist, Rosiglitazone, on SKOV-3 cell growth. Results These data indicate that a number of known survival and growth pathways including Notch signaling and general apoptosis factors are differentially expressed in treated vs. untreated xenografts. As tumors grow, cell cycle and DNA replication genes show increased expression, consistent with faster growth. The steroid nuclear receptor, PPARγ, was significantly up-regulated in MT19c treated xenografts. Surprisingly, stimulation of PPARγ with Rosiglitazone reduced the efficacy of MT19c and cisplatin suggesting that PPARγ is regulating a survival pathway in SKOV-3 cells. To identify which genes may be important for tumor growth and treatment response, we observed that MT19c down-regulates some high copy number genes and stimulates expression of some low copy number genes suggesting that these genes are particularly important for SKOV-3 xenograft growth and survival. Conclusions We have characterized the time dependent responses of ovarian xenograft tumors to the vitamin D analog, MT19c. Our results suggest that PPARγ promotes survival for some ovarian tumor cells. We propose that a combination of regulated expression and copy number can identify genes that are likely important for chemotherapy response. Our findings suggest a new approach to identify candidate genes that are critical for anti-tumor therapy.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Exome profiling of primary, metastatic and recurrent ovarian carcinomas in a BRCA1-positive patient

Author: A Mortazavi
Alex Ferenczy
B Ge
C Loveday
CJ Lord
DJ Osher
Emilie Lalonde
ES Venkatraman
H Johannsdottir
H Li
H Li
IA Adzhubei
J Downward
Jacek Majewski
JF Sathirapongsasuti
Jian Zhang
JMJ Piek
K Wang
KL Bolton
LA Brown
Lili Li
Luca Cavallone
M Castellarin
MS Lee
N Sangha
PA Jones
S Zhang
T Walsh
The Cancer Genome Atlas Research Network
Walter H Gotlieb
William D Foulkes
Yuhao Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref