Search CORE

257 research outputs found

Modeling the evolution space of breakage fusion bridge cycles with a stochastic folding process

Author: B McClintock
BJ Raphael
BJ Raphael
C Alkan
C Lemaitre
C Xie
C. D. Greenman
CD Greenman
CD Greenman
D Sankoff
ED Pleasance
ED Pleasance
G Klambauer
GR Bignell
GR Bignell
J Neggers
J. Marshall
JL Bentley
M Kinsella
M. R. Stratton
P Loo Van
P. J. Campbell
PJ Campbell
S Zakov
S. L. Cooke
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Breakage-Fusion-Bridge cycles in cancer arise when a broken segment of DNA is duplicated and an end from each copy joined together. This structure then 'unfolds' into a new piece of palindromic DNA. This is one mechanism responsible for the localised amplicons observed in cancer genome data. The process has parallels with paper folding sequences that arise when a piece of paper is folded several times and then unfolded. Here we adapt such methods to study the breakage-fusion-bridge structures in detail. We firstly consider discrete representations of this space with 2-d trees to demonstrate that there are 2^(n(n-1)/2) qualitatively distinct evolutions involving n breakage-fusion-bridge cycles. Secondly we consider the stochastic nature of the fold positions, to determine evolution likelihoods, and also describe how amplicons become localised. Finally we highlight these methods by inferring the evolution of breakage-fusion-bridge cycles with data from primary tissue cancer samples

Crossref

Springer - Publisher Connector

PubMed Central

Enlighten

University of East Anglia digital repository

That lung cancer incidence falls in ex-smokers: misconceptions 2

Author: B Altshuler
ED Pleasance
J Cairns
J Peto
JM Samet
MT Halpern
P Armitage
R Doll
Publication venue: Nature Publishing Group
Publication date: 01/02/2011
Field of study

Misconceptions and ill-founded theories can arise in all areas of science. However, the apparent accessibility of many epidemiology findings and popular interest in the subject can lead to additional misunderstandings. The article below continues an occasional series of short editorials highlighting some current misinterpretations of epidemiological findings. Invited authors will be given wide scope in judging the prevalence of the misconception under discussion. We hope that this series will prove instructive to cancer researchers in other disciplines as well as to students of epidemiology. Adrian L Harris and Leo Kinle

Crossref

LSHTM Research Online

PubMed Central

Assessing Matched Normal and Tumor Pairs in Next-Generation Sequencing Studies

Author: A McKenna
Benjamin Low
Bin Tean Teh
CC Laurie
ED Pleasance
ED Pleasance
ED Roberson
Geng Bo Chen
GL Dalgliesh
GR Abecasis
H Li
H Li
Ioana Cutcutache
Liang Goh
ML Metzker
Patrick Tan
Raya Khanin
RD Morin
S Purcell
SP Shah
Steve Rozen
W Lee
YJ Li
Z Kan
ZJ Zang
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Next generation sequencing technology has revolutionized the study of cancers. Through matched normal-tumor pairs, it is now possible to identify genome-wide germline and somatic mutations. The generation and analysis of the data requires rigorous quality checks and filtering, and the current analytical pipeline is constantly undergoing improvements. We noted however that in analyzing matched pairs, there is an implicit assumption that the sequenced data are matched, without any quality check such as those implemented in association studies. There are serious implications in this assumption as identification of germline and rare somatic variants depend on the normal sample being the matched pair. Using a genetics concept on measuring relatedness between individuals, we demonstrate that the matchedness of tumor pairs can be quantified and should be included as part of a quality protocol in analysis of sequenced data. Despite the mutation changes in cancer samples, matched tumor-normal pairs are still relatively similar in sequence compared to non-matched pairs. We demonstrate that the approach can be used to assess the mutation landscape between individuals

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Personalized Pathway Enrichment Map of Putative Cancer Genes from Next Generation Sequencing Data

Author: AF Rubin
C Greenman
D Merico
D Pinto
DW Parsons
ED Pleasance
ED Pleasance
G Getz
H Yamamoto
J Xia
L Ding
M Kanehisa
M Ladanyi
MC Wendl
MD Young
Peilin Jia
S Raychaudhuri
Steve Horvath
T Gojobori
T Sjoblom
VK Ramanan
W Lee
WF Forrest
Y Gu
Z Kan
Z Zhao
Zhongming Zhao
Publication venue: Public Library of Science
Publication date: 18/05/2012
Field of study

BACKGROUND: Pathway analysis of a set of genes represents an important area in large-scale omic data analysis. However, the application of traditional pathway enrichment methods to next-generation sequencing (NGS) data is prone to several potential biases, including genomic/genetic factors (e.g., the particular disease and gene length) and environmental factors (e.g., personal life-style and frequency and dosage of exposure to mutagens). Therefore, novel methods are urgently needed for these new data types, especially for individual-specific genome data. METHODOLOGY: In this study, we proposed a novel method for the pathway analysis of NGS mutation data by explicitly taking into account the gene-wise mutation rate. We estimated the gene-wise mutation rate based on the individual-specific background mutation rate along with the gene length. Taking the mutation rate as a weight for each gene, our weighted resampling strategy builds the null distribution for each pathway while matching the gene length patterns. The empirical P value obtained then provides an adjusted statistical evaluation. PRINCIPAL FINDINGS/CONCLUSIONS: We demonstrated our weighted resampling method to a lung adenocarcinomas dataset and a glioblastoma dataset, and compared it to other widely applied methods. By explicitly adjusting gene-length, the weighted resampling method performs as well as the standard methods for significant pathways with strong evidence. Importantly, our method could effectively reject many marginally significant pathways detected by standard methods, including several long-gene-based, cancer-unrelated pathways. We further demonstrated that by reducing such biases, pathway crosstalk for each individual and pathway co-mutation map across multiple individuals can be objectively explored and evaluated. This method performs pathway analysis in a sample-centered fashion, and provides an alternative way for accurate analysis of cancer-personalized genomes. It can be extended to other types of genomic data (genotyping and methylation) that have similar bias problems

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Ranking insertion, deletion and nonsense mutations based on their effect on genetic information

Author: A Bairoch
Alan M Moses
AM Moses
Amin Zia
AV Dalca
B Konopka
CD Bustamante
CD Bustamante
DC King
DL Hartl
ED Pleasance
ED Pleasance
EL Goode
EL Sonnhammer
ET Dermitzakis
FW Allendorf
G Liti
GP Zambetti
J Warringer
JA Drake
JC Fay
JC Knight
JC Knight
JM Mullaney
JN Hirschhorn
L Ettwiller
L Feuk
M Kellis
M Kimura
MI McCarthy
N Gibbs
P Cliften
PC Ng
PC Ng
PD Keightley
R DeSalle
R Durbin
S Chun
SA Boyadjiev
SA Rasmussen
SA Sawyer
SF Altschul
SM Post
SW Doniger
T Barøy
T Soussi
TD Schnider
The Cystic Fibrosis Genetic Analysis Consortium
V Balz
X Xie
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Genetic variations contribute to normal phenotypic differences as well as diseases, and new sequencing technologies are greatly increasing the capacity to identify these variations. Given the large number of variations now being discovered, computational methods to prioritize the functional importance of genetic variations are of growing interest. Thus far, the focus of computational tools has been mainly on the prediction of the effects of amino acid changing single nucleotide polymorphisms (SNPs) and little attention has been paid to indels or nonsense SNPs that result in premature stop codons. Results We propose computational methods to rank insertion-deletion mutations in the coding as well as non-coding regions and nonsense mutations. We rank these variations by measuring the extent of their effect on biological function, based on the assumption that evolutionary conservation reflects function. Using sequence data from budding yeast and human, we show that variations which that we predict to have larger effects segregate at significantly lower allele frequencies, and occur less frequently than expected by chance, indicating stronger purifying selection. Furthermore, we find that insertions, deletions and premature stop codons associated with disease in the human have significantly larger predicted effects than those not associated with disease. Interestingly, the large-effect mutations associated with disease show a similar distribution of predicted effects to that expected for completely random mutations. Conclusions This demonstrates that the evolutionary conservation context of the sequences that harbour insertions, deletions and nonsense mutations can be used to predict and rank the effects of the mutations.</p

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Targeted high throughput sequencing in clinical cancer Settings: formaldehyde fixed-paraffin embedded (FFPE) tumor tissues, input amount and tumor heterogeneity

Author: A Gnirke
A Weise
AW Briggs
B Timmermann
Berger MSL F Michael
Bernd Timmermann
BS Taylor
C Greenman
CA Macintosh
D Aird
DG Bostwick
DT Okou
DW Bell
DW Parsons
E Hodges
ED Pleasance
ED Pleasance
G Bartsch
Georg Bartsch
Georg Schaefer
GJ Porreca
Hans Lehrach
Helmut Klocker
HM Wood
Holger Sültmann
Irmgard Verdorfer
J Clark
J Yu
L Ding
LD Wood
M Aihara
M Barry
Martin Kerick
Melanie Isau
Michal R Schweiger
MR Schweiger
MR Schweiger
N Navin
PM Krawitz
R Mehra
R Mehra
R Yatani
Ralf Herwig
RB Shah
S Jones
SB Ng
SP Shah
Sylvia Krobitsch
T Shiraishi
T Sjoblom
TJ Albert
TJ Ley
W Horninger
W Lee
W Liu
WA Sakr
Z Kan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Massively parallel sequencing technologies have brought an enormous increase in sequencing throughput. However, these technologies need to be further improved with regard to reproducibility and applicability to clinical samples and settings. Methods Using identification of genetic variations in prostate cancer as an example we address three crucial challenges in the field of targeted re-sequencing: Small nucleotide variation (SNV) detection in samples of formalin-fixed paraffin embedded (FFPE) tissue material, minimal amount of input sample and sampling in view of tissue heterogeneity. Results We show that FFPE tissue material can supplement for fresh frozen tissues for the detection of SNVs and that solution-based enrichment experiments can be accomplished with small amounts of DNA with only minimal effects on enrichment uniformity and data variance. Finally, we address the question whether the heterogeneity of a tumor is reflected by different genetic alterations, e.g. different foci of a tumor display different genomic patterns. We show that the tumor heterogeneity plays an important role for the detection of copy number variations. Conclusions The application of high throughput sequencing technologies in cancer genomics opens up a new dimension for the identification of disease mechanisms. In particular the ability to use small amounts of FFPE samples available from surgical tumor resections and histopathological examinations facilitates the collection of precious tissue materials. However, care needs to be taken in regard to the locations of the biopsies, which can have an influence on the prediction of copy number variations. Bearing these technological challenges in mind will significantly improve many large-scale sequencing studies and will - in the long term - result in a more reliable prediction of individual cancer therapies.</p

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers

Author: A Baross
AB Olshen
AJ Bass
AJ Holland
B Nilsson
BA Weir
Barbara Hill
BM Bolstad
BS Taylor
C Greenman
C Li
C Li
Craig H Mermel
D Chiang
D Etemadmoghadam
D Hanahan
DY Chiang
E Pleasance
ED Pleasance
ES Venkatraman
F Sanchez-Garcia
G Schwarz
Gad Getz
GR Bignell
HS Dahlback
LM Merlo
M Guttman
M Metzker
Matthew L Meyerson
MR Stratton
Network CGAR
NT Leach
P Hupé
PA Northcott
PJ Stephens
PJ Stephens
R Beroukhim
R Beroukhim
R Firestein
R McLendon
Rameen Beroukhim
SA McCarroll
SJ Diskin
SP Shah
Steven E Schumacher
T Santarius
T Sjoblom
WM Lin
Y Benjamini
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets

DSpace@MIT

Crossref

Springer - Publisher Connector

PubMed Central

Somatic Mutation Profiles of MSI and MSS Colorectal Cancer Identified by Whole Exome Next Generation Sequencing and Bioinformatics Analysis

BACKGROUND: Colorectal cancer (CRC) is with approximately 1 million cases the third most common cancer worldwide. Extensive research is ongoing to decipher the underlying genetic patterns with the hope to improve early cancer diagnosis and treatment. In this direction, the recent progress in next generation sequencing technologies has revolutionized the field of cancer genomics. However, one caveat of these studies remains the large amount of genetic variations identified and their interpretation. METHODOLOGY/PRINCIPAL FINDINGS: Here we present the first work on whole exome NGS of primary colon cancers. We performed 454 whole exome pyrosequencing of tumor as well as adjacent not affected normal colonic tissue from microsatellite stable (MSS) and microsatellite instable (MSI) colon cancer patients and identified more than 50,000 small nucleotide variations for each tissue. According to predictions based on MSS and MSI pathomechanisms we identified eight times more somatic non-synonymous variations in MSI cancers than in MSS and we were able to reproduce the result in four additional CRCs. Our bioinformatics filtering approach narrowed down the rate of most significant mutations to 359 for MSI and 45 for MSS CRCs with predicted altered protein functions. In both CRCs, MSI and MSS, we found somatic mutations in the intracellular kinase domain of bone morphogenetic protein receptor 1A, BMPR1A, a gene where so far germline mutations are associated with juvenile polyposis syndrome, and show that the mutations functionally impair the protein function. CONCLUSIONS/SIGNIFICANCE: We conclude that with deep sequencing of tumor exomes one may be able to predict the microsatellite status of CRC and in addition identify potentially clinically relevant mutations

Institutional Repository of the Freie Universität Berlin

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Paired Tumor and Normal Whole Genome Sequencing of Metastatic Olfactory Neuroblastoma

Olfactory neuroblastoma (ONB) is a rare cancer of the sinonasal tract with little molecular characterization. We performed whole genome sequencing (WGS) on paired normal and tumor DNA from a patient with metastatic-ONB to identify the somatic alterations that might be drivers of tumorigenesis and/or metastatic progression.Genomic DNA was isolated from fresh frozen tissue from a metastatic lesion and whole blood, followed by WGS at >30X depth, alignment and mapping, and mutation analyses. Sanger sequencing was used to confirm selected mutations. Sixty-two somatic short nucleotide variants (SNVs) and five deletions were identified inside coding regions, each causing a non-synonymous DNA sequence change. We selected seven SNVs and validated them by Sanger sequencing. In the metastatic ONB samples collected several months prior to WGS, all seven mutations were present. However, in the original surgical resection specimen (prior to evidence of metastatic disease), mutations in KDR, MYC, SIN3B, and NLRC4 genes were not present, suggesting that these were acquired with disease progression and/or as a result of post-treatment effects.This work provides insight into the evolution of ONB cancer cells and provides a window into the more complex factors, including tumor clonality and multiple driver mutations

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Unexpected effects of different genetic backgrounds on identification of genomic rearrangements via whole-genome next generation sequencing

Author: A Nussenzweig
Ameet K. Mishra
Bifeng Gao
C Boboila
C Boboila
C Boboila
C Guan
CG Mullighan
CT Yan
DF Conrad
ED Pleasance
ED Pleasance
FW Alt
IA Klein
J Jonkers
J Wang
JD Rowley
JH Wang
JH Wang
JH Wang
JH Wang
Jing H. Wang
Katherine Gowan
Katrina Diener
Kenneth Jones
L Pikor
M Krzywinski
M Meyerson
MJ Difilippantonio
MJ Difilippantonio
MR Lieber
PJ Campbell
PJ Stephens
R Chiarle
R Dalla-Favera
RD Morin
RE Handsaker
RE Mills
S Casola
Sawanee S. Viboolsittiseri
Sonia M. Leach
T Honjo
T Ochi
T Xie
Tanya Kadoishi
TM Keane
Y Wang
Z Chen
Z Chen
Z Chen
Zhangguo Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref