
Springer - Publisher Connector

Targeted sequencing library preparation by genomic DNA circularization

Author: Bell John M
Ji Hanlee P
Myllykangas Samuel
Natsoulis Georges
Publication venue: BioMed Central
Publication date: 01/01/2011

Identification of Insertion Deletion Mutations from Deep Targeted Resequencing

Author: Bell John
Ji Hanlee P
Natsoulis Georges
Welch Katrina
Zhang Nancy R
Publication venue: ScholarlyCommons
Publication date: 01/01/2013

Taking advantage of the deep targeted sequencing capabilities of next-generation sequencers, we have developed a novel two-step insertion-deletion (indel) detection algorithm (IDA) that can determine indels from single read sequences with high computational efficiency and sensitivity, even when indel-bearing reads make up only a small fraction relative to the wild-type reference sequence. First, it identifies candidate indel positions utilizing specific sequence alignment artifacts produced by rapid alignment programs. Second, it confirms the location of the candidate indel by using the Smith-Waterman (SW) algorithm on a restricted subset of sequence reads. We demonstrate that IDA is applicable to indels of varying sizes from deep targeted sequencing data at low fractions where the indel is diluted by wild-type sequence. Our algorithm is useful in detecting indel variants present at variable allelic frequencies, such as may occur in heterozygotes and mixed normal-tumor tissue.
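The confirmation step named in this abstract relies on Smith-Waterman local alignment. The following is a minimal sketch of that general technique only, not the IDA implementation: the function name, the linear gap penalty, and the scoring values (match 2, mismatch -1, gap -2) are all assumptions chosen for illustration.

```python
def smith_waterman(read, ref, match=2, mismatch=-1, gap=-2):
    """Return (score, aligned_read, aligned_ref) for the best local alignment.

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    rows, cols = len(read) + 1, len(ref) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill the dynamic-programming matrix; scores are floored at zero.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if read[i - 1] == ref[j - 1] else mismatch
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + sub,  # align read base to ref base
                score[i - 1][j] + gap,      # read base against a gap in ref
                score[i][j - 1] + gap,      # ref base against a gap in read
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero score is reached.
    i, j = best_pos
    a_read, a_ref = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        sub = match if read[i - 1] == ref[j - 1] else mismatch
        if score[i][j] == score[i - 1][j - 1] + sub:
            a_read.append(read[i - 1]); a_ref.append(ref[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            a_read.append(read[i - 1]); a_ref.append("-")
            i -= 1
        else:
            a_read.append("-"); a_ref.append(ref[j - 1])
            j -= 1
    return best, "".join(reversed(a_read)), "".join(reversed(a_ref))


# Hypothetical example: the read lacks the two bases "GG" present in the reference.
reference = "ACGTTGCAGGCATTAGCC"
read_with_deletion = "ACGTTGCACATTAGCC"
sw_score, sw_read, sw_ref = smith_waterman(read_with_deletion, reference)
```

In this toy case the gap characters in the aligned read mark the candidate deletion. Restricting the alignment to reads that overlap a candidate position, as the abstract describes, keeps the quadratic cost of the dynamic programming manageable.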

A cross-sample statistical model for SNP detection in short-read sequencing data

Author: Bansal
Bentley
Dahl
Daniel Newburger
DePristo
Efron
Georges Natsoulis
Hanlee Ji
Hoberman
Holt
Hua Xu
Itai Kela
John Bell
Li
Li
Li
McKenna
Nancy Zhang
Natsoulis
Omkar Muralidharan
Ossowski
Shendure
Van Tassell
Publication venue: Oxford University Press

Highly multiplex DNA sequencers have greatly expanded our ability to survey human genomes for previously unknown single nucleotide polymorphisms (SNPs). However, sequencing and mapping errors, though rare, contribute substantially to the number of false discoveries in current SNP callers. We demonstrate that we can significantly reduce the number of false positive SNP calls by pooling information across samples. Although many studies prepare and sequence multiple samples with the same protocol, most existing SNP callers ignore cross-sample information. In contrast, we propose an empirical Bayes method that uses cross-sample information to learn the error properties of the data. This error information lets us call SNPs with a lower false discovery rate than existing methods.
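To illustrate the general idea of learning error properties from several samples, here is a generic empirical Bayes shrinkage sketch. It is not the model from this paper: it assumes a Beta prior on the per-sample non-reference fraction at one position, fitted by the method of moments, and all function names and numbers are hypothetical.

```python
from statistics import mean, pvariance


def fit_beta_moments(fractions):
    """Fit Beta(alpha, beta) to observed fractions by the method of moments."""
    m = mean(fractions)
    v = pvariance(fractions)
    if v <= 0 or v >= m * (1 - m):
        raise ValueError("variance is incompatible with a Beta distribution")
    common = m * (1 - m) / v - 1
    return m * common, (1 - m) * common


def posterior_mean(nonref, depth, alpha, beta):
    """Posterior mean of the non-reference fraction under the fitted Beta prior."""
    return (nonref + alpha) / (depth + alpha + beta)


# Hypothetical non-reference fractions at one position in three samples
# that are assumed to carry no true variant there.
background = [0.001, 0.002, 0.003]
alpha, beta = fit_beta_moments(background)

# A new low-depth sample with zero non-reference reads is pulled toward the prior,
# and one with many non-reference reads is pulled down from its raw fraction.
shrunk_zero = posterior_mean(0, 100, alpha, beta)
shrunk_high = posterior_mean(50, 100, alpha, beta)
```

The prior is estimated from the data rather than fixed in advance, which is what makes the approach empirical Bayes. A real caller would also need a decision rule with controlled false discovery rate, which this sketch omits.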

Ultrasensitive detection of rare mutations using next-generation targeted resequencing

Author: Baldi
Bansal
Bansal
Dohm
Druley
Georges Natsoulis
Hanlee P. Ji
Hedskog
Jason Buenrostro
John Bell
Koboldt
Kuroda
Li
Mark Holodniy
Mark Winters
McKenna
Nancy Zhang
Natsoulis
Nejentsev
Nollau
Omkar Muralidharan
Patrick Flaherty
Sheldon Brown
Shendure
Shi
Simi
Thomas
Tsibris
Wang
Xiao
Publication venue: Oxford University Press
Publication date: 19/10/2011

With next-generation DNA sequencing technologies, one can interrogate a specific genomic region of interest at very high depth of coverage and identify less prevalent, rare mutations in heterogeneous clinical samples. However, the mutation detection levels are limited by the error rate of the sequencing technology as well as by the availability of variant-calling algorithms with high statistical power and low false positive rates. We demonstrate that we can robustly detect mutations at 0.1% fractional representation. This represents accurate detection of one mutant per every 1000 wild-type alleles. To achieve this sensitive level of mutation detection, we integrate a high accuracy indexing strategy and reference replication for estimating sequencing error variance. We employ a statistical model to estimate the error rate at each position of the reference and to quantify the fraction of variant base in the sample. Our method is highly specific (99%) and sensitive (100%) when validated on a known 0.1% sample-fraction admixture of two synthetic DNA samples. As a clinical application of this method, we analyzed nine clinical samples of H1N1 influenza A and detected an oseltamivir (antiviral therapy) resistance mutation in the H1N1 neuraminidase gene at a sample fraction of 0.18%.
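The idea of comparing a sample against a position-specific error rate estimated from reference replicates can be sketched with a simple binomial test. This is a stand-in under stated assumptions, not the statistical model of the paper: it pools replicates into a single rate and ignores error variance, and every count below is hypothetical.

```python
from math import exp, lgamma, log


def estimate_error_rate(nonref_counts, depths):
    """Pooled non-reference rate at one position across reference replicates."""
    return sum(nonref_counts) / sum(depths)


def log_binom_pmf(k, n, p):
    """Log of the Binomial(n, p) probability mass at k, stable for large n."""
    return (lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
            + k * log(p) + (n - k) * log(1 - p))


def upper_tail_pvalue(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): chance that errors alone give k or more reads."""
    return min(1.0, sum(exp(log_binom_pmf(i, n, p)) for i in range(k, n + 1)))


# Hypothetical reference replicates: non-reference reads and depth at one position.
error_rate = estimate_error_rate([3, 5, 4], [10000, 10000, 10000])

# Hypothetical samples sequenced to depth 10000 at the same position.
p_variant = upper_tail_pvalue(18, 10000, error_rate)  # 0.18% non-reference reads
p_noise = upper_tail_pvalue(4, 10000, error_rate)     # consistent with the error rate
```

A small p-value suggests the observed non-reference reads are unlikely to be sequencing error alone. Testing many positions would additionally require a multiple-testing correction.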

CiteSeerX

ScholarWorks@UMass Amherst

The liver pharmacological and xenobiotic gene response repertoire

Author: Alan H Roter
Barrett P Eynon
Cecelia I Pearson
Georges Natsoulis
Hastie T
Ioannides C
Jeremy Gollub
Joe Ferng
Kurt Jarnagin
Mark R Fielden
May D Lee
Radha Idury
Ramesh Nair
Richard J Brennan
Publication venue: Nature Publishing Group

We have used a supervised classification approach to systematically mine a large microarray database derived from livers of compound-treated rats. Thirty-four distinct signatures (classifiers) for pharmacological and toxicological end points can be identified. Just 200 genes are sufficient to classify these end points. Signatures were enriched in xenobiotic and immune response genes and contain un-annotated genes, indicating that not all key genes in the liver xenobiotic responses have been characterized. Many signatures with equal classification capabilities but with no gene in common can be derived for the same phenotypic end point. The analysis of the union of all genes present in these signatures can reveal the underlying biology of that end point, as illustrated here using liver fibrosis signatures. Our approach using the whole genome and a diverse set of compounds allows a comprehensive view of most pharmacological and toxicological questions and is applicable to other situations such as disease and development.

Springer - Publisher Connector

Metastatic Tumor Evolution and Organoid Modeling Implicate TGFBR2 as a Cancer Driver in Diffuse Gastric Cancer

Author: Bell John M
Chen Hao
Flaherty Patrick
Ford James M
Garcia Sarah
Hopmans Erik S
Ji Hanlee P
Kuo Calvin J
Miotke Laura
Nadauld Lincoln
Natsoulis Georges
Ootani Akifumi
Pai Reetesh K
Palm Curt
Regan John F
Xu Hua
Zhang Nancy R
Publication venue: ScholarlyCommons
Publication date: 01/01/2014

Background: Gastric cancer is the second-leading cause of global cancer deaths, with metastatic disease representing the primary cause of mortality. To identify candidate drivers involved in oncogenesis and tumor evolution, we conduct an extensive genome sequencing analysis of metastatic progression in a diffuse gastric cancer. This involves a comparison between a primary tumor from a hereditary diffuse gastric cancer syndrome proband and its recurrence as an ovarian metastasis. Results: Both the primary tumor and ovarian metastasis have common biallelic loss-of-function of both the CDH1 and TP53 tumor suppressors, indicating a common genetic origin. While the primary tumor exhibits amplification of the Fibroblast growth factor receptor 2 (FGFR2) gene, the metastasis notably lacks FGFR2 amplification but rather possesses unique biallelic alterations of Transforming growth factor-beta receptor 2 (TGFBR2), indicating the divergent in vivo evolution of a TGFBR2-mutant metastatic clonal population in this patient. As TGFBR2 mutations have not previously been functionally validated in gastric cancer, we modeled the metastatic potential of TGFBR2 loss in a murine three-dimensional primary gastric organoid culture. The Tgfbr2 shRNA knockdown within Cdh1-/-; Tp53-/- organoids generates invasion in vitro and robust metastatic tumorigenicity in vivo, confirming Tgfbr2 metastasis suppressor activity. Conclusions: We document the metastatic differentiation and genetic heterogeneity of diffuse gastric cancer and reveal the potential metastatic role of TGFBR2 loss-of-function. In support of this study, we apply a murine primary organoid culture method capable of recapitulating in vivo metastatic gastric cancer. Overall, we describe an integrated approach to identify and functionally validate putative cancer drivers involved in metastasis.

D-Scholarship@Pitt

Public Library of Science (PLOS)

A Flexible Approach for Highly Multiplexed Candidate Gene Targeted Resequencing

Author: A Gnirke
A Kimura
A McKenna
C Yeang
D Bentley
Daniel Newburger
DR Bentley
F Dahl
F Di Fiore
Georges Natsoulis
H Jiang
H Li
Hanlee P. Ji
Heather Ordonez
Hua Xu
J Jurka
J Shendure
J Stenberg
Jacob M. Zahn
Jason D. Buenrostro
JM Clark
John M. Bell
KD Pruitt
L Mamanova
M Brink
M Margulies
MA Quail
Michael Jensen
MN Bainbridge
Nancy Zhang
R Adams
R Li
R Tewhey
RC Edgar
RG Amado
Robert C. Fleischer
S Jones
S Krishnakumar
Susan Grimes
T Sjoblom
TD Harris
Y Liu
Publication venue: Public Library of Science
Publication date: 01/01/2011

We have developed an integrated strategy for targeted resequencing and analysis of gene subsets from the human exome for variants. Our capture technology is geared towards resequencing gene subsets substantially larger than can be done efficiently with simplex or multiplex PCR but smaller in scale than exome sequencing. We describe all the steps from the initial capture assay to single nucleotide variant (SNV) discovery. The capture methodology uses in-solution 80-mer oligonucleotides. To provide optimal flexibility in choosing human gene targets, we designed an in silico set of oligonucleotides, the Human OligoExome, that covers the gene exons annotated by the Consensus Coding Sequence (CCDS) project. This resource is openly available as an Internet-accessible database where one can download capture oligonucleotide sequences for any CCDS gene and design custom capture assays. Using this resource, we demonstrated the flexibility of this assay by custom designing capture assays ranging from 10 to over 100 gene targets with total capture sizes from over 100 kilobases to nearly one megabase. We established a method to reduce capture variability and incorporated indexing schemes to increase sample throughput. Our approach has multiple applications that include but are not limited to population targeted resequencing studies of specific gene subsets, validation of variants discovered in whole genome sequencing surveys, and possible diagnostic analysis of disease gene subsets. We also present a cost analysis demonstrating its cost-effectiveness for large population studies.

CiteSeerX

Directory of Open Access Journals