Search CORE

139 research outputs found

Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation

Author: Citation Flannick
David Altshuler
David Altshuler
Eric Banks
Eric Banks
George B. Grant
George B. Grant
Jason Flannick
Joshua M. Korn
Joshua M. Korn
Mark A. Depristo
Mark A. Depristo
Pierre Fontanillas
Pierre Fontanillas
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

High coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 1× sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAF <5%), when low coverage sequence reads are added to dense genome-wide SNP arrays — the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of next-generation sequencing in genome wide association studies and supports development of improved methods for genotype calling

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

FigShare

Recommended from our members

Genetic and Computational Identification of a Conserved Bacterial Metabolic Module

Author: Batzoglou Serafim
Boutte Cara C.
Crosson Sean
Flannick Jason A.
Martens Andrew T.
Novak Antal F.
Srinivasan Balaji S.
Viollier Patrick H.
Publication venue
Publication date: 03/01/2024
Field of study

We have experimentally and computationally defined a set of genes that form a conserved metabolic module in the α-proteobacterium Caulobacter crescentus and used this module to illustrate a schema for the propagation of pathway-level annotation across bacterial genera. Applying comprehensive forward and reverse genetic methods and genome-wide transcriptional analysis, we (1) confirmed the presence of genes involved in catabolism of the abundant environmental sugar myo-inositol, (2) defined an operon encoding an ABC-family myo-inositol transmembrane transporter, and (3) identified a novel myo-inositol regulator protein and cis-acting regulatory motif that control expression of genes in this metabolic module. Despite being encoded from non-contiguous loci on the C. crescentus chromosome, these myo-inositol catabolic enzymes and transporter proteins form a tightly linked functional group in a computationally inferred network of protein associations. Primary sequence comparison was not sufficient to confidently extend annotation of all components of this novel metabolic module to related bacterial genera. Consequently, we implemented the Graemlin multiple-network alignment algorithm to generate cross-species predictions of genes involved in myo-inositol transport and catabolism in other α-proteobacteria. Although the chromosomal organization of genes in this functional module varied between species, the upstream regions of genes in this aligned network were enriched for the same palindromic cis-regulatory motif identified experimentally in C. crescentus. Transposon disruption of the operon encoding the computationally predicted ABC myo-inositol transporter of Sinorhizobium meliloti abolished growth on myo-inositol as the sole carbon source, confirming our cross-genera functional prediction. Thus, we have defined regulatory, transport, and catabolic genes and a cis-acting regulatory sequence that form a conserved module required for myo-inositol metabolism in select α-proteobacteria. Moreover, this study describes a forward validation of gene-network alignment, and illustrates a strategy for reliably transferring pathway-level annotation across bacterial species.</p

Knowledge UChicago

Genetic and Computational Identification of a Conserved Bacterial Metabolic Module

Author: A Davidson
A Hottes
A Majumder
A Marchler-Bauer
Andrew T. Martens
Antal F. Novak
B Ely
Balaji S. Srinivasan
BL Turner
BS Srinivasan
Cara C. Boutte
E Krings
EE Fetsch
EJ Mullaney
F Pazos
G Jiang
GB Kiss
GE Crooks
H Barbier-Brygoo
H Kawsar
J Flannick
J Fry
Jason A. Flannick
JW Gober
K Yoshida
K-I Yoshida
K-I Yoshida
L Breiman
M Evinger
M Galbraith
M Kanehisa
M Kanehisa
M Pellegrini
M Thanbichler
MF Roberts
Michael T. Laub
MJ Yebra
N Pobigaylo
P Poole
Patrick H. Viollier
PH Viollier
R Ramaley
S Rossbach
Sean Crosson
Serafim Batzoglou
SV Albers
T Bailey
T Bayes
T Berman
T Berman
TM Finan
VG Tusher
W Anderson
W Anderson
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Archive ouverte UNIGE

Burden of Rare Sarcomere Gene Variants in the Framingham and Jackson Heart Study Cohorts

Author: Altshuler David M.
Aragam Jayashri
Benjamin Emelia J.
Bick Alexander G.
Cheng Susan
DePalma Steven R.
Flannick Jason
Fox Ervin R.
Funke Birgit H.
Gabriel Stacey B.
Gupta Namrata
Herman Daniel S.
Hirschhorn Joel N.
Ito Kaoru
Kathiresan Sekar
Newton-Cheh Christopher
O’Donnell Christopher J.
Parfenov Michael G.
Rehm Heidi L.
Seidman Christine
Seidman J.G.
Taylor Herman A.
Vasan Ramachandran S.
Wilson James G.
Publication venue: The American Society of Human Genetics. Published by Elsevier Inc.
Publication date: 07/09/2012
Field of study

Rare sarcomere protein variants cause dominant hypertrophic and dilated cardiomyopathies. To evaluate whether allelic variants in eight sarcomere genes are associated with cardiac morphology and function in the community, we sequenced 3,600 individuals from the Framingham Heart Study (FHS) and Jackson Heart Study (JHS) cohorts. Out of the total, 11.2% of individuals had one or more rare nonsynonymous sarcomere variants. The prevalence of likely pathogenic sarcomere variants was 0.6%, twice the previous estimates; however, only four of the 22 individuals had clinical manifestations of hypertrophic cardiomyopathy. Rare sarcomere variants were associated with an increased risk for adverse cardiovascular events (hazard ratio: 2.3) in the FHS cohort, suggesting that cardiovascular risk assessment in the general population can benefit from rare variant analysis

Elsevier - Publisher Connector

PubMed Central

Targeted 'Next-Generation' sequencing in anophthalmia and microphthalmia patients confirms SOX2, OTX2 and FOXE3 mutations

Author: A McKenna
Adele Schneider
Anne M Slavotinek
AS Verma
E Lalonde
EA Otto
EF Percin
Elliott H Sherr
G Billingsley
H Li
I Tzoulaki
J Amiel
J Fantes
J Gonzalez-Rodriguez
Jason Flannick
Jiang Li
JR ten Bosch
Leath Tonkin
LM Reis
M Choi
Mani Yahyavi
Nelson Lopez Jimenez
NK Ragge
P Bakrania
SB Ng
SB Ng
SE Calvo
SP Shah
T Glaser
Tanya Bardakjian
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Anophthalmia/microphthalmia (A/M) is caused by mutations in several different transcription factors, but mutations in each causative gene are relatively rare, emphasizing the need for a testing approach that screens multiple genes simultaneously. We used next-generation sequencing to screen 15 A/M patients for mutations in 9 pathogenic genes to evaluate this technology for screening in A/M. Methods We used a pooled sequencing design, together with custom single nucleotide polymorphism (SNP) calling software. We verified predicted sequence alterations using Sanger sequencing. Results We verified three mutations - c.542delC in S<it>OX2</it>, resulting in p.Pro181Argfs*22, p.Glu105X in <it>OTX2 </it>and p.Cys240X in <it>FOXE3</it>. We found several novel sequence alterations and SNPs that were likely to be non-pathogenic - p.Glu42Lys in <it>CRYBA4</it>, p.Val201Met in <it>FOXE3 </it>and p.Asp291Asn in <it>VSX2</it>. Our analysis methodology gave one false positive result comprising a mutation in <it>PAX6 </it>(c.1268A > T, predicting p.X423LeuextX*15) that was not verified by Sanger sequencing. We also failed to detect one 20 base pair (bp) deletion and one 3 bp duplication in <it>SOX2</it>. Conclusions Our results demonstrated the power of next-generation sequencing with pooled sample groups for the rapid screening of candidate genes for A/M as we were correctly able to identify disease-causing mutations. However, next-generation sequencing was less useful for small, intragenic deletions and duplications. We did not find mutations in 10/15 patients and conclude that there is a need for further gene discovery in A/M.</p

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Recommended from our members

Erratum: Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

Author: Abboud Hanna E
Agarwala Vineeta
Balkau Beverley
Barzilai Nir
Beer Nicola L
Below Jennifer E
Blackwell Thomas W
Boeing Heiner
Butterworth Adam S
Carey Jason
Caulkins Lizz
Chen Han
Chen Peng
Chen Yuhui
Chines Peter S
Cingolani Pablo
Danesh John
Day-Williams Aaron G
Dupuis Josee
Ferreira Teresa
Fingerlin Tasha
Flannick Jason
Fuchsberger Christian
Gamazon Eric R
Gaulton Kyle J
Giedraitis Vilmantas
Go Min Jin
Gottesman Omri
Grant George
Grarup Niels
Green Todd
Han Bok-Ghee
Hartl Christopher
Highland Heather M
Horikoshi Momoko
Howson Joanna MM
Hu Cheng
Huang Jinyan
Huh Iksoo
Huyghe Jeroen R
Ikram Mohammad Kamran
Jackson Anne U
Jenkinson Christopher P
Kim Bong-Jo
Kim Yongkang
Kim Young Jin
Koesterer Ryan
Kumar Ashish
Kuulasmaa Teemu
Kuusisto Johanna
Kwak Soo-Heon
Kwan Phoenix
Kwon Min-Seok
Lam Vincent KL
Lee Heung Man
Lee Jaehoon
Lee Juyoung
Lee Selyeong
Lin Keng-Han
Lindgren Cecilia M
Locke Adam E
Lu Yingchang
Ma Clement
Mahajan Anubha
Manning Alisa
Maxwell Taylor J
McCarthy Davis J
Moutsianas Loukas
Müller-Nurasyid Martina
Müller-Nurasyid Martina
Nagai Yoshihiko
Neale Benjamin M
Ng Maggie CY
Palmer Nicholette D
Parker Stephen CJ
Pasko Dorota
Pearson Richard D
Perry John RB
Prabhakaran Dorairaj
Purcell Shaun
Rayner N William
Rivas Manuel A
Robertson Neil R
Scott James
Scott Robert A
Sim Xueling
Smith Joshua D
Stančáková Alena
Stitzel Michael L
Stringham Heather M
Tajes Juan Fernandez
Teslovich Tanya M
van de Bunt Martijn
Varga Tibor V
Voight Benjamin F
Wang Xu
Welch Ryan P
Yoon Joon
Zhang Weihua
Zhao Wei
Publication venue: eScholarship, University of California
Publication date: 01/01/2018
Field of study

This corrects the article DOI: 10.1038/sdata.2017.179

eScholarship - University of California

Recommended from our members

Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes.

Author: Afaq Saima
Afzal Shoaib
Ahlqvist Emma
Almgren Peter
Amin Najaf
An Ping
Bang Lia B
Bertoni Alain G
Bielak Lawrence F
Bombieri Cristina
Bork-Jensen Jette
Brandslund Ivan
Brody Jennifer A
Burtt Noël P
Canouil Mickaël
Chen Yii-Der Ida
Cho Yoon Shin
Christensen Cramer
Chu Audrey Y
Cook James P
de Haan Hugoline G
Demirkan Ayse
Eastwood Sophie V
Eckardt Kai-Uwe
ExomeBP Consortium
Fischer Krista
Flannick Jason
Gambaro Giovanni
Gan Wei
GIANT Consortium
Giedraitis Vilmantas
Graff Marielisa
Grarup Niels
Grove Megan L
Guo Xiuqing
Gustafsson Stefan
Hackinger Sophie
Hai Yang
Han Sohee
Highland Heather M
Hivert Marie-France
Hu Yao
Huo Shaofeng
Isomaa Bo
Jensen Richard A
Justice Anne E
Jäger Susanne
Jørgensen Marit E
Jørgensen Torben
Kim Bong-Jo
Kim Sung Soo
Kim Young Jin
Kitajima Hidetoshi
Koistinen Heikki A
Kovacs Peter
Kravic Jasmina
Kriebel Jennifer
Kronenberg Florian
Käräjämäki Annemari
Lange Leslie A
Lecoeur Cécile
Lee Jung-Jin
Lehne Benjamin
Li Huaixing
Li Jin
Li Man
Li-Gao Ruifang
Ligthart Symen
Lin Keng-Hung
Liu Dajiang J
Lohman Kurt K
Lu Yingchang
Läll Kristi
MAGIC Consortium
Mahajan Anubha
Malerba Giovanni
Marouli Eirini
Marten Jonathan
Meidtner Karina
Müller-Nurasyid Martina
Peloso Gina Marie
Preuss Michael
Prins Bram Peter
Rayner N William
Robertson Neil R
Rybin Denis V
Smith Albert Vernon
Steinthorsdottir Valgerdur
Tajes Juan Fernandez
Taliun Daniel
Trubetskoy Vassily Vladimirovich
Tybjærg-Hansen Anne
Varga Tibor V
Warren Helen R
Wessel Jennifer
Willems Sara M
Wuttke Matthias
Yaghootkar Hanieh
Zhang Weihua
Zhao Wei
Publication venue: eScholarship, University of California
Publication date: 01/04/2018
Field of study

We aggregated coding variant data for 81,412 type 2 diabetes cases and 370,832 controls of diverse ancestry, identifying 40 coding variant association signals (P < 2.2 × 10-7); of these, 16 map outside known risk-associated loci. We make two important observations. First, only five of these signals are driven by low-frequency variants: even for these, effect sizes are modest (odds ratio ≤1.29). Second, when we used large-scale genome-wide association data to fine-map the associated variants in their regional context, accounting for the global enrichment of complex trait associations in coding sequence, compelling evidence for coding variant causality was obtained for only 16 signals. At 13 others, the associated coding variants clearly represent 'false leads' with potential to generate erroneous mechanistic inference. Coding variant associations offer a direct route to biological insight for complex diseases and identification of validated therapeutic targets; however, appropriate mechanistic inference requires careful specification of their causal contribution to disease predisposition

eScholarship - University of California