Search CORE

13 research outputs found

Analysis of protein-coding genetic variation in 60,706 humans

Author: Altshuler DM
Ardissino D
Banks E
Berghout J
Birnbaum DP
Boehnke M
Cooper DN
Cummings BB
Daly MJ
Danesh J
Deflaux N
DePristo M
Do R
Donnelly S
Duncan LE
Elosua R
Estrada K
Exome Aggregation Consortium
Fennell T
Flannick J
Florez JC
Fromer M
Gabriel SB
Gauthier L
Getz G
Glatt SJ
Goldstein J
Gupta N
Hill AJ
Howrigan D
Hultman CM
Karczewski KJ
Kathiresan S
Kiezun A
Kosmicki JA
Kurki MI
Laakso M
Lek M
MacArthur DG
McCarroll S
McCarthy MI
McGovern D
McPherson R
Minikel EV
Moonshine AL
Natarajan P
Neale BM
O'Donnell-Luria AH
Orozco L
Palotie A
Peloso GM
Pierce-Hoffman E
Poplin R
Purcell SM
Rivas MA
Rose SA
Ruano-Rubio V
Ruderfer DM
Saleheen D
Samocha KE
Scharf JM
Shakir K
Sklar P
Stenson PD
Stevens C
Sullivan PF
Thomas BP
Tiao G
Tsuang MT
Tukiainen T
Tuomilehto J
Tusie-Luna MT
Ware JS
Watkins HC
Weisburd B
Wilson JG
Won HH
Yu D
Zhao F
Zou J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/06/2016
Field of study

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Analysis of protein-coding genetic variation in 60,706 humans

Author: Abboud
Abecasis
Aguilar-Salinas
Altshuler David M.
Ardissino Diego
Arellano-Campos
Atzmon
Aukrust
Banks Eric
Barr
Bell
Bergen
Berghout Joanne
Birnbaum Daniel P.
Bjørkhaug
Blangero
Boehnke Michael
Bowden
Budman
Burtt
Centeno-Cruz
Chambers
Chambert
Clarke
Collins
Cooper David N.
Coppola
Cortes
Cox
Cummings Beryl B.
Córdova
Daly Mark J.
Danesh John
Deflaux Nicole
DePristo Mark
Do Ron
Donnelly Stacey
Duggirala
Duncan Laramie E.
Elosua Roberto
Estrada Karol
Farrall
Fennell Timothy
Fernandez-Lopez
Flannick Jason
Florez Jose C.
Fontanillas
Frayling
Freimer
Fromer Menachem
Fuchsberger
Gabriel Stacey B.
García-Ortiz
Gauthier Laura
Getz Gad
Glatt Stephen J.
Goel
Goldstein Jackie
González-Villalpando
González-Villalpando
Grados
Groop
Gupta Namrata
Gómez-Vázquez
Haiman
Hanis
Hattersley
Henderson
Hill Andrew J.
Hopewell
Howrigan Daniel
Huerta-Chagoya
Hultman Christina M.
Islas-Andrade
Jacobs
Jalilzadeh
Jenkinson
Jiménez-Morale
Karczewski Konrad J.
Kathiresan Sekar
Kiezun Adam
King
Kirov
Kooner
Kosmicki Jack A.
Kurki Mitja I.
Kyriakou
Kähler
Laakso Markku
Lee
Lehman
Lek Monkol
Lyon
MacArthur Daniel G.
MacMahon
Magnusson
Mahajan
Marrugat
Martínez-Hernández
Mathews
McCarroll Steven
McCarthy Mark I.
McGovern Dermot
McPherson Ruth
McVean
Meigs
Meitinger
Mendoza-Caamal
Mercader
Minikel Eric V.
Mohlke
Moonshine Ami Levy
Moran
Moreno-Macías
Morris
Najmi
Natarajan Pradeep
Neale Benjamin M.
Njølstad
O'Donnell-Luria Anne H.
O'Donovan
Ordóñez-Sánchez
Orozco Lorena
Owen
Palotie Aarno
Park
Pauls
Peloso Gina M.
Pierce-Hoffman Emma
Poplin Ryan
Posthuma
Purcell Shaun M.
Revilla-Monsalve
Riba
Ripke
Rivas Manuel A.
Rodríguez-Guillén
Rodríguez-Torres
Rose Samuel A.
Ruano-Rubio Valentin
Ruderfer Douglas M.
Saleheen Danish
Samocha Kaitlin E.
Sandor
Scharf Jeremiah M.
Seielstad
Shakir Khalid
Sklar Pamela
Sladek
Soberón
Spector
Stenson Peter D.
Stevens Christine
Sullivan Patrick F.
Tai
Teslovich
Thomas Brett P.
Tiao Grace
Tsuang Ming T.
Tukiainen Taru
Tuomilehto Jaakko
Tusie-Luna Maria T.
Walford
Ware James S.
Watkins Hugh C.
Weisburd Ben
Wilkens
Williams
Wilson James G.
Won Hong-Hee
Yu Dongmei
Zhao Fengmei
Zou James
Publication venue
Publication date: 01/01/2016
Field of study

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. We describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of truncating variants with 72% having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human “knockout” variants in protein-coding genes

Carolina Digital Repository

Analysis of protein-coding genetic variation in 60,706 humans

Author: A Freischmidt
A Piton
Aarno Palotie
Adam Kiezun
Ami Levy Moonshine
Andrew J. Hill
Anne H. O’Donnell-Luria
B Vicoso
Ben Weisburd
Benjamin M. Neale
Beryl B. Cummings
BF Voight
Brett P. Thomas
Christina M. Hultman
Christine Stevens
CJ Bell
Daniel G. MacArthur
Daniel Howrigan
Daniel P. Birnbaum
Danish Saleheen
David M. Altshuler
David N. Cooper
Dermot McGovern
DF Gudbjartsson
DG MacArthur
DG MacArthur
Diego Ardissino
DN Cooper
Dongmei Yu
Douglas M. Ruderfer
Emma Pierce-Hoffman
Eric Banks
Eric V. Minikel
ET Lim
EV Minikel
FE Dewey
Fengmei Zhao
Gad Getz
Gina M. Peloso
Grace Tiao
H Jeong
H Li
Hong-Hee Won
Hugh C. Watkins
JA Tennessen
Jaakko Tuomilehto
Jack A. Kosmicki
Jackie Goldstein
James G. Wilson
James S. Ware
James Zou
Jason Flannick
Jeremiah M. Scharf
JM Zook
Joanne Berghout
John Danesh
Jose C. Florez
JX Chong
K-I Goh
Kaitlin E. Samocha
Karol Estrada
KE Samocha
Khalid Shakir
Konrad J. Karczewski
Laramie E. Duncan
Laura Gauthier
Lorena Orozco
M Fromer
M Stoneking
MA DePristo
Manuel A. Rivas
Maria T. Tusie-Luna
Mark DePristo
Mark I. McCarthy
Mark J. Daly
Markku Laakso
Menachem Fromer
Michael Boehnke
Ming T. Tsuang
Mitja I. Kurki
MJ Bamshad
Monkol Lek
Namrata Gupta
Nicole Deflaux
P Chagnon
P Sulem
Pamela Sklar
Patrick F. Sullivan
PD Stenson
Peter D. Stenson
Pradeep Natarajan
R Blekhman
Roberto Elosua
Ron Do
Ruth McPherson
Ryan Poplin
S Kathiresan
S Petrovski
S Richards
Samuel A. Rose
Sekar Kathiresan
Shaun M. Purcell
Stacey B. Gabriel
Stacey Donnelly
Stephen J. Glatt
Steven McCarroll
T Rolland
Taru Tukiainen
Timothy Fennell
Valentin Ruano-Rubio
W Fu
Y Itan
Y Xue
Publication venue
Publication date: 01/01/2016
Field of study

Crossref

VU Research Portal

Online Research @ Cardiff

The Jackson Laboratory: The Mouseion at the JAXlibrary

Harvard University - DASH

PubMed Central

eScholarship - University of California

Oxford University Research Archive

UPF Digital Repository

Helsingin yliopiston digitaalinen arkisto

Apollo (Cambridge)

Demonstrating paths for unlocking the value of cloud genomics through cross cohort analysis

Author: Alexander G. Bick
Anjene Musick
Anthony A. Philippakis
Chris Lunt
Dan M. Roden
David Glazer
Henry Robert Condon
Joshua C. Denny
Kelsey Mayo
Margaret Sunitha Selvaraj
Mark Effingham
Melissa A. Basford
Naomi Allen
Nicole Deflaux
Pradeep Natarajan
Rory Collins
Sara Haidermota
Publication venue: Nature Portfolio
Publication date: 01/09/2023
Field of study

Abstract Recently, large scale genomic projects such as All of Us and the UK Biobank have introduced a new research paradigm where data are stored centrally in cloud-based Trusted Research Environments (TREs). To characterize the advantages and drawbacks of different TRE attributes in facilitating cross-cohort analysis, we conduct a Genome-Wide Association Study of standard lipid measures using two approaches: meta-analysis and pooled analysis. Comparison of full summary data from both approaches with an external study shows strong correlation of known loci with lipid levels (R2 ~ 83–97%). Importantly, 90 variants meet the significance threshold only in the meta-analysis and 64 variants are significant only in pooled analysis, with approximately 20% of variants in each of those groups being most prevalent in non-European, non-Asian ancestry individuals. These findings have important implications, as technical and policy choices lead to cross-cohort analyses generating similar, but not identical results, particularly for non-European ancestral populations

Directory of Open Access Journals

Analysis of protein-coding genetic variation in 60,706 humans

Author: Altshuler DM
Ardissino D
Banks E
Berghout J
Birnbaum DP
Boehnke M
Cooper DN
Cummings BB
Daly MJ
Danesh J
Deflaux N
DePristo M
Do R
Donnelly S
Duncan LE
Elosua R
Estrada K
Exome Aggregation Consortium
Fennell T
Flannick J
Florez JC
Fromer M
Gabriel SB
Gauthier L
Getz G
Glatt SJ
Goldstein J
Gupta N
Hill AJ
Howrigan D
Hultman CM
Karczewski KJ
Kathiresan S
Kiezun A
Kosmicki JA
Kurki MI
Laakso M
Lek M
MacArthur DG
McCarroll S
McCarthy MI
McGovern D
McPherson R
Minikel EV
Moonshine AL
Natarajan P
Neale BM
O'Donnell-Luria AH
Orozco L
Palotie A
Peloso GM
Pierce-Hoffman E
Poplin R
Purcell SM
Rivas MA
Rose SA
Ruano-Rubio V
Ruderfer DM
Saleheen D
Samocha KE
Scharf JM
Shakir K
Sklar P
Stenson PD
Stevens C
Sullivan PF
Thomas BP
Tiao G
Tsuang MT
Tukiainen T
Tuomilehto J
Tusie-Luna MT
Ware JS
Watkins HC
Weisburd B
Wilson JG
Won HH
Yu D
Zhao F
Zou J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Systematic analysis of challenge-driven improvements in molecular prognostic models for breast cancer

Author: Aparicio S
Bilal E
Borresen-Dale A
Caldas C
Citro C
Curtis C
Deflaux NA
Friend SH
Furia MD
Guinney J
Hellerstein J
Hoff B
Huang E
Kellen MR
Kristensen VN
Mangravite LM
Margolin AA
Mecham BH
Norman TC
Ottestad L
Park D
Pirtle T
Rueda OM
Russnes HG
Sauerwine B
Schildwachter X
Stolovitzky G
Vang VO
Vollan HKM
Youseff L
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 01/01/2013
Field of study

Although molecular prognostics in breast cancer are among the most successful examples of translating genomic analysis to clinical applications, optimal approaches to breast cancer clinical risk prediction remain controversial. The Sage Bionetworks-DREAM Breast Cancer Prognosis Challenge (BCC) is a crowdsourced research study for breast cancer prognostic modeling using genome-scale data. The BCC provided a community of data analysts with a common platform for data access and blinded evaluation of model accuracy in predicting breast cancer survival on the basis of gene expression data, copy number data, and clinical covariates. This approach offered the opportunity to assess whether a crowdsourced community Challenge would generate models of breast cancer prognosis commensurate with or exceeding current best-in-class approaches. The BCC comprised multiple rounds of blinded evaluations on held-out portions of data on 1981 patients, resulting in more than 1400 models submitted as open source code. Participants then retrained their models on the full data set of 1981 samples and submitted up to five models for validation in a newly generated data set of 184 breast cancer patients. Analysis of the BCC results suggests that the best-performing modeling strategy outperformed previously reported methods in blinded evaluations; model performance was consistent across several independent evaluations; and aggregating community-developed models achieved performance on par with the best-performing individual models. Copyright 2013 by the American Association for the Advancement of Science; all rights reserve

PubMed Central

The Novartis Repository