Analysis of protein-coding genetic variation in 60,706 humans

Author: Altshuler DM
Ardissino D
Banks E
Berghout J
Birnbaum DP
Boehnke M
Cooper DN
Cummings BB
Daly MJ
Danesh J
Deflaux N
DePristo M
Do R
Donnelly S
Duncan LE
Elosua R
Estrada K
Exome Aggregation Consortium
Fennell T
Flannick J
Florez JC
Fromer M
Gabriel SB
Gauthier L
Getz G
Glatt SJ
Goldstein J
Gupta N
Hill AJ
Howrigan D
Hultman CM
Karczewski KJ
Kathiresan S
Kiezun A
Kosmicki JA
Kurki MI
Laakso M
Lek M
MacArthur DG
McCarroll S
McCarthy MI
McGovern D
McPherson R
Minikel EV
Moonshine AL
Natarajan P
Neale BM
O'Donnell-Luria AH
Orozco L
Palotie A
Peloso GM
Pierce-Hoffman E
Poplin R
Purcell SM
Rivas MA
Rose SA
Ruano-Rubio V
Ruderfer DM
Saleheen D
Samocha KE
Scharf JM
Shakir K
Sklar P
Stenson PD
Stevens C
Sullivan PF
Thomas BP
Tiao G
Tsuang MT
Tukiainen T
Tuomilehto J
Tusie-Luna MT
Ware JS
Watkins HC
Weisburd B
Wilson JG
Won HH
Yu D
Zhao F
Zou J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/06/2016

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

Oxford University Research Archive

Spiral - Imperial College Digital Repository

The COMBREX Project: Design, Methodology, and Initial Results

Author: Allen Benjamin
Anton Brian P.
Bateman Alex
Bhagwat Ashok S.
Blumenthal Robert M.
Bollinger J. Martin
Brenner Steven E.
Brown Peter J.
Chang Woo-Suk
Choi Han-Pil
Columbus Linda
Crécy-Lagard Valérie de
DeLisi Charles
Faller Lina L.
Ferguson Donald
Ferrer Manuel
Fomenkov Alexey
Friedberg Iddo
Gadda Giovanni
Galperin Michael Y.
Gobeill Julien
Greiner Russell
Guleria Jyotsna
Haft Daniel
Horn David
Housman Genevieve
Hu Jie
Hu Zhenjun
Hunt John
Karp Peter
Kasif Simon
Klimke William
Klitgord Niels
Krebs Carsten
Letovsky Stanley
Levy-Moonshine Ami
Macelis Dana
Madupu Ramana
Maksad Almaz
Mark McGettrick
Martín María J.
Mazumdar Varun
Miller Jeffrey H.
Monahan Caitlin
Morgan Richard D.
Osmani Lais
Osterman Andrei L.
O’Donovan Claire
Palsson Bernhard
Plata Germán
Pokrzywa Revonda
Rachlin John
Roberts Richard J.
Rochussen Krista
Rodionov Dmitry A.
Rodionova Irina A.
Ruch Patrick
Rudd Kenneth E.
Salzberg Steven L.
Segre Daniel
Setterdahl Aaron
Sjölander Kimmen
Spain James
Steffen Martin
Sutton Granger
Swaminathan Rajeswari
Söll Dieter
Tao Kevin
Tate John
Tchigvintsev Dmitri
Vitkup Dennis
Xu Shuang-yong
Yakunin Alexander F.
Yi-Chien Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 05/06/2019

© 2013 Brian P. et al. Prior to the “genomic era,” when the acquisition of DNA sequence involved significant labor and expense, the sequencing of genes was strongly linked to the experimental characterization of their products. Sequencing at that time directly resulted from the need to understand an experimentally determined phenotype or biochemical activity. Now that DNA sequencing has become orders of magnitude faster and less expensive, focus has shifted to sequencing entire genomes. Since biochemistry and genetics have not, by and large, enjoyed the same improvement of scale, public sequence repositories now predominantly contain putative protein sequences for which there is no direct experimental evidence of function. Computational approaches attempt to leverage evidence associated with the ever-smaller fraction of experimentally analyzed proteins to predict function for these putative proteins. Maximizing our understanding of function over the universe of proteins in toto requires not only robust computational methods of inference but also a judicious allocation of experimental resources, focusing on proteins whose experimental characterization will maximize the number and accuracy of follow-on predictions. COMBREX is funded by a GO grant from the National Institute of General Medical Sciences (NIGMS) (1RC2GM092602-01). Peer Reviewed

Digital.CSIC

Analysis of protein-coding genetic variation in 60,706 humans

Author: Abboud
Abecasis
Aguilar-Salinas
Altshuler David M.
Ardissino Diego
Arellano-Campos
Atzmon
Aukrust
Banks Eric
Barr
Bell
Bergen
Berghout Joanne
Birnbaum Daniel P.
Bjørkhaug
Blangero
Boehnke Michael
Bowden
Budman
Burtt
Centeno-Cruz
Chambers
Chambert
Clarke
Collins
Cooper David N.
Coppola
Cortes
Cox
Cummings Beryl B.
Córdova
Daly Mark J.
Danesh John
Deflaux Nicole
DePristo Mark
Do Ron
Donnelly Stacey
Duggirala
Duncan Laramie E.
Elosua Roberto
Estrada Karol
Farrall
Fennell Timothy
Fernandez-Lopez
Flannick Jason
Florez Jose C.
Fontanillas
Frayling
Freimer
Fromer Menachem
Fuchsberger
Gabriel Stacey B.
García-Ortiz
Gauthier Laura
Getz Gad
Glatt Stephen J.
Goel
Goldstein Jackie
González-Villalpando
González-Villalpando
Grados
Groop
Gupta Namrata
Gómez-Vázquez
Haiman
Hanis
Hattersley
Henderson
Hill Andrew J.
Hopewell
Howrigan Daniel
Huerta-Chagoya
Hultman Christina M.
Islas-Andrade
Jacobs
Jalilzadeh
Jenkinson
Jiménez-Morales
Karczewski Konrad J.
Kathiresan Sekar
Kiezun Adam
King
Kirov
Kooner
Kosmicki Jack A.
Kurki Mitja I.
Kyriakou
Kähler
Laakso Markku
Lee
Lehman
Lek Monkol
Lyon
MacArthur Daniel G.
MacMahon
Magnusson
Mahajan
Marrugat
Martínez-Hernández
Mathews
McCarroll Steven
McCarthy Mark I.
McGovern Dermot
McPherson Ruth
McVean
Meigs
Meitinger
Mendoza-Caamal
Mercader
Minikel Eric V.
Mohlke
Moonshine Ami Levy
Moran
Moreno-Macías
Morris
Najmi
Natarajan Pradeep
Neale Benjamin M.
Njølstad
O'Donnell-Luria Anne H.
O'Donovan
Ordóñez-Sánchez
Orozco Lorena
Owen
Palotie Aarno
Park
Pauls
Peloso Gina M.
Pierce-Hoffman Emma
Poplin Ryan
Posthuma
Purcell Shaun M.
Revilla-Monsalve
Riba
Ripke
Rivas Manuel A.
Rodríguez-Guillén
Rodríguez-Torres
Rose Samuel A.
Ruano-Rubio Valentin
Ruderfer Douglas M.
Saleheen Danish
Samocha Kaitlin E.
Sandor
Scharf Jeremiah M.
Seielstad
Shakir Khalid
Sklar Pamela
Sladek
Soberón
Spector
Stenson Peter D.
Stevens Christine
Sullivan Patrick F.
Tai
Teslovich
Thomas Brett P.
Tiao Grace
Tsuang Ming T.
Tukiainen Taru
Tuomilehto Jaakko
Tusie-Luna Maria T.
Walford
Ware James S.
Watkins Hugh C.
Weisburd Ben
Wilkens
Williams
Wilson James G.
Won Hong-Hee
Yu Dongmei
Zhao Fengmei
Zou James
Publication date: 01/01/2016

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. We describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of truncating variants, with 72% having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human “knockout” variants in protein-coding genes.

Carolina Digital Repository