Search CORE

68 research outputs found

Author Correction: Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes.

Author: Alföldi J
Cummings BB
Francioli LC
Gauthier LD
Genome Aggregation Database Consortium
Genome Aggregation Database Production Team
Hill AJ
Karczewski KJ
MacArthur DG
O'Donnell-Luria AH
Pierce-Hoffman E
Wang Q
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2021
Field of study

Spiral - Imperial College Digital Repository

Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes.

Author: Alföldi J
Cummings BB
Francioli LC
Gauthier LD
Genome Aggregation Database Consortium
Genome Aggregation Database Production Team
Hill AJ
Karczewski KJ
MacArthur DG
O'Donnell-Luria AH
Pierce-Hoffman E
Wang Q
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/09/2019
Field of study

Multi-nucleotide variants (MNVs), defined as two or more nearby variants existing on the same haplotype in an individual, are a clinically and biologically important class of genetic variation. However, existing tools typically do not accurately classify MNVs, and understanding of their mutational origins remains limited. Here, we systematically survey MNVs in 125,748 whole exomes and 15,708 whole genomes from the Genome Aggregation Database (gnomAD). We identify 1,792,248 MNVs across the genome with constituent variants falling within 2 bp distance of one another, including 18,756 variants with a novel combined effect on protein sequence. Finally, we estimate the relative impact of known mutational mechanisms - CpG deamination, replication error by polymerase zeta, and polymerase slippage at repeat junctions - on the generation of MNVs. Our results demonstrate the value of haplotype-aware variant annotation, and refine our understanding of genome-wide mutational mechanisms of MNVs

Spiral - Imperial College Digital Repository

A framework for the detection of de novo mutations in family-based sequencing data

Author: A Hodgkinson
A Kong
A McKenna
A Ramu
Benjamin M Neale
BM Neale
CA Brownstein
D Earl
DF Conrad
ED Gamsiz
Eric Banks
Genome of the Netherlands Consortium
H Li
H Li
JA Veltman
JJ Michaelson
Kaitlin E Samocha
Kiran V Garimella
Laurent C Francioli
LC Francioli
MA DePristo
Mark A DePristo
Mark J Daly
Menachem Fromer
Mircea Cretu-Stancu
MW Nachman
Paul IW de Bakker
Q Wei
The 1000 Genomes Consortium
Wigard P Kloosterman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Francioli LC, Cretu-Stancu M, Garimella KV, et al. A framework for the detection of de novo mutations in family-based sequencing data. European Journal of Human Genetics. 2016;25(2):227-233

Crossref

PubMed Central

Publications at Bielefeld University

Utrecht University Repository

WGS-based telomere length analysis in Dutch family trios implicates stronger maternal inheritance and a role for RRM1 gene

Author: Amin Najaf
Arakelyan A
Duijn Cornelia
Elbers CC
Estrada Gil Karol
Francioli LC
Hofman Bert
Isaacs Aaron
Karssen Lennart
Kayser Manfred
Koval Slavik
Leeuwen Elisa
Medina Gomez Maria
Menelaou A
Nersisyan L
Nikoghosyan M
Oostra Ben
Oven Mannis
Pulit SL
Rivadeneira Fernando
Uitterlinden André
van Ommen GJB
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

EUR Research Repository

Mapping and phasing of structural variation in patient genomes using nanopore sequencing

Author: 1000 Genomes Project Consortium
A McKenna
A Tarasov
C Alkan
C Chiang
C Gilissen
C Gilissen
C Redin
C Yang
D Deamer
DF Conrad
DM Church
EA Ashley
EA Ashley
ET Lam
H Li
J Huddleston
J-S Seo
JR Lupski
JY Hehir-Kwa
K Ye
LC Francioli
M Jain
M Patterson
M Pendleton
MJP Chaisson
MS Pagter de
NJ Loman
O Corradin
P Stankiewicz
PH Sudmant
R Hubley
R Tewhey
RC Edgar
RM Layer
S Goodwin
S Middelkamp
SB Ng
SM Kiełbasa
T Marschall
T Rausch
T Zhou
V Boeva
WP Kloosterman
X Chen
Y Mostovoy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Despite improvements in genomics technology, the detection of structural variants (SVs) from short-read sequencing still poses challenges, particularly for complex variation. Here we analyse the genomes of two patients with congenital abnormalities using the MinION nanopore sequencer and a novel computational pipeline—NanoSV. We demonstrate that nanopore long reads are superior to short reads with regard to detection of de novo chromothripsis rearrangements. The long reads also enable efficient phasing of genetic variations, which we leveraged to determine the parental origin of all de novo chromothripsis breakpoints and to resolve the structure of these complex rearrangements. Additionally, genome-wide surveillance of inherited SVs reveals novel variants, missed in short-read data sets, a large proportion of which are retrotransposon insertions. We provide a first exploration of patient genome sequencing with a nanopore sequencer and demonstrate the value of long-read sequencing in mapping and phasing of SVs for both clinical and research applications

Crossref

Harvard University - DASH

Directory of Open Access Journals

Kent Academic Repository

Utrecht University Repository

MPG.PuRe

Institutional Research Information System University of Turin

Large scale variation in the rate of germ-line de novo mutation, base composition, divergence and diversity in humans

Author: A Eyre-Walker
A Eyre-Walker
A Eyre-Walker
A Eyre-Walker
A Hodgkinson
A Hodgkinson
A Kong
A Kong
Adam Eyre-Walker
B Arbeithuber
B Paten
B Schuster-Bockler
C Seoighe
C TEP
DF Conrad
DL Bodian
E Kenigsberg
F Chiaromonte
F Pratto
F Supek
G Bernardi
G Bernardi
G McVicker
GP Holmquist
H Jonsson
I Hellmann
I Hellmann
J Filipski
J Filipski
J Meunier
JB Haldane
JC Dohm
JJ Cai
JJ Michaelson
K Harris
K Harris
K Wolfe
KE Lohmueller
KH Wolfe
L Duret
L Duret
LC Francioli
M Blanchette
MJ Lercher
MW Nachman
NV Terekhanova
P Moorjani
P Polak
Peter F. Arndt
R Burgess
RE Thurman
RS Hansen
S Besenbacher
S Glemin
S Katzman
S Tyekucheva
Shamil R. Sunyaev
Thomas C. A. Smith
TI Gossmann
TN Phung
V Aggarwala
VM Schaibley
WS Wong
Y Benjamini
YH Woo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/03/2018
Field of study

It has long been suspected that the rate of mutation varies across the human genome at a large scale based on the divergence between humans and other species. However, it is now possible to directly investigate this question using the large number of de novo mutations (DNMs) that have been discovered in humans through the sequencing of trios. We investi- gate a number of questions pertaining to the distribution of mutations using more than 130,000 DNMs from three large datasets. We demonstrate that the amount and pattern of variation differs between datasets at the 1MB and 100KB scales probably as a consequence of differences in sequencing technology and processing. In particular, datasets show differ- ent patterns of correlation to genomic variables such as replication time. Never-the-less there are many commonalities between datasets, which likely represent true patterns. We show that there is variation in the mutation rate at the 100KB, 1MB and 10MB scale that can- not be explained by variation at smaller scales, however the level of this variation is modest at large scales–at the 1MB scale we infer that ~90% of regions have a mutation rate within 50% of the mean. Different types of mutation show similar levels of variation and appear to vary in concert which suggests the pattern of mutation is relatively constant across the genome. We demonstrate that variation in the mutation rate does not generate large-scale variation in GC-content, and hence that mutation bias does not maintain the isochore struc- ture of the human genome. We find that genomic features explain less than 40% of the explainable variance in the rate of DNM. As expected the rate of divergence between spe- cies is correlated to the rate of DNM. However, the correlations are weaker than expected if all the variation in divergence was due to variation in the mutation rate. We provide evidence that this is due the effect of biased gene conversion on the probability that a mutation will become fixed. In contrast to divergence, we find that most of the variation in diversity can be explained by variation in the mutation rate. Finally, we show that the correlation between divergence and DNM density declines as increasingly divergent species are considered

Crossref

ZENODO

Directory of Open Access Journals

Dryad Digital Repository (Duke University)

Electronic Archiving System

Sussex Research Online

MPG.PuRe

FigShare

De novo single-nucleotide and copy number variation in discordant monozygotic twins reveals disease-related genes

Author: A Al-Chalabi
A Cecchinato
A Kong
A McKenna
Alan Pittman
B Bertelsen
BS Petersen
C Lavedan
CD Campbell
Charles Lee
Chengsheng Zhang
D Freed
D Mataix-Cols
D Nickles
D Vitucci
Deborah Hughes
DF Levinson
E Colvert
EA Ehli
EHM Wong
Eliza Cerveira
Elliott Rees
EV Davydov
F Antonacci
F Magne
G Kuhlenbäumer
George Kirov
GM Dal
H Higashida
IA Adzhubei
J Chen
J Dongen van
J Fallon
J Tang
Jamal Nasir
JB Potash
JM Schwarz
John Hardy
K Meltz Steinberg
K Ohi
K Wang
K Wang
Kerra Pearce
L Cai
L Vadlamudi
L Yuan
LC Francioli
M Florio
Mark Kristiansen
ME Ketelaar
Michael Simpson
MJ Lindhurst
MY Dennis
Niranjanan Nirmalananthan
Nirmal Vadgama
P Kumar
Peter De Rijk
Qihui Zhu
R Acuna-Hidalgo
R Hashimoto
R Hilker
R Pamphlett
Robin Murray
RP Ebstein
S Akbarian
S Beicht
S Petrovski
S Schuster
SE Baranzini
SP Robertson
Takeo Yoshikawa
Tomas Fitzgerald
V Labrie
YL Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Recent studies have demonstrated genetic differences between monozygotic (MZ) twins. To test the hypothesis that early post-twinning mutational events associate with phenotypic discordance, we investigated a cohort of 13 twin pairs (n = 26) discordant for various clinical phenotypes using whole-exome sequencing and screened for copy number variation (CNV). We identified a de novo variant in PLCB1, a gene involved in the hydrolysis of lipid phosphorus in milk from dairy cows, associated with lactase non-persistence, and a variant in the mitochondrial complex I gene MT-ND5 associated with amyotrophic lateral sclerosis (ALS). We also found somatic variants in multiple genes (TMEM225B, KBTBD3, TUBGCP4, TFIP11) in another MZ twin pair discordant for ALS. Based on the assumption that discordance between twins could be explained by a common variant with variable penetrance or expressivity, we screened the twin samples for known pathogenic variants that are shared and identified a rare deletion overlapping ARHGAP11B, in the twin pair manifesting with either schizotypal personality disorder or schizophrenia. Parent-offspring trio analysis was implemented for two twin pairs to assess potential association of variants of parental origin with susceptibility to disease. We identified a de novo variant in RASD2 shared by 8-year-old male twins with a suspected diagnosis of autism spectrum disorder (ASD) manifesting as different traits. A de novo CNV duplication was also identified in these twins overlapping CD38, a gene previously implicated in ASD. In twins discordant for Tourette's syndrome, a paternally inherited stop loss variant was detected in AADAC, a known candidate gene for the disorder

Crossref

Online Research @ Cardiff

The Jackson Laboratory: The Mouseion at the JAXlibrary

University of Northampton's Research Explorer

UCL Discovery

Institutional Repository Universiteit Antwerpen

King's Research Portal

St George's Online Research Archive

NECTAR

Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

Author: A Helgason
A Kong
A Telenti
AD Børglum
Ali Syed
Anders D. Børglum
Anders E. Halager
Anders Krogh
Bent Petersen
BJ Stucky
Chen Ye
Christian N. S. Pedersen
Christian Theil Have
Christina M. Hultman
David Westergaard
DF Gudbjartsson
Esben Flindt
Francesco Lescai
G Lunter
GA Van der Auwera
GD Poznik
GM Cooper
H Cao
H Eiberg
H Kupfermann
H Li
H Li
H Li
Hans Eiberg
Hongzhi Cao
J Huddleston
Jacob Malte Jensen
Jakob Grove
Jette Bork-Jensen
Jihua Sun
Johan van Beusekom
Jonas Andreas Sibbesen
Jose M. G. Izarzugaza
JS Seo
JT Simpson
Jun Wang
Junhua Rao
K Katoh
K Tamura
Karsten Kristiansen
Kirstine Belling
KM Steinberg
L Paternoster
Lars Bolund
Lasse Maretty
Laurits Skov
LC Francioli
M Lek
M Nothnagel
M Oven
M Pendleton
MA Eberle
Maria Luisa Matey-Hernandez
Marie Grosjean
MC Frith
Mikkel Heide Schierup
MR Hoehe
Ning Li
Ole Lund
Ole Mors
Oluf Pedersen
P Rice
Palle Villesen
Patrick Sullivan
Peter Løngren
PH Sudmant
PL Auer
R Hubley
R Luo
Rachita Yadav
Ramneek Gupta
Ruiqi Xu
Rune M. Friborg
S Besenbacher
S Deorowicz
S Gnerre
S Liu
S Ripke
SF Altschul
Shengting Li
Shujia Huang
Simon Rasmussen
Siyang Liu
SM Kiełbasa
Stephanie Le Hellard
Søren Besenbacher
Søren Brunak
T Espeseth
T Magocˇ
Thomas D. Als
Thomas Espeseth
Thomas Mailund
Thomas Sicheritz-Pontén
Thorkild I. A. Sørensen
Torben Hansen
VA Schneider
Weijian Ye
WP Kloosterman
WS Wong
Xiaosen Guo
Xun Xu
Yuqi Chang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark

Crossref

Copenhagen University Research Information System

Carolina Digital Repository

Online Research Database In Technology

Author Correction: The mutational constraint spectrum quantified from variation in 141,456 humans

Author: Aggregation G
Alfoldi J
Armean IM
Banks E
Bergelson L
Birnbaum DP
Brand H
Chong JX
Cibulskis K
Collins RL
Connolly KM
Covarrubias M
Cummings BB
Daly MJ
Donnelly S
England EM
Farjoun Y
Ferriera S
Francioli LC
Gabriel S
Ganna A
Gauthier LD
Gentry J
Gupta N
Jeandet T
Kaplan D
Karczewski KJ
Kosmicki JA
Laricchia KM
Lek M
Llanwarne C
MacArthur DG
Minikel EV
Munshi R
Neale BM
Novod S
O'Donnell-Luria AH
Petrillo N
Pierce-Hoffman E
Poterba T
Rhodes D
Roazen D
Ruano-Rubio V
Saltzman A
Samocha KE
Schleicher M
Seaby EG
Seed C
Singer-Berk M
Solomonson M
Soto J
Talkowski ME
Tashman K
Tiao G
Tibbetts K
Tolonen C
Vittal C
Wade G
Walters RK
Wang A
Wang Q
Ware JS
Watts NA
Weisburd B
Whiffin N
Zappala Z
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2021
Field of study

Spiral - Imperial College Digital Repository

A framework for the detection of de novo mutations in family-based sequencing data

Author: Banks E
Cretu-Stancu M
Daly MJ
De Bakker PI
Depristo MA
Francioli LC
Fromer M
Garimella KV
Genome Of The Netherlands Consortium
Kloosterman WP
Neale BM
Palamara PF
Samocha KE
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father’s age at conception and the number of DNMs in female offspring’s X chromosome, consistent with previous literature reports

Oxford University Research Archive