55 research outputs found
Chromothripsis in healthy individuals affects multiple protein-coding genes and can result in severe congenital abnormalities in offspring
Chromothripsis represents an extreme class of complex chromosome rearrangements (CCRs) with major effects on chromosomal architecture. Although recent studies have associated chromothripsis with congenital abnormalities, the incidence and pathogenic effects of this phenomenon require further investigation. Here, we analyzed the genomes of three families in which chromothripsis rearrangements were transmitted from a mother to her child. The chromothripsis in the mothers resulted in completely balanced rearrangements involving 8-23 breakpoint junctions across three to five chromosomes. Two mothers did not show any phenotypic abnormalities, although 3-13 protein-coding genes were affected by breakpoints. Unbalanced but stable transmission of a subset of the derivative chromosomes caused apparently de novo complex copy-number changes in two children. This resulted in gene-dosage changes, which are probably responsible for the severe congenital phenotypes of these two children. In contrast, the third child, who has a severe congenital disease, harbored all three chromothripsis chromosomes from his healthy mother, but one of the chromosomes acquired de novo rearrangements leading to copy-number changes. These results show that the human genome can tolerate extreme reshuffling of chromosomal architecture, including breakage of multiple protein-coding genes, without noticeable phenotypic effects. The presence of chromothripsis in healthy individuals affects reproduction and is expected to substantially increase the risk of miscarriages, abortions, and severe congenital disease. © 2015 The American Society of Human Genetics
Partner independent fusion gene detection by multiplexed CRISPR-Cas9 enrichment and long read nanopore sequencing
Fusion genes are hallmarks of various cancer types and important determinants for diagnosis, prognosis and treatment. Fusion gene partner choice and breakpoint-position promiscuity restricts diagnostic detection, even for known and recurrent configurations. Here, we develop FUDGE (FUsion Detection from Gene Enrichment) to accurately and impartially identify fusions. FUDGE couples target-selected and strand-specific CRISPR-Cas9 activity for fusion gene driver enrichment - without prior knowledge of fusion partner or breakpoint-location - to long read nanopore sequencing with the bioinformatics pipeline NanoFG. FUDGE has flexible target-loci choices and enables multiplexed enrichment for simultaneous analysis of several genes in multiple samples in one sequencing run. We observe on-average 665 fold breakpoint-site enrichment and identify nucleotide resolution fusion breakpoints within 2 days. The assay identifies cancer cell line and tumor sample fusions irrespective of partner gene or breakpoint-position. FUDGE is a rapid and versatile fusion detection assay for diagnostic pan-cancer fusion detection
Probabilistic (logic) programming concepts
A multitude of different probabilistic programming languages exists today, all extending a traditional programming language with primitives to support modeling of complex, structured probability distributions. Each of these languages employs its own probabilistic primitives, and comes with a particular syntax, semantics and inference procedure. This makes it hard to understand the underlying programming concepts and appreciate the differences between the different languages. To obtain a better understanding of probabilistic programming, we identify a number of core programming concepts underlying the primitives used by various probabilistic languages, discuss the execution mechanisms that they require and use these to position and survey state-of-the-art probabilistic languages and their implementation. While doing so, we focus on probabilistic extensions of logic programming languages such as Prolog, which have been considered for over 20 years
Destabilized SMC5/6 complex leads to chromosome breakage syndrome with severe lung disease
The structural maintenance of chromosomes (SMC) family of proteins supports mitotic proliferation, meiosis, and DNA repair to control genomic stability. Impairments in chromosome maintenance are linked to rare chromosome breakage disorders. Here, we have identified a chromosome breakage syndrome associated with severe lung disease in early childhood. Four children from two unrelated kindreds died of severe pulmonary disease during infancy following viral pneumonia with evidence of combined T and B cell immunodeficiency. Whole exome sequencing revealed biallelic missense mutations in the NSMCE3 (also known as NDNL2) gene, which encodes a subunit of the SMC5/6 complex that is essential for DNA damage response and chromosome segregation. The NSMCE3 mutations disrupted interactions within the SMC5/6 complex, leading to destabilization of the complex. Patient cells showed chromosome rearrangements, micronuclei, sensitivity to replication stress and DNA damage, and defective homologous recombination. This work associates missense mutations in NSMCE3 with an autosomal recessive chromosome breakage syndrome that leads to defective T and B cell function and acute respiratory distress syndrome in early childhood
WGS-based telomere length analysis in Dutch family trios implicates stronger maternal inheritance and a role for RRM1 gene
Telomere length (TL) regulation is an important factor in ageing, reproduction and cancer development. Genetic, hereditary and environmental factors regulating TL are currently widely investigated, however, their relative contribution to TL variability is still understudied. We have used whole genome sequencing data of 250 family trios from the Genome of the Netherlands project to perform computational measurement of TL and a series of regression and genome-wide association analyses to reveal TL inheritance patterns and associated genetic factors. Our results confirm that TL is a largely heritable trait, primarily with mother’s, and, to a lesser extent, with father’s TL having the strongest influence on the offspring. In this cohort, mother’s, but not father’s age at conception was positively linked to offspring TL. Age-related TL attrition of 40 bp/year had relatively small influence on TL variability. Finally, we have identified TL-associated variations in ribonuclease reductase catalytic subunit M1 (RRM1 gene), which is known to regulate telomere maintenance in yeast. We also highlight the importance of multivariate approach and the limitations of existing tools for the analysis of TL as a polygenic heritable quantitative trait
A high-quality human reference panel reveals the complexity and distribution of genomic structural variants
Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals
A high-quality human reference panel reveals the complexity and distribution of genomic structural variants
Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals
The genomic landscape of balanced cytogenetic abnormalities associated with human congenital anomalies
Despite the clinical significance of balanced chromosomal abnormalities (BCAs), their characterization has largely been restricted to cytogenetic resolution. We explored the landscape of BCAs at nucleotide resolution in 273 subjects with a spectrum of congenital anomalies. Whole-genome sequencing revised 93% of karyotypes and demonstrated complexity that was cryptic to karyotyping in 21% of BCAs, highlighting the limitations of conventional cytogenetic approaches. At least 33.9% of BCAs resulted in gene disruption that likely contributed to the developmental phenotype, 5.2% were associated with pathogenic genomic imbalances, and 7.3% disrupted topologically associated domains (TADs) encompassing known syndromic loci. Remarkably, BCA breakpoints in eight subjects altered a single TAD encompassing MEF2C, a known driver of 5q14.3 microdeletion syndrome, resulting in decreased MEF2C expression. We propose that sequence-level resolution dramatically improves prediction of clinical outcomes for balanced rearrangements and provides insight into new pathogenic mechanisms, such as altered regulation due to changes in chromosome topology
- …