40 research outputs found

    Characterization of structural chromosomal variants by massively parallel sequencing

    Get PDF
    Chromosomal Structural Variation (SV) such as translocations, inversions, deletions, and duplications are rearrangements of one or several DNA molecules. SVs are widespread across the human genome, and each individual carries thousands of SVs of different types and sizes. SV are known to contribute both to phenotypic diversity and disease traits, and are therefore of interest in multiple fields, including rare diseases research, and clinical diagnostics. Herein, we present five studies, focused on the analysis of SV using whole genome sequencing (WGS). The project has increased our knowledge regarding the frequency, structure and mechanisms of formation of structural variants in the human genome. In Paper I, II, and IV, we develop and evaluate software for detection and analysis of SV using WGS data. In Paper II, III and IV, we utilize WGS data to delineate the structure and determine the mechanism of formation of several complex SVs. In Paper II, we compare multiple sequencing technologies, and apply these technologies to solve the structure of three complex chromosomal rearrangements. Lastly, in Paper V, we validate the use of SV calling from WGS as a routine test in rare disease diagnostics. Through these studies, we developed and tested tools suitable for WGS SV analysis in a clinical setting. These tools are now part of the routine clinical pipeline; and many of the tools are used by researchers and clinics around the world

    Mosaic Deletions of Known Genes Explain Skeletal Dysplasias With High and Low Bone Mass

    Get PDF
    Mosaicism, a state in which an individual has two or more genetically distinct populations of cells in the body, can be difficult to detect because of either mild or atypical clinical presentation and limitations in the commonly used detection methods. Knowledge of the role of mosaicism is limited in many skeletal disorders, including osteopathia striata with cranial sclerosis (OSCS) and cleidocranial dysplasia (CCD). We used whole-genome sequencing (WGS) with coverage >40x to identify the genetic causes of disease in two clinically diagnosed patients. In a female patient with OSCS, we identified a mosaic 7-nucleotide frameshift deletion in exon 2 of AMER1, NM_152424.4:c.855_861del:p.(His285Glnfs*7), affecting 8.3% of the WGS reads. In a male patient with CCD, approximately 34% of the WGS reads harbored a 3710-basepair mosaic deletion, NC_000006.11:g.45514471_45518181del, starting in intron 8 of RUNX2 and terminating in the 3 ' untranslated region. Droplet digital polymerase chain reaction was used to validate these deletions and quantify the absolute level of mosaicism in each patient. Although constitutional variants in AMER1 and RUNX2 are a known cause of OSCS and CCD, respectively, the mosaic changes here reported have not been described previously. Our study indicates that mosaicism should be considered in unsolved cases of skeletal dysplasia and should be investigated with comprehensive and sensitive detection methods. (c) 2022 The Authors. JBMR Plus published by Wiley Periodicals LLC on behalf of American Society for Bone and Mineral Research.Peer reviewe

    Rare variants in dynein heavy chain genes in two individuals with situs inversus and developmental dyslexia : a case report

    Get PDF
    Background Developmental dyslexia (DD) is a neurodevelopmental learning disorder with high heritability. A number of candidate susceptibility genes have been identified, some of which are linked to the function of the cilium, an organelle regulating left-right asymmetry development in the embryo. Furthermore, it has been suggested that disrupted left-right asymmetry of the brain may play a role in neurodevelopmental disorders such as DD. However, it is unknown whether there is a common genetic cause to DD and laterality defects or ciliopathies. Case presentation Here, we studied two individuals with co-occurring situs inversus (SI) and DD using whole genome sequencing to identify genetic variants of importance for DD and SI. Individual 1 had primary ciliary dyskinesia (PCD), a rare, autosomal recessive disorder with oto-sino-pulmonary phenotype and SI. We identified two rare nonsynonymous variants in the dynein axonemal heavy chain 5 gene (DNAH5): a previously reported variant c.7502G > C; p.(R2501P), and a novel variant c.12043 T > G; p.(Y4015D). Both variants are predicted to be damaging. Ultrastructural analysis of the cilia revealed a lack of outer dynein arms and normal inner dynein arms. MRI of the brain revealed no significant abnormalities. Individual 2 had non-syndromic SI and DD. In individual 2, one rare variant (c.9110A > G;p.(H3037R)) in the dynein axonemal heavy chain 11 gene (DNAH11), coding for another component of the outer dynein arm, was identified. Conclusions We identified the likely genetic cause of SI and PCD in one individual, and a possibly significant heterozygosity in the other, both involving dynein genes. Given the present evidence, it is unclear if the identified variants also predispose to DD and further studies into the association between laterality, ciliopathies and DD are needed.Peer reviewe

    Feasibility to use whole-genome sequencing as a sole diagnostic method to detect genomic aberrations in pediatric B-cell acute lymphoblastic leukemia

    Get PDF
    IntroductionThe suitability of whole-genome sequencing (WGS) as the sole method to detect clinically relevant genomic aberrations in B-cell acute lymphoblastic leukemia (ALL) was investigated with the aim of replacing current diagnostic methods.MethodsFor this purpose, we assessed the analytical performance of 150 bp paired-end WGS (90x leukemia/30x germline). A set of 88 retrospective B-cell ALL samples were selected to represent established ALL subgroups as well as ALL lacking stratifying markers by standard-of-care (SoC), so-called B-other ALL.ResultsBoth the analysis of paired leukemia/germline (L/N)(n=64) as well as leukemia-only (L-only)(n=88) detected all types of aberrations mandatory in the current ALLTogether trial protocol, i.e., aneuploidies, structural variants, and focal copy-number aberrations. Moreover, comparison to SoC revealed 100% concordance and that all patients had been assigned to the correct genetic subgroup using both approaches. Notably, WGS could allocate 35 out of 39 B-other ALL samples to one of the emerging genetic subgroups considered in the most recent classifications of ALL. We further investigated the impact of high (90x; n=58) vs low (30x; n=30) coverage on the diagnostic yield and observed an equally perfect concordance with SoC; low coverage detected all relevant lesions.DiscussionThe filtration of the WGS findings with a short list of genes recurrently rearranged in ALL was instrumental to extract the clinically relevant information efficiently. Nonetheless, the detection of DUX4 rearrangements required an additional customized analysis, due to multiple copies of this gene embedded in the highly repetitive D4Z4 region. We conclude that the diagnostic performance of WGS as the standalone method was remarkable and allowed detection of all clinically relevant genomic events in the diagnostic setting of B-cell ALL

    Discovery of Novel Sequences in 1,000 Swedish Genomes

    No full text
    Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely unexplored. Here, we utilize de novo assembly to study NS in 1,000 Swedish individuals first sequenced as part of the SweGen project revealing a total of 46 Mb in 61,044 distinct contigs of sequences not present in GRCh38. The contigs were aligned to recently published catalogs of Icelandic and Pan-African NSs, as well as the chimpanzee genome, revealing a great diversity of shared sequences. Analyzing the positioning of NS across the chimpanzee genome, we find that 2,807 NS align confidently within 143 chimpanzee orthologs of human genes. Aligning the whole genome sequencing data to the chimpanzee genome, we discover ancestral NS common throughout the Swedish population. The NSs were searched for repeats and repeat elements: revealing a majority of repetitive sequence (56%), and enrichment of simple repeats (28%) and satellites (15%). Lastly, we align the unmappable reads of a subset of the thousand genomes data to our collection of NS, as well as the previously published Pan-African NS: revealing that both the Swedish and Pan-African NS are widespread, and that the Swedish NSs are largely a subset of the Pan-African NS. Overall, these results highlight the importance of creating a more diverse reference genome and illustrate that significant amounts of the NS may be of ancestral origin

    Hybrid sequencing resolves two germline ultra-complex chromosomal rearrangements consisting of 137 breakpoint junctions in a single carrier

    No full text
    Chromoanagenesis is a genomic event responsible for the formation of complex structural chromosomal rearrangements (CCRs). Germline chromoanagenesis is rare and the majority of reported cases are associated with an affected phenotype. Here, we report a healthy female carrying two de novo CCRs involving chromosomes 4, 19, 21 and X and chromosomes 7 and 11, respectively, with a total of 137 breakpoint junctions (BPJs). We characterized the CCRs using a hybrid-sequencing approach, combining short-read sequencing, nanopore sequencing, and optical mapping. The results were validated using multiple cytogenetic methods, including fluorescence in situ hybridization, spectral karyotyping, and Sanger sequencing. We identified 137 BPJs, which to our knowledge is the highest number of reported breakpoint junctions in germline chromoanagenesis. We also performed a statistical assessment of the positioning of the breakpoints, revealing a significant enrichment of BPJ-affecting genes (96 intragenic BPJs, 26 genes, p < 0.0001), indicating that the CCRs formed during active transcription of these genes. In addition, we find that the DNA fragments are unevenly and non-randomly distributed across the derivative chromosomes indicating a multistep process of scattering and re-joining of DNA fragments. In summary, we report a new maximum number of BPJs (137) in germline chromoanagenesis. We also show that a hybrid sequencing approach is necessary for the correct characterization of complex CCRs. Through in-depth statistical assessment, it was found that the CCRs most likely was formed through an event resembling chromoplexy-a catastrophic event caused by erroneous transcription factor binding

    AMYCNE: Confident copy number assessment using whole genome sequencing data

    No full text
    <div><p>Copy number variations (CNVs) within the human genome have been linked to a diversity of inherited diseases and phenotypic traits. The currently used methodology to measure copy numbers has limited resolution and/or precision, especially for regions with more than 4 copies. Whole genome sequencing (WGS) offers an alternative data source to allow for the detection and characterization of the copy number across different genomic regions in a single experiment. A plethora of tools have been developed to utilize WGS data for CNV detection. None of these tools are designed specifically to accurately estimate copy numbers of complex regions in a small cohort or clinical setting. Herein, we present AMYCNE (automatic modeling functionality for copy number estimation), a CNV analysis tool using WGS data. AMYCNE is multifunctional and performs copy number estimation of complex regions, annotation of VCF files, and CNV detection on individual samples. The performance of AMYCNE was evaluated using <i>AMY1A</i> ddPCR measurements from 86 unrelated individuals. In addition, we validated the accuracy of AMYCNE copy number predictions on two additional genes (<i>FCGR3A</i> and <i>FCGR3B)</i> using datasets available through the 1000 genomes consortium. Finally, we simulated levels of mosaic loss and gain of chromosome X and used this dataset for benchmarking AMYCNE. The results show a high concordance between AMYCNE and ddPCR, validating the use of AMYCNE to measure tandem <i>AMY1</i> repeats with high accuracy. This opens up new possibilities for the use of WGS for accurate copy number determination of other complex regions in the genome in small cohorts or single individuals.</p></div

    Copy number distributions of the AMY genes.

    No full text
    <p>Histograms of the copy number distributions of the three genes in the AMY locus: (a) <i>AMY1</i>, (b) <i>AMY2A</i> and (c) <i>AMY2B</i>.</p

    Transposable element insertions in 1000 Swedish individuals

    No full text
    The majority of rare diseases are genetic, and regardless of advanced high-throughput genomics-based investigations, 60% of patients remain undiagnosed. A major factor limiting our ability to identify disease-causing alterations is a poor understanding of the morbid and normal human genome. A major genomic contributor of which function and distribution remain largely unstudied are the transposable elements (TE), which constitute 50% of our genome. Here we aim to resolve this knowledge gap and increase the diagnostic yield of rare disease patients investigated with clinical genome sequencing. To this end we characterized TE insertions in 1000 Swedish individuals from the SweGen dataset and 2504 individuals from the 1000 Genomes Project (1KGP), creating seven population-specific TE insertion databases. Of note, 66% of TE insertions in SweGen were present at &gt; 1% in the 1KGP databases, proving that most insertions are common across populations. Focusing on the rare TE insertions, we show that even though similar to 0.7% of those insertions affect protein coding genes, they rarely affect known disease casing genes (&lt; 0.1%). Finally, we applied a TE insertion identification workflow on two clinical cases where disease causing TE insertions were suspected and could verify the presence of pathogenic TE insertions in both. Altogether we demonstrate the importance of TE insertion detection and highlight possible clinical implications in rare disease diagnostics
    corecore