81 research outputs found

    Protein structure and phenotypic analysis of pathogenic and population missense variants in STXBP1

    Get PDF
    Background: Syntaxin-binding protein 1, encoded by STXBP1, is highly expressed in the brain and involved in fusing synaptic vesicles with the plasma membrane. Studies have shown that pathogenic loss-of-function variants in this gene result in various types of epilepsies, mostly beginning early in life. We were interested to model pathogenic missense variants on the protein structure to investigate the mechanism of pathogenicity and genotype–phenotype correlations. Methods: We report 11 patients with pathogenic de novo mutations in STXBP1 identified in the first 4293 trios of the Deciphering Developmental Disorder (DDD) study, including six missense variants. We analyzed the structural locations of the pathogenic missense variants from this study and the literature, as well as population missense variants extracted from Exome Aggregation Consortium (ExAC). Results: Pathogenic variants are significantly more likely to occur at highly conserved locations than population variants, and be buried inside the protein domain. Pathogenic mutations are also more likely to destabilize the domain structure compared with population variants, increasing the proportion of (partially) unfolded domains that are prone to aggregation or degradation. We were unable to detect any genotype–phenotype correlation, but unlike previously reported cases, most of the DDD patients with STXBP1 pathogenic variants did not present with very early-onset or severe epilepsy and encephalopathy, though all have developmental delay with intellectual disability and most display behavioral problems and suffered seizures in later childhood. Conclusion: Variants across STXBP1 that cause loss of function can result in severe intellectual disability with or without seizures, consistent with a haploinsufficiency mechanism. Pathogenic missense mutations act through destabilization of the protein domain, making it prone to aggregation or degradation. The presence or absence of early seizures may reflect ascertainment bias in the literature as well as the broad recruitment strategy of the DDD study.The DDD study presents independent research commissioned by the Health Innovation Challenge Fund (grant number HICF-1009-003), a parallel funding partnership between the Wellcome Trust and the Department of Health, and the Wellcome Trust Sanger Institute (grant number WT098051)

    Protein structure and phenotypic analysis of pathogenic and population missense variants inSTXBP1.

    Get PDF
    This is the final version of the article. Available from Wiley via the DOI in this record.BACKGROUND: Syntaxin-binding protein 1, encoded bySTXBP1, is highly expressed in the brain and involved in fusing synaptic vesicles with the plasma membrane. Studies have shown that pathogenic loss-of-function variants in this gene result in various types of epilepsies, mostly beginning early in life. We were interested to model pathogenic missense variants on the protein structure to investigate the mechanism of pathogenicity and genotype-phenotype correlations. METHODS: We report 11 patients with pathogenic de novo mutations inSTXBP1identified in the first 4293 trios of the Deciphering Developmental Disorder (DDD) study, including six missense variants. We analyzed the structural locations of the pathogenic missense variants from this study and the literature, as well as population missense variants extracted from Exome Aggregation Consortium (ExAC). RESULTS: Pathogenic variants are significantly more likely to occur at highly conserved locations than population variants, and be buried inside the protein domain. Pathogenic mutations are also more likely to destabilize the domain structure compared with population variants, increasing the proportion of (partially) unfolded domains that are prone to aggregation or degradation. We were unable to detect any genotype-phenotype correlation, but unlike previously reported cases, most of the DDD patients withSTXBP1pathogenic variants did not present with very early-onset or severe epilepsy and encephalopathy, though all have developmental delay with intellectual disability and most display behavioral problems and suffered seizures in later childhood. CONCLUSION: Variants acrossSTXBP1that cause loss of function can result in severe intellectual disability with or without seizures, consistent with a haploinsufficiency mechanism. Pathogenic missense mutations act through destabilization of the protein domain, making it prone to aggregation or degradation. The presence or absence of early seizures may reflect ascertainment bias in the literature as well as the broad recruitment strategy of the DDD study.This study was supported by the Health Innovation Challenge Fund (grant number: HICF-1009-003) and Wellcome Trust Sanger Institute (grant number: WT098051)

    Combining a prioritization strategy and functional studies nominates 5’UTR variants underlying inherited retinal disease

    Get PDF
    BACKGROUND: 5’ untranslated regions (5’UTRs) are essential modulators of protein translation. Predicting the impact of 5’UTR variants is challenging and rarely performed in routine diagnostics. Here, we present a combined approach of a comprehensive prioritization strategy and functional assays to evaluate 5’UTR variation in two large cohorts of patients with inherited retinal diseases (IRDs). METHODS: We performed an isoform-level re-analysis of retinal RNA-seq data to identify the protein-coding transcripts of 378 IRD genes with highest expression in retina. We evaluated the coverage of their 5’UTRs by different whole exome sequencing (WES) kits. The selected 5’UTRs were analyzed in whole genome sequencing (WGS) and WES data from IRD sub-cohorts from the 100,000 Genomes Project (n = 2397 WGS) and an in-house database (n = 1682 WES), respectively. Identified variants were annotated for 5’UTR-relevant features and classified into seven categories based on their predicted functional consequence. We developed a variant prioritization strategy by integrating population frequency, specific criteria for each category, and family and phenotypic data. A selection of candidate variants underwent functional validation using diverse approaches. RESULTS: Isoform-level re-quantification of retinal gene expression revealed 76 IRD genes with a non-canonical retina-enriched isoform, of which 20 display a fully distinct 5’UTR compared to that of their canonical isoform. Depending on the probe design, 3–20% of IRD genes have 5’UTRs fully captured by WES. After analyzing these regions in both cohorts, we prioritized 11 (likely) pathogenic variants in 10 genes (ARL3, MERTK, NDP, NMNAT1, NPHP4, PAX6, PRPF31, PRPF4, RDH12, RD3), of which 7 were novel. Functional analyses further supported the pathogenicity of three variants. Mis-splicing was demonstrated for the PRPF31:c.-9+1G>T variant. The MERTK:c.-125G>A variant, overlapping a transcriptional start site, was shown to significantly reduce both luciferase mRNA levels and activity. The RDH12:c.-123C>T variant was found in cis with the hypomorphic RDH12:c.701G>A (p.Arg234His) variant in 11 patients. This 5’UTR variant, predicted to introduce an upstream open reading frame, was shown to result in reduced RDH12 protein but unaltered mRNA levels. CONCLUSIONS: This study demonstrates the importance of 5’UTR variants implicated in IRDs and provides a systematic approach for 5’UTR annotation and validation that is applicable to other inherited diseases

    Use of genome sequencing to hunt for cryptic second-hit variants: analysis of 31 cases recruited to the 100 000 Genomes Project

    Get PDF
    Background: Current clinical testing methods used to uncover the genetic basis of rare disease have inherent limitations, which can lead to causative pathogenic variants being missed. Within the rare disease arm of the 100 000 Genomes Project (100kGP), families were recruited under the clinical indication ‘single autosomal recessive mutation in rare disease’. These participants presented with strong clinical suspicion for a specific autosomal recessive disorder, but only one suspected pathogenic variant had been identified through standard-of-care testing. Whole genome sequencing (WGS) aimed to identify cryptic ‘second-hit’ variants. Methods: To investigate the 31 families with available data that remained unsolved following formal review within the 100kGP, SVRare was used to aggregate structural variants present in <1% of 100kGP participants. Small variants were assessed using population allele frequency data and SpliceAI. Literature searches and publicly available online tools were used for further annotation of pathogenicity. Results: Using these strategies, 8/31 cases were solved, increasing the overall diagnostic yield of this cohort from 10/41 (24.4%) to 18/41 (43.9%). Exemplar cases include a patient with cystic fibrosis harbouring a novel exonic LINE1 insertion in CFTR and a patient with generalised arterial calcification of infancy with complex interlinked duplications involving exons 2–6 of ENPP1. Although ambiguous by short-read WGS, the ENPP1 variant structure was resolved using optical genome mapping and RNA analysis. Conclusion: Systematic examination of cryptic variants across a multi-disease cohort successfully identifies additional pathogenic variants. WGS data analysis in autosomal recessive rare disease should consider complex structural and small intronic variants as potentially pathogenic second hits

    Targeted Next-Generation Sequencing Analysis of 1,000 Individuals with Intellectual Disability.

    Get PDF
    To identify genetic causes of intellectual disability (ID), we screened a cohort of 986 individuals with moderate to severe ID for variants in 565 known or candidate ID-associated genes using targeted next-generation sequencing. Likely pathogenic rare variants were found in ∼11% of the cases (113 variants in 107/986 individuals: ∼8% of the individuals had a likely pathogenic loss-of-function [LoF] variant, whereas ∼3% had a known pathogenic missense variant). Variants in SETD5, ATRX, CUL4B, MECP2, and ARID1B were the most common causes of ID. This study assessed the value of sequencing a cohort of probands to provide a molecular diagnosis of ID, without the availability of DNA from both parents for de novo sequence analysis. This modeling is clinically relevant as 28% of all UK families with dependent children are single parent households. In conclusion, to diagnose patients with ID in the absence of parental DNA, we recommend investigation of all LoF variants in known genes that cause ID and assessment of a limited list of proven pathogenic missense variants in these genes. This will provide 11% additional diagnostic yield beyond the 10%-15% yield from array CGH alone.Action Medical Research (SP4640); the Birth Defect Foundation (RG45448); the Cambridge National Institute for Health Research Biomedical Research Centre (RG64219); the NIHR Rare Diseases BioResource (RBAG163); Wellcome Trust award WT091310; The Cell lines and DNA bank of Rett Syndrome, X-linked mental retardation and other genetic diseases (member of the Telethon Network of Genetic Biobanks (project no. GTB12001); the Genetic Origins of Congenital Heart Disease Study (GO-CHD)- funded by British Heart Foundation (BHF)This is the final version of the article. It first appeared from Wiley via http://dx.doi.org/10.1002/humu.2290

    Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data.

    Get PDF
    Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclidean distances (mEuD) were employed as a simple measure of similarity of quantitative phenotypic data within sets of ≥10 individuals with plausibly causative de novo mutations (DNM) in 28 different developmental disorder genes. 13/28 (46.4%) showed significant similarity for growth or developmental milestone metrics, 10/28 (35.7%) showed similarity in HPO term usage, and 12/28 (43%) showed no phenotypic similarity. Pairwise comparisons of individuals with high-impact inherited variants to the 32 individuals with causative DNM in ANKRD11 using only growth z-scores highlighted 5 likely causative inherited variants and two unrecognized DNM resulting in an 18% diagnostic uplift for this gene. Using an independent approach, naive Bayes classification of growth and developmental data produced reasonably discriminative models for the 24 DNM genes with sufficiently complete data. An unsupervised naive Bayes classification of 6,993 probands with WES data and sufficient phenotypic information defined 23 in silico syndromes (ISSs) and was used to test a "phenotype first" approach to the discovery of causative genotypes using WES variants strictly filtered on allele frequency, mutation consequence, and evidence of constraint in humans. This highlighted heterozygous de novo nonsynonymous variants in SPTBN2 as causative in three DDD probands

    100,000 Genomes Pilot on Rare-Disease Diagnosis in Health Care — Preliminary Report

    Get PDF
    BACKGROUND: The U.K. 100,000 Genomes Project is in the process of investigating the role of genome sequencing in patients with undiagnosed rare diseases after usual care and the alignment of this research with health care implementation in the U.K. National Health Service. Other parts of this project focus on patients with cancer and infection. METHODS: We conducted a pilot study involving 4660 participants from 2183 families, among whom 161 disorders covering a broad spectrum of rare diseases were present. We collected data on clinical features with the use of Human Phenotype Ontology terms, undertook genome sequencing, applied automated variant prioritization on the basis of applied virtual gene panels and phenotypes, and identified novel pathogenic variants through research analysis. RESULTS: Diagnostic yields varied among family structures and were highest in family trios (both parents and a proband) and families with larger pedigrees. Diagnostic yields were much higher for disorders likely to have a monogenic cause (35%) than for disorders likely to have a complex cause (11%). Diagnostic yields for intellectual disability, hearing disorders, and vision disorders ranged from 40 to 55%. We made genetic diagnoses in 25% of the probands. A total of 14% of the diagnoses were made by means of the combination of research and automated approaches, which was critical for cases in which we found etiologic noncoding, structural, and mitochondrial genome variants and coding variants poorly covered by exome sequencing. Cohortwide burden testing across 57,000 genomes enabled the discovery of three new disease genes and 19 new associations. Of the genetic diagnoses that we made, 25% had immediate ramifications for clinical decision making for the patients or their relatives. CONCLUSIONS: Our pilot study of genome sequencing in a national health care system showed an increase in diagnostic yield across a range of rare diseases. (Funded by the National Institute for Health Research and others.)

    De Novo Truncating Mutations in WASF1 Cause Intellectual Disability with Seizures.

    Get PDF
    Next-generation sequencing has been invaluable in the elucidation of the genetic etiology of many subtypes of intellectual disability in recent years. Here, using exome sequencing and whole-genome sequencing, we identified three de novo truncating mutations in WAS protein family member 1 (WASF1) in five unrelated individuals with moderate to profound intellectual disability with autistic features and seizures. WASF1, also known as WAVE1, is part of the WAVE complex and acts as a mediator between Rac-GTPase and actin to induce actin polymerization. The three mutations connected by Matchmaker Exchange were c.1516C>T (p.Arg506Ter), which occurs in three unrelated individuals, c.1558C>T (p.Gln520Ter), and c.1482delinsGCCAGG (p.Ile494MetfsTer23). All three variants are predicted to partially or fully disrupt the C-terminal actin-binding WCA domain. Functional studies using fibroblast cells from two affected individuals with the c.1516C>T mutation showed a truncated WASF1 and a defect in actin remodeling. This study provides evidence that de novo heterozygous mutations in WASF1 cause a rare form of intellectual disability

    Phenotypic Characterization of EIF2AK4 Mutation Carriers in a Large Cohort of Patients Diagnosed Clinically With Pulmonary Arterial Hypertension.

    Get PDF
    BACKGROUND: Pulmonary arterial hypertension (PAH) is a rare disease with an emerging genetic basis. Heterozygous mutations in the gene encoding the bone morphogenetic protein receptor type 2 (BMPR2) are the commonest genetic cause of PAH, whereas biallelic mutations in the eukaryotic translation initiation factor 2 alpha kinase 4 gene (EIF2AK4) are described in pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis. Here, we determine the frequency of these mutations and define the genotype-phenotype characteristics in a large cohort of patients diagnosed clinically with PAH. METHODS: Whole-genome sequencing was performed on DNA from patients with idiopathic and heritable PAH and with pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis recruited to the National Institute of Health Research BioResource-Rare Diseases study. Heterozygous variants in BMPR2 and biallelic EIF2AK4 variants with a minor allele frequency of <1:10 000 in control data sets and predicted to be deleterious (by combined annotation-dependent depletion, PolyPhen-2, and sorting intolerant from tolerant predictions) were identified as potentially causal. Phenotype data from the time of diagnosis were also captured. RESULTS: Eight hundred sixty-four patients with idiopathic or heritable PAH and 16 with pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis were recruited. Mutations in BMPR2 were identified in 130 patients (14.8%). Biallelic mutations in EIF2AK4 were identified in 5 patients with a clinical diagnosis of pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis. Furthermore, 9 patients with a clinical diagnosis of PAH carried biallelic EIF2AK4 mutations. These patients had a reduced transfer coefficient for carbon monoxide (Kco; 33% [interquartile range, 30%-35%] predicted) and younger age at diagnosis (29 years; interquartile range, 23-38 years) and more interlobular septal thickening and mediastinal lymphadenopathy on computed tomography of the chest compared with patients with PAH without EIF2AK4 mutations. However, radiological assessment alone could not accurately identify biallelic EIF2AK4 mutation carriers. Patients with PAH with biallelic EIF2AK4 mutations had a shorter survival. CONCLUSIONS: Biallelic EIF2AK4 mutations are found in patients classified clinically as having idiopathic and heritable PAH. These patients cannot be identified reliably by computed tomography, but a low Kco and a young age at diagnosis suggests the underlying molecular diagnosis. Genetic testing can identify these misclassified patients, allowing appropriate management and early referral for lung transplantation
    • …
    corecore