177 research outputs found

    A gene-to-patient approach uplifts novel disease gene discovery and identifies 18 putative novel disease genes

    Get PDF
    PURPOSE: Exome and genome sequencing have drastically accelerated novel disease gene discoveries. However, discovery is still hindered by myriad variants of uncertain significance found in genes of undetermined biological function. This necessitates intensive functional experiments on genes of equal predicted causality, leading to a major bottleneck. METHODS: We apply the loss-of-function observed/expected upper-bound fraction metric of intolerance to gene inactivation to curate a list of predicted haploinsufficient disease genes. Using data from the 100,000 Genomes Project, we adopt a gene-to-patient approach that matches de novo loss-of-function variants in constrained genes to patients with rare disease. Through large-scale aggregation of data, we reduce excess analytical noise currently hindering novel discoveries. RESULTS: Results from 13,949 trios revealed 643 rare, de novo predicted loss-of-function events filtered from 1044 loss-of-function observed/expected upper-bound fraction-constrained genes. A total of 168 variants occurred within 126 genes without a known disease-gene relationship. Of these, 27 genes had >1 kindred affected, and for 18 of these genes, multiple kindreds had overlapping phenotypes. Two years after initial analysis, 11 of 18 (61%) of these genes have been independently published as novel disease gene discoveries. CONCLUSION: Using large cohorts and adopting gene-based approaches can rapidly and objectively accelerate dominantly inherited novel gene discovery by targeting the most appropriate genes for functional validation

    KidneyNetwork: using kidney-derived gene expression data to predict and prioritize novel genes involved in kidney disease

    Get PDF
    Abstract: Genetic testing in patients with suspected hereditary kidney disease may not reveal the genetic cause for the disorder as potentially pathogenic variants can reside in genes that are not yet known to be involved in kidney disease. We have developed KidneyNetwork, that utilizes tissue-specific expression to inform candidate gene prioritization specifically for kidney diseases. KidneyNetwork is a novel method constructed by integrating a kidney RNA-sequencing co-expression network of 878 samples with a multi-tissue network of 31,499 samples. It uses expression patterns and established gene-phenotype associations to predict which genes could be related to what (disease) phenotypes in an unbiased manner. We applied KidneyNetwork to rare variants in exome sequencing data from 13 kidney disease patients without a genetic diagnosis to prioritize candidate genes. KidneyNetwork can accurately predict kidney-specific gene functions and (kidney disease) phenotypes for disease-associated genes. The intersection of prioritized genes with genes carrying rare variants in a patient with kidney and liver cysts identified ALG6 as plausible candidate gene. We strengthen this plausibility by identifying ALG6 variants in several cystic kidney and liver disease cases without alternative genetic explanation. We present KidneyNetwork, a publicly available kidney-specific co-expression network with optimized gene-phenotype predictions for kidney disease phenotypes. We designed an easy-to-use online interface that allows clinicians and researchers to use gene expression and co-regulation data and gene-phenotype connections to accelerate advances in hereditary kidney disease diagnosis and research. Translational statement: Genetic testing in patients with suspected hereditary kidney disease may not reveal the genetic cause for the patient’s disorder. Potentially pathogenic variants can reside in genes not yet known to be involved in kidney disease, making it difficult to interpret the relevance of these variants. This reveals a clear need for methods to predict the phenotypic consequences of genetic variation in an unbiased manner. Here we describe KidneyNetwork, a tool that utilizes tissue-specific expression to predict kidney-specific gene functions. Applying KidneyNetwork to a group of undiagnosed cases identified ALG6 as a candidate gene in cystic kidney and liver disease. In summary, KidneyNetwork can aid the interpretation of genetic variants and can therefore be of value in translational nephrogenetics and help improve the diagnostic yield in kidney disease patients

    Application of ensemble clustering and survival tree analysis for identifying prognostic clinicogenomic features in patients with colorectal cancer from the 100,000 Genomes Project

    Get PDF
    OBJECTIVE: The objective of this study was to employ ensemble clustering and tree-based risk model approaches to identify interactions between clinicogenomic features for colorectal cancer using the 100,000 Genomes Project. RESULTS: Among the 2211 patients with colorectal cancer (mean age of diagnosis: 67.7; 59.7% male), 16.3%, 36.3%, 39.0% and 8.4% had stage 1, 2, 3 and 4 cancers, respectively. Almost every patient had surgery (99.7%), 47.4% had chemotherapy, 7.6% had radiotherapy and 1.4% had immunotherapy. On average, tumour mutational burden (TMB) was 18 mutations/Mb and 34.4%, 31.3% and 25.7% of patients had structural or copy number mutations in KRAS, BRAF and NRAS, respectively. In the fully adjusted Cox model, patients with advanced cancer [stage 3 hazard ratio (HR)  =  3.2; p  <  0.001; stage 4 HR  =  10.2; p  <  0.001] and those who had immunotherapy (HR  =  1.8; p  <  0.04) or radiotherapy (HR  =  1.5; p  <  0.02) treatment had a higher risk of dying. The ensemble clustering approach generated four distinct clusters where patients in cluster 2 had the best survival outcomes (1-year: 98.7%; 2-year: 96.7%; 3-year: 93.0%) while patients in cluster 3 (1-year: 87.9; 2-year: 70.0%; 3-year: 53.1%) had the worst outcomes. Kaplan-Meier analysis and log rank test revealed that the clusters were separated into distinct prognostic groups (p  <  0.0001). Survival tree or recursive partitioning analyses were performed to further explore risk groups within each cluster. Among patients in cluster 2, for example, interactions between cancer stage, grade, radiotherapy, TMB, BRAF mutation status were identified. Patients with stage 4 cancer and TMB  ≥  1.6 mutations/Mb had 4 times higher risk of dying relative to the baseline hazard in that cluster

    Pathogenicity of missense variants affecting the collagen IV α5 carboxy non-collagenous domain in X-linked Alport syndrome.

    Get PDF
    X-linked Alport syndrome is a genetic kidney disease caused by pathogenic COL4A5 variants, but little is known of the consequences of missense variants affecting the NC1 domain of the corresponding collagen IV α5 chain. This study examined these variants in a normal (gnomAD) and other databases (LOVD, Clin Var and 100,000 Genomes Project) to determine their pathogenicity and clinical significance. Males with Cys substitutions in the collagen IV α5 NC1 domain reported in LOVD (n = 25) were examined for typical Alport features, including age at kidney failure. All NC1 variants in LOVD (n = 86) were then assessed for structural damage using an online computational tool, Missense3D. Variants in the ClinVar, gnomAD and 100,000 Genomes Project databases were also examined for structural effects. Predicted damage associated with NC1 substitutions was then correlated with the level of conservation of the affected residues. Cys substitutions in males were associated with the typical features of X-linked Alport syndrome, with a median age at kidney failure of 31 years. NC1 substitutions predicted to cause structural damage were overrepresented in LOVD (p < 0.001), and those affecting Cys residues or 'buried' Gly residues were more common than expected (both p < 0.001). Most NC1 substitutions in gnomAD (88%) were predicted to be structurally-neutral. Substitutions affecting conserved residues resulted in more structural damage than those affecting non-conserved residues (p < 0.001). Many pathogenic missense variants affecting the collagen IV α5 NC1 domain have their effect through molecular structural damage and 3D modelling is a useful tool in their assessment

    Risk-conferring HLA variants in an epilepsy cohort: benefits of multifaceted use of whole genome sequencing in clinical practice

    Get PDF
    BACKGROUND: Whole genome sequencing is increasingly used in healthcare, particularly for diagnostics. However, its clinically multifaceted potential for individually customised diagnostic and therapeutic care remains largely unexploited. We used existing whole genome sequencing data to screen for pharmacogenomic risk factors related to antiseizure medication-induced cutaneous adverse drug reactions (cADRs), such as human leucocyte antigen HLA-B*15:02, HLA-A*31:01 variants. METHODS: Genotyping results, generated from the Genomics England UK 100 000 Genomes Project primarily for identification of disease-causing variants, were used to additionally screen for relevant HLA variants and other pharmacogenomic variants. Medical records were retrospectively reviewed for clinical and cADR phenotypes for HLA variant carriers. Descriptive statistics and the χ2 test were used to analyse phenotype/genotype data for HLA carriers and compare frequencies of additional pharmacogenomic variants between HLA carriers with and without cADRs, respectively. RESULTS: 1043 people with epilepsy were included. Four HLA-B*15:02 and 86 HLA-A*31:01 carriers were identified. One out of the four identified HLA-B*15:02 carriers had suffered antiseizure medication-induced cADRs; the point prevalence of cADRs was 16.9% for HLA-A*31:01 carriers of European origin (n=46) and 14.4% for HLA-A*31:01 carriers irrespective of ancestry (n=83). CONCLUSIONS: Comprehensive utilisation of genetic data spreads beyond the search for causal variants alone and can be extended to additional clinical benefits such as identifying pharmacogenomic biomarkers, which can guide pharmacotherapy for genetically-susceptible individuals

    Spectrum of mutational signatures in T-cell lymphoma reveals a key role for UV radiation in cutaneous T-cell lymphoma

    Get PDF
    T-cell non-Hodgkin's lymphomas develop following transformation of tissue resident T-cells. We performed a meta-analysis of whole exome sequencing data from 403 patients with eight subtypes of T-cell non-Hodgkin's lymphoma to identify mutational signatures and associated recurrent gene mutations. Signature 1, indicative of age-related deamination, was prevalent across all T-cell lymphomas, reflecting the derivation of these malignancies from memory T-cells. Adult T-cell leukemia-lymphoma was specifically associated with signature 17, which was found to correlate with the IRF4 K59R mutation that is exclusive to Adult T-cell leukemia-lymphoma. Signature 7, implicating UV exposure was uniquely identified in cutaneous T-cell lymphoma (CTCL), contributing 52% of the mutational burden in mycosis fungoides and 23% in Sezary syndrome. Importantly this UV signature was observed in CD4 + T-cells isolated from the blood of Sezary syndrome patients suggesting extensive re-circulation of these T-cells through skin and blood. Analysis of non-Hodgkin's T-cell lymphoma cases submitted to the national 100,000 WGS project confirmed that signature 7 was only identified in CTCL strongly implicating UV radiation in the pathogenesis of cutaneous T-cell lymphoma

    Increased COVID-19 mortality rate in rare disease patients: a retrospective cohort study in participants of the Genomics England 100,000 Genomes project

    Get PDF
    BACKGROUND: Several common conditions have been widely recognised as risk factors for COVID-19 related death, but risks borne by people with rare diseases are largely unknown. Therefore, we aim to estimate the difference of risk for people with rare diseases comparing to the unaffected. METHOD: To estimate the correlation between rare diseases and COVID-19 related death, we performed a retrospective cohort study in Genomics England 100k Genomes participants, who tested positive for Sars-Cov-2 during the first wave (16-03-2020 until 31-July-2020) of COVID-19 pandemic in the UK (n = 283). COVID-19 related mortality rates were calculated in two groups: rare disease patients (n = 158) and unaffected relatives (n = 125). Fisher's exact test and logistic regression was used for univariable and multivariable analysis, respectively. RESULTS: People with rare diseases had increased risk of COVID19-related deaths compared to the unaffected relatives (OR [95% CI] = 3.47 [1.21- 12.2]). Although, the effect was insignificant after adjusting for age and number of comorbidities (OR [95% CI] = 1.94 [0.65-5.80]). Neurology and neurodevelopmental diseases was significantly associated with COVID19-related death in both univariable (OR [95% CI] = 4.07 [1.61-10.38]) and multivariable analysis (OR [95% CI] = 4.22 [1.60-11.08]). CONCLUSIONS: Our results showed that rare disease patients, especially ones affected by neurology and neurodevelopmental disorders, in the Genomics England cohort had increased risk of COVID-19 related death during the first wave of the pandemic in UK. The high risk is likely associated with rare diseases themselves, while we cannot rule out possible mediators due to the small sample size. We would like to raise the awareness that rare disease patients may face increased risk for COVID-19 related death. Proper considerations for rare disease patients should be taken when relevant policies (e.g., returning to workplace) are made

    Increased COVID-19 mortality rate in rare disease patients:a retrospective cohort study in participants of the Genomics England 100,000 Genomes project

    Get PDF
    BACKGROUND: Several common conditions have been widely recognised as risk factors for COVID-19 related death, but risks borne by people with rare diseases are largely unknown. Therefore, we aim to estimate the difference of risk for people with rare diseases comparing to the unaffected. METHOD: To estimate the correlation between rare diseases and COVID-19 related death, we performed a retrospective cohort study in Genomics England 100k Genomes participants, who tested positive for Sars-Cov-2 during the first wave (16-03-2020 until 31-July-2020) of COVID-19 pandemic in the UK (n = 283). COVID-19 related mortality rates were calculated in two groups: rare disease patients (n = 158) and unaffected relatives (n = 125). Fisher's exact test and logistic regression was used for univariable and multivariable analysis, respectively. RESULTS: People with rare diseases had increased risk of COVID19-related deaths compared to the unaffected relatives (OR [95% CI] = 3.47 [1.21- 12.2]). Although, the effect was insignificant after adjusting for age and number of comorbidities (OR [95% CI] = 1.94 [0.65-5.80]). Neurology and neurodevelopmental diseases was significantly associated with COVID19-related death in both univariable (OR [95% CI] = 4.07 [1.61-10.38]) and multivariable analysis (OR [95% CI] = 4.22 [1.60-11.08]). CONCLUSIONS: Our results showed that rare disease patients, especially ones affected by neurology and neurodevelopmental disorders, in the Genomics England cohort had increased risk of COVID-19 related death during the first wave of the pandemic in UK. The high risk is likely associated with rare diseases themselves, while we cannot rule out possible mediators due to the small sample size. We would like to raise the awareness that rare disease patients may face increased risk for COVID-19 related death. Proper considerations for rare disease patients should be taken when relevant policies (e.g., returning to workplace) are made

    Late diagnoses of Dravet syndrome: How many individuals are we missing?

    Get PDF
    We report new genetic diagnoses of Dravet syndrome in a group of adults with complex epilepsy of unknown cause, under follow up at a tertiary epilepsy centre. Individuals with epilepsy and other features of unknown cause from our unit underwent whole genome sequencing through the 100,000 Genomes Project. Virtual gene panels were applied to frequency-filtered variants based on phenotype summary. Of 1078 individuals recruited, 8 (0.74%) were identified to have a pathogenic or likely pathogenic variant in SCN1A. Variant types were: nonsense (stopgain) in five (62.5%) and missense in three (37.5%). Detailed review of childhood history confirmed a phenotype compatible with Dravet syndrome. Median age at genetic diagnosis was 44.5 years (range 28-52 years). Tonic-clonic seizures were ongoing in all despite polytherapy including valproate. All had a history of fever sensitivity and myoclonic seizures, which were ongoing in two (25%) and three (37.5%) individuals, respectively. Salient features of Dravet syndrome may be less apparent in adulthood, making clinical diagnosis difficult. Regardless of age, benefits of a genetic diagnosis include access to syndrome-specific treatment options, avoidance of harmful drugs, and monitoring for common complications
    corecore