26 research outputs found
Profiling the Mismatch Tolerance of Argonaute 2 through Deep Sequencing of Sliced Polymorphic Viral RNAs
Low allelic and clonal variability among endogenous RNA interference (RNAi) targets has focused mismatch tolerance studies to RNAi-active guide strands. However, the inherent genomic instability of RNA viruses such as hepatitis C virus (HCV) gives rise to quasi-species mutants within discrete clones: this facilitates mismatch tolerance studies from a target perspective. We recently quantified the slicing imprecision of Argonaute 2 using small interfering RNA (siRNA) analogues of the DNA-directed RNAi drug TT-034 and next generation sequencing of 5’ RNA Ligase-Mediated Rapid Amplification of cDNA Ends (RACE-SEQ). Here, we present an open source, customizable, and computationally light RACE-SEQ bioinformatic pipeline, describing adaptations that semi-quantitatively report the impact of RNAi hybridisation site mismatches from the target perspective. The analysis shows Argonaute 2 has a substitution-specific, 3-5 log activity window between fully complementary targets and targets with mismatches across positions 10-11. It further focuses the endonucleotic Slicer imprecision around positions 13-17, demonstrating its dependence on guide strand central region complementarity, and potentiation by even a single mismatch. We further propose pharmacogenomics value in testing endogenous targets using recombinant replicon systems and RACE-SEQ to report the pharmacodynamics of sequence-specific oligonucleotide therapeutics against all possible polymorphisms in a population, in a minimally-biased, patient-free manner
Genetic constraint at single amino acid resolution in protein domains improves missense variant prioritisation and gene discovery
: Background: One of the major hurdles in clinical genetics is interpreting the clinical consequences associated with germline missense variants in humans. Recent significant advances have leveraged natural variation observed in large-scale human populations to uncover genes or genomic regions that show a depletion of natural variation, indicative of selection pressure. We refer to this as “genetic constraint”. Although existing genetic constraint metrics have been demonstrated to be successful in prioritising genes or genomic regions associated with diseases, their spatial resolution is limited in distinguishing pathogenic variants from benign variants within genes. Methods: We aim to identify missense variants that are significantly depleted in the general human population. Given the size of currently available human populations with exome or genome sequencing data, it is not possible to directly detect depletion of individual missense variants, since the average expected number of observations of a variant at most positions is less than one. We instead focus on protein domains, grouping homologous variants with similar functional impacts to examine the depletion of natural variations within these comparable sets. To accomplish this, we develop the Homologous Missense Constraint (HMC) score. We utilise the Genome Aggregation Database (gnomAD) 125 K exome sequencing data and evaluate genetic constraint at quasi amino-acid resolution by combining signals across protein homologues. Results: We identify one million possible missense variants under strong negative selection within protein domains. Though our approach annotates only protein domains, it nonetheless allows us to assess 22% of the exome confidently. It precisely distinguishes pathogenic variants from benign variants for both early-onset and adult-onset disorders. It outperforms existing constraint metrics and pathogenicity meta-predictors in prioritising de novo mutations from probands with developmental disorders (DD). It is also methodologically independent of these, adding power to predict variant pathogenicity when used in combination. We demonstrate utility for gene discovery by identifying seven genes newly significantly associated with DD that could act through an altered-function mechanism. Conclusions: Grouping variants of comparable functional impacts is effective in evaluating their genetic constraint. HMC is a novel and accurate predictor of missense consequence for improved variant interpretation
Phenotypic expression and outcomes in individuals with rare genetic variants of hypertrophic cardiomyopathy
BACKGROUND: Hypertrophic cardiomyopathy (HCM) is caused by rare variants in sarcomere-encoding genes, but little is known about the clinical significance of these variants in the general population. OBJECTIVES: The goal of this study was to compare lifetime outcomes and cardiovascular phenotypes according to the presence of rare variants in sarcomere-encoding genes among middle-aged adults. METHODS: This study analyzed whole exome sequencing and cardiac magnetic resonance imaging in UK Biobank participants stratified according to sarcomere-encoding variant status. RESULTS: The prevalence of rare variants (allele frequency <0.00004) in HCM-associated sarcomere-encoding genes in 200,584 participants was 2.9% (n = 5,712; 1 in 35), and the prevalence of variants pathogenic or likely pathogenic for HCM (SARC-HCM-P/LP) was 0.25% (n = 493; 1 in 407). SARC-HCM-P/LP variants were associated with an increased risk of death or major adverse cardiac events compared with controls (hazard ratio: 1.69; 95% confidence interval [CI]: 1.38-2.07; P < 0.001), mainly due to heart failure endpoints (hazard ratio: 4.23; 95% CI: 3.07-5.83; P < 0.001). In 21,322 participants with both cardiac magnetic resonance imaging and whole exome sequencing, SARC-HCM-P/LP variants were associated with an asymmetric increase in left ventricular maximum wall thickness (10.9 ± 2.7 mm vs 9.4 ± 1.6 mm; P < 0.001), but hypertrophy (≥13 mm) was only present in 18.4% (n = 9 of 49; 95% CI: 9%-32%). SARC-HCM-P/LP variants were still associated with heart failure after adjustment for wall thickness (hazard ratio: 6.74; 95% CI: 2.43-18.7; P < 0.001). CONCLUSIONS: In this population of middle-aged adults, SARC-HCM-P/LP variants have low aggregate penetrance for overt HCM but are associated with an increased risk of adverse cardiovascular outcomes and an attenuated cardiomyopathic phenotype. Although absolute event rates are low, identification of these variants may enhance risk stratification beyond familial disease
Beyond gene-disease validity: capturing structured data on inheritance, allelic requirement, disease-relevant variant classes, and disease mechanism for inherited cardiac conditions
Background:
As the availability of genomic testing grows, variant interpretation will increasingly be performed by genomic generalists, rather than domain-specific experts. Demand is rising for laboratories to accurately classify variants in inherited cardiac condition (ICC) genes, including secondary findings.
//
Methods:
We analyse evidence for inheritance patterns, allelic requirement, disease mechanism and disease-relevant variant classes for 65 ClinGen-curated ICC gene-disease pairs. We present this information for the first time in a structured dataset, CardiacG2P, and assess application in genomic variant filtering.
//
Results:
For 36/65 gene-disease pairs, loss of function is not an established disease mechanism, and protein truncating variants are not known to be pathogenic. Using the CardiacG2P dataset as an initial variant filter allows for efficient variant prioritisation whilst maintaining a high sensitivity for retaining pathogenic variants compared with two other variant filtering approaches.
//
Conclusions:
Access to evidence-based structured data representing disease mechanism and allelic requirement aids variant filtering and analysis and is a pre-requisite for scalable genomic testing
Beyond gene-disease validity: capturing structured data on inheritance, allelic-requirement, disease-relevant variant classes, and disease mechanism for inherited cardiac conditions
BACKGROUND: As availability of genomic testing grows, variant interpretation will increasingly be performed by genomic generalists, rather than domain-specific experts. Demand is rising for laboratories to accurately classify variants in inherited cardiac condition (ICC) genes, including as secondary findings. METHODS: We analyse evidence for inheritance patterns, allelic requirement, disease mechanism and disease-relevant variant classes for 65 ClinGen-curated ICC gene-disease pairs. We present this information for the first time in a structured dataset, CardiacG2P, and assess application in genomic variant filtering. RESULTS: For 36/65 gene-disease pairs, loss-of-function is not an established disease mechanism, and protein truncating variants are not known to be pathogenic. Using CardiacG2P as an initial variant filter allows for efficient variant prioritisation whilst maintaining a high sensitivity for retaining pathogenic variants compared with two other variant filtering approaches. CONCLUSIONS: Access to evidence-based structured data representing disease mechanism and allelic requirement aids variant filtering and analysis and is pre-requisite for scalable genomic testing
Beyond gene-disease validity: capturing structured data on inheritance, allelic requirement, disease-relevant variant classes, and disease mechanism for inherited cardiac conditions
BACKGROUND: As the availability of genomic testing grows, variant interpretation will increasingly be performed by genomic generalists, rather than domain-specific experts. Demand is rising for laboratories to accurately classify variants in inherited cardiac condition (ICC) genes, including secondary findings. METHODS: We analyse evidence for inheritance patterns, allelic requirement, disease mechanism and disease-relevant variant classes for 65 ClinGen-curated ICC gene-disease pairs. We present this information for the first time in a structured dataset, CardiacG2P, and assess application in genomic variant filtering. RESULTS: For 36/65 gene-disease pairs, loss of function is not an established disease mechanism, and protein truncating variants are not known to be pathogenic. Using the CardiacG2P dataset as an initial variant filter allows for efficient variant prioritisation whilst maintaining a high sensitivity for retaining pathogenic variants compared with two other variant filtering approaches. CONCLUSIONS: Access to evidence-based structured data representing disease mechanism and allelic requirement aids variant filtering and analysis and is a pre-requisite for scalable genomic testing
Recommended from our members
Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions
Funder: Science and Technology Development Fund; doi: https://doi.org/10.13039/Funder: Al-Alfi FoundationFunder: Magdi Yacoub Heart FoundationFunder: Rosetrees and Stoneygate Imperial College Research FellowshipFunder: National Health and Medical Research Council (Australia)Abstract: Purpose: Accurate discrimination of benign and pathogenic rare variation remains a priority for clinical genome interpretation. State-of-the-art machine learning variant prioritization tools are imprecise and ignore important parameters defining gene–disease relationships, e.g., distinct consequences of gain-of-function versus loss-of-function variants. We hypothesized that incorporating disease-specific information would improve tool performance. Methods: We developed a disease-specific variant classifier, CardioBoost, that estimates the probability of pathogenicity for rare missense variants in inherited cardiomyopathies and arrhythmias. We assessed CardioBoost’s ability to discriminate known pathogenic from benign variants, prioritize disease-associated variants, and stratify patient outcomes. Results: CardioBoost has high global discrimination accuracy (precision recall area under the curve [AUC] 0.91 for cardiomyopathies; 0.96 for arrhythmias), outperforming existing tools (4–24% improvement). CardioBoost obtains excellent accuracy (cardiomyopathies 90.2%; arrhythmias 91.9%) for variants classified with >90% confidence, and increases the proportion of variants classified with high confidence more than twofold compared with existing tools. Variants classified as disease-causing are associated with both disease status and clinical severity, including a 21% increased risk (95% confidence interval [CI] 11–29%) of severe adverse outcomes by age 60 in patients with hypertrophic cardiomyopathy. Conclusions: A disease-specific variant classifier outperforms state-of-the-art genome-wide tools for rare missense variants in inherited cardiac conditions (https://www.cardiodb.org/cardioboost/), highlighting broad opportunities for improved pathogenicity prediction through disease specificity
Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions
Funder: Science and Technology Development Fund; doi: https://doi.org/10.13039/Funder: Al-Alfi FoundationFunder: Magdi Yacoub Heart FoundationFunder: Rosetrees and Stoneygate Imperial College Research FellowshipFunder: National Health and Medical Research Council (Australia)Abstract: Purpose: Accurate discrimination of benign and pathogenic rare variation remains a priority for clinical genome interpretation. State-of-the-art machine learning variant prioritization tools are imprecise and ignore important parameters defining gene–disease relationships, e.g., distinct consequences of gain-of-function versus loss-of-function variants. We hypothesized that incorporating disease-specific information would improve tool performance. Methods: We developed a disease-specific variant classifier, CardioBoost, that estimates the probability of pathogenicity for rare missense variants in inherited cardiomyopathies and arrhythmias. We assessed CardioBoost’s ability to discriminate known pathogenic from benign variants, prioritize disease-associated variants, and stratify patient outcomes. Results: CardioBoost has high global discrimination accuracy (precision recall area under the curve [AUC] 0.91 for cardiomyopathies; 0.96 for arrhythmias), outperforming existing tools (4–24% improvement). CardioBoost obtains excellent accuracy (cardiomyopathies 90.2%; arrhythmias 91.9%) for variants classified with >90% confidence, and increases the proportion of variants classified with high confidence more than twofold compared with existing tools. Variants classified as disease-causing are associated with both disease status and clinical severity, including a 21% increased risk (95% confidence interval [CI] 11–29%) of severe adverse outcomes by age 60 in patients with hypertrophic cardiomyopathy. Conclusions: A disease-specific variant classifier outperforms state-of-the-art genome-wide tools for rare missense variants in inherited cardiac conditions (https://www.cardiodb.org/cardioboost/), highlighting broad opportunities for improved pathogenicity prediction through disease specificity
Phenotypic expression and outcomes in individuals with rare genetic variants of hypertrophic cardiomyopathy
Background: Hypertrophic cardiomyopathy (HCM) is caused by rare variants in sarcomereencoding genes, but little is known about the clinical significance of these variants in the general population. Objectives: To compare lifetime outcomes and cardiovascular phenotypes according to the presence of rare variants in sarcomere-encoding genes amongst middle-aged adults. Methods: We analysed whole exome sequencing and cardiac magnetic resonance (CMR) imaging in UK Biobank participants stratified by sarcomere-encoding variant status. Results: The prevalence of rare variants (allele frequency <0.00004) in HCM-associated sarcomere-encoding genes in 200,584 participants was 2.9% (n=5,712; 1 in 35), and the prevalence of variants pathogenic or likely pathogenic for HCM (SARC-HCM-P/LP) was 0.25% (n=493, 1 in 407). SARC-HCM-P/LP variants were associated with increased risk of death or major adverse cardiac events compared to controls (HR 1.69, 95% CI 1.38 to 2.07, p<0.001), mainly due to heart failure endpoints (HR 4.23, 95% CI 3.07 to 5.83, p<0.001). In 21,322 participants with CMR, SARC-HCM-P/LP were associated with asymmetric increase in left ventricular maximum wall thickness (10.9±2.7 vs 9.4±1.6 mm, p<0.001) but hypertrophy (≥13mm) was only present in 18.4% (n=9/49, 95% CI 9 to 32%). SARC-HCMP/LP were still associated with heart failure after adjustment for wall thickness (HR 6.74, 95% CI 2.43 to 18.7, p<0.001). Conclusions: In this population of middle-aged adults, SARC-HCM-P/LP variants have low aggregate penetrance for overt HCM but are associated with increased risk of adverse cardiovascular outcomes and an attenuated cardiomyopathic phenotype. Although absolute event rates are low, identification of these variants may enhance risk stratification beyond familial disease