103 research outputs found

    Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels

    Get PDF
    This is the final version. Available from BMC via the DOI in this record.All variants in the final dataset are included in Additional file 2. The publicly available variant datasets are available from gnomAD [22] or ClinVar [10]. Genomic datasets from the DDD Study are available under managed access for research into developmental disorders via the European Genome-phenome Archive (EGAS00001000775). Individual pathogenic/likely pathogenic variants are openly accessible with phenotypes via DECIPHER [38].Background: The use of in silico pathogenicity predictions as evidence when interpreting genetic variants is widely accepted as part of standard variant classification guidelines. Although numerous algorithms have been developed and evaluated for classifying missense variants, in-frame insertions/deletions (indels) have been much less well studied. Methods: We created a dataset of 3964 small (< 100 bp) indels predicted to result in in-frame amino acid insertions or deletions using data from gnomAD v3.1 (minor allele frequency of 1–5%), ClinVar and the Deciphering Developmental Disorders (DDD) study. We used this dataset to evaluate the performance of nine pathogenicity predictor tools: CADD, CAPICE, FATHMM-indel, MutPred-Indel, MutationTaster2021, PROVEAN, SIFT-indel, VEST-indel and VVP. Results: Our dataset consisted of 2224 benign/likely benign and 1740 pathogenic/likely pathogenic variants from gnomAD (n = 809), ClinVar (n = 2882) and, DDD (n = 273). We were able to generate scores across all tools for 91% of the variants, with areas under the ROC curve (AUC) of 0.81–0.96 based on the published recommended thresholds. To avoid biases caused by inclusion of our dataset in the tools’ training data, we also evaluated just DDD variants not present in either gnomAD or ClinVar (70 pathogenic and 81 benign). Using this subset, the AUC of all tools decreased substantially to 0.64–0.87. Several of the tools performed similarly however, VEST-indel had the highest AUCs of 0.93 (full dataset) and 0.87 (DDD subset). Conclusions: Algorithms designed for predicting the pathogenicity of in-frame indels perform well enough to aid clinical variant classification in a similar manner to missense prediction tools.Medical Research Council (MRC)Health Innovation Challenge FundWellcome Trust Sanger Institut

    Recurrent De Novo NAHR Reciprocal Duplications in the ATAD3 Gene Cluster Cause a Neurogenetic Trait with Perturbed Cholesterol and Mitochondrial Metabolism.

    Get PDF
    Recent studies have identified both recessive and dominant forms of mitochondrial disease that result from ATAD3A variants. The recessive form includes subjects with biallelic deletions mediated by non-allelic homologous recombination. We report five unrelated neonates with a lethal metabolic disorder characterized by cardiomyopathy, corneal opacities, encephalopathy, hypotonia, and seizures in whom a monoallelic reciprocal duplication at the ATAD3 locus was identified. Analysis of the breakpoint junction fragment indicated that these 67 kb heterozygous duplications were likely mediated by non-allelic homologous recombination at regions of high sequence identity in ATAD3A exon 11 and ATAD3C exon 7. At the recombinant junction, the duplication allele produces a fusion gene derived from ATAD3A and ATAD3C, the protein product of which lacks key functional residues. Analysis of fibroblasts derived from two affected individuals shows that the fusion gene product is expressed and stable. These cells display perturbed cholesterol and mitochondrial DNA organization similar to that observed for individuals with severe ATAD3A deficiency. We hypothesize that the fusion protein acts through a dominant-negative mechanism to cause this fatal mitochondrial disorder. Our data delineate a molecular diagnosis for this disorder, extend the clinical spectrum associated with structural variation at the ATAD3 locus, and identify a third mutational mechanism for ATAD3 gene cluster variants. These results further affirm structural variant mutagenesis mechanisms in sporadic disease traits, emphasize the importance of copy number analysis in molecular genomic diagnosis, and highlight some of the challenges of detecting and interpreting clinically relevant rare gene rearrangements from next-generation sequencing data

    ReadNet: A Hierarchical Transformer Framework for Web Article Readability Analysis

    Full text link
    Analyzing the readability of articles has been an important sociolinguistic task. Addressing this task is necessary to the automatic recommendation of appropriate articles to readers with different comprehension abilities, and it further benefits education systems, web information systems, and digital libraries. Current methods for assessing readability employ empirical measures or statistical learning techniques that are limited by their ability to characterize complex patterns such as article structures and semantic meanings of sentences. In this paper, we propose a new and comprehensive framework which uses a hierarchical self-attention model to analyze document readability. In this model, measurements of sentence-level difficulty are captured along with the semantic meanings of each sentence. Additionally, the sentence-level features are incorporated to characterize the overall readability of an article with consideration of article structures. We evaluate our proposed approach on three widely-used benchmark datasets against several strong baseline approaches. Experimental results show that our proposed method achieves the state-of-the-art performance on estimating the readability for various web articles and literature.Comment: ECIR 202

    Diagnosis of lethal or prenatal-onset autosomal recessive disorders by parental exome sequencing.

    Get PDF
    OBJECTIVE: Rare genetic disorders resulting in prenatal or neonatal death are genetically heterogeneous, but testing is often limited by the availability of fetal DNA, leaving couples without a potential prenatal test for future pregnancies. We describe our novel strategy of exome sequencing parental DNA samples to diagnose recessive monogenic disorders in an audit of the first 50 couples referred. METHOD: Exome sequencing was carried out in a consecutive series of 50 couples who had 1 or more pregnancies affected with a lethal or prenatal-onset disorder. In all cases, there was insufficient DNA for exome sequencing of the affected fetus. Heterozygous rare variants (MAF < 0.001) in the same gene in both parents were selected for analysis. Likely, disease-causing variants were tested in fetal DNA to confirm co-segregation. RESULTS: Parental exome analysis identified heterozygous pathogenic (or likely pathogenic) variants in 24 different genes in 26/50 couples (52%). Where 2 or more fetuses were affected, a genetic diagnosis was obtained in 18/29 cases (62%). In most cases, the clinical features were typical of the disorder, but in others, they result from a hypomorphic variant or represent the most severe form of a variable phenotypic spectrum. CONCLUSION: We conclude that exome sequencing of parental samples is a powerful strategy with high clinical utility for the genetic diagnosis of lethal or prenatal-onset recessive disorders. © 2017 The Authors Prenatal Diagnosis published by John Wiley & Sons Ltd

    Falls and falls efficacy: the role of sustained attention in older adults

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Previous evidence indicates that older people allocate more of their attentional resources toward their gait and that the attention-related changes that occur during aging increase the risk of falls. The aim of this study was to investigate whether performance and variability in sustained attention is associated with falls and falls efficacy in older adults.</p> <p>Methods</p> <p>458 community-dwelling adults aged ≥ 60 years underwent a comprehensive geriatric assessment. Mean and variability of reaction time (RT), commission errors and omission errors were recorded during a fixed version of the Sustained Attention to Response Task (SART). RT variability was decomposed using the Fast Fourier Transform (FFT) procedure, to help characterise variability associated with the arousal and vigilance aspects of sustained attention.</p> <p>The number of self-reported falls in the previous twelve months, and falls efficacy (Modified Falls Efficacy Scale) were also recorded.</p> <p>Results</p> <p>Significant increases in the mean and variability of reaction time on the SART were significantly associated with both falls (p < 0.01) and reduced falls efficacy (p < 0.05) in older adults. An increase in omission errors was also associated with falls (p < 0.01) and reduced falls efficacy (p < 0.05). Upon controlling for age and gender affects, logistic regression modelling revealed that increasing variability associated with the vigilance (top-down) aspect of sustained attention was a retrospective predictor of falling (p < 0.01, OR = 1.14, 95% CI: 1.03 - 1.26) in the previous year and was weakly correlated with reduced falls efficacy in non-fallers (p = 0.07).</p> <p>Conclusions</p> <p>Greater variability in sustained attention is strongly correlated with retrospective falls and to a lesser degree with reduced falls efficacy. This cognitive measure may provide a novel and valuable biomarker for falls in older adults, potentially allowing for early detection and the implementation of preventative intervention strategies.</p
    corecore