42 research outputs found

    Infant High-Grade Gliomas Comprise Multiple Subgroups Characterized by Novel Targetable Gene Fusions and Favorable Outcomes.

    Get PDF
    Infant high-grade gliomas appear clinically distinct from their counterparts in older children, indicating that histopathologic grading may not accurately reflect the biology of these tumors. We have collected 241 cases under 4 years of age, and carried out histologic review, methylation profiling, and custom panel, genome, or exome sequencing. After excluding tumors representing other established entities or subgroups, we identified 130 cases to be part of an "intrinsic" spectrum of disease specific to the infant population. These included those with targetable MAPK alterations, and a large proportion of remaining cases harboring gene fusions targeting ALK (n = 31), NTRK1/2/3 (n = 21), ROS1 (n = 9), and MET (n = 4) as their driving alterations, with evidence of efficacy of targeted agents in the clinic. These data strongly support the concept that infant gliomas require a change in diagnostic practice and management. SIGNIFICANCE: Infant high-grade gliomas in the cerebral hemispheres comprise novel subgroups, with a prevalence of ALK, NTRK1/2/3, ROS1, or MET gene fusions. Kinase fusion-positive tumors have better outcome and respond to targeted therapy clinically. Other subgroups have poor outcome, with fusion-negative cases possibly representing an epigenetically driven pluripotent stem cell phenotype.See related commentary by Szulzewsky and Cimino, p. 904.This article is highlighted in the In This Issue feature, p. 890

    Literature mining of protein-residue associations with graph rules learned through distant supervision

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>We propose a method for automatic extraction of protein-specific residue mentions from the biomedical literature. The method searches text for mentions of amino acids at specific sequence positions and attempts to correctly associate each mention with a protein also named in the text. The methods presented in this work will enable improved protein functional site extraction from articles, ultimately supporting protein function prediction. Our method made use of linguistic patterns for identifying the amino acid residue mentions in text. Further, we applied an automated graph-based method to learn syntactic patterns corresponding to protein-residue pairs mentioned in the text. We finally present an approach to automated construction of relevant training and test data using the distant supervision model.</p> <p>Results</p> <p>The performance of the method was assessed by extracting protein-residue relations from a new automatically generated test set of sentences containing high confidence examples found using distant supervision. It achieved a F-measure of 0.84 on automatically created silver corpus and 0.79 on a manually annotated gold data set for this task, outperforming previous methods.</p> <p>Conclusions</p> <p>The primary contributions of this work are to (1) demonstrate the effectiveness of distant supervision for automatic creation of training data for protein-residue relation extraction, substantially reducing the effort and time involved in manual annotation of a data set and (2) show that the graph-based relation extraction approach we used generalizes well to the problem of protein-residue association extraction. This work paves the way towards effective extraction of protein functional residues from the literature.</p
    corecore