33 research outputs found

    How complete are current yeast and human protein-interaction networks?

    Get PDF
    We estimate the full yeast protein-protein interaction network to contain 37,800-75,500 interactions and the human network 154,000-369,000, but owing to a high false-positive rate, current maps are roughly only 50% and 10% complete, respectively. Paradoxically, releasing raw, unfiltered assay data might help separate true from false interactions

    Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome

    Get PDF
    BACKGROUND: Extensive protein interaction maps are being constructed for yeast, worm, and fly to ask how the proteins organize into pathways and systems, but no such genome-wide interaction map yet exists for the set of human proteins. To prepare for studies in humans, we wished to establish tests for the accuracy of future interaction assays and to consolidate the known interactions among human proteins. RESULTS: We established two tests of the accuracy of human protein interaction datasets and measured the relative accuracy of the available data. We then developed and applied natural language processing and literature-mining algorithms to recover from Medline abstracts 6,580 interactions among 3,737 human proteins. A three-part algorithm was used: first, human protein names were identified in Medline abstracts using a discriminator based on conditional random fields, then interactions were identified by the co-occurrence of protein names across the set of Medline abstracts, filtering the interactions with a Bayesian classifier to enrich for legitimate physical interactions. These mined interactions were combined with existing interaction data to obtain a network of 31,609 interactions among 7,748 human proteins, accurate to the same degree as the existing datasets. CONCLUSION: These interactions and the accuracy benchmarks will aid interpretation of current functional genomics data and provide a basis for determining the quality of future large-scale human protein interaction assays. Projecting from the approximately 15 interactions per protein in the best-sampled interaction set to the estimated 25,000 human genes implies more than 375,000 interactions in the complete human protein interaction network. This set therefore represents no more than 10% of the complete network

    High resolution transcriptome maps for wild-type and nonsense-mediated decay-defective Caenorhabditis elegans

    Get PDF
    The high-resolution transcriptome of wild-type and nonsense-mediated decay (NMD) defective C. elegans during development reveals insights into the NMD pathway and it’s role in development

    A map of human protein interactions derived from co-expression of human mRNAs and their orthologs

    Get PDF
    The human protein interaction network will offer global insights into the molecular organization of cells and provide a framework for modeling human disease, but the network's large scale demands new approaches. We report a set of 7000 physical associations among human proteins inferred from indirect evidence: the comparison of human mRNA co-expression patterns with those of orthologous genes in five other eukaryotes, which we demonstrate identifies proteins in the same physical complexes. To evaluate the accuracy of the predicted physical associations, we apply quantitative mass spectrometry shotgun proteomics to measure elution profiles of 3013 human proteins during native biochemical fractionation, demonstrating systematically that putative interaction partners tend to co-sediment. We further validate uncharacterized proteins implicated by the associations in ribosome biogenesis, including WBSCR20C, associated with Williams–Beuren syndrome. This meta-analysis therefore exploits non-protein-based data, but successfully predicts associations, including 5589 novel human physical protein associations, with measured accuracies of 54±10%, comparable to direct large-scale interaction assays. The new associations' derivation from conserved in vivo phenomena argues strongly for their biological relevance

    Whole genome analysis for 163 gRNAs in Cas9-edited mice reveals minimal off-target activity.

    Get PDF
    Genome editing with CRISPR-associated (Cas) proteins holds exceptional promise for correcting variants causing genetic disease. To realize this promise, off-target genomic changes cannot occur during the editing process. Here, we use whole genome sequencing to compare the genomes of 50 Cas9-edited founder mice to 28 untreated control mice to assess the occurrence of S. pyogenes Cas9-induced off-target mutagenesis. Computational analysis of whole-genome sequencing data detects 26 unique sequence variants at 23 predicted off-target sites for 18/163 guides used. While computationally detected variants are identified in 30% (15/50) of Cas9 gene-edited founder animals, only 38% (10/26) of the variants in 8/15 founders validate by Sanger sequencing. In vitro assays for Cas9 off-target activity identify only two unpredicted off-target sites present in genome sequencing data. In total, only 4.9% (8/163) of guides tested have detectable off-target activity, a rate of 0.2 Cas9 off-target mutations per founder analyzed. In comparison, we observe ~1,100 unique variants in each mouse regardless of genome exposure to Cas9 indicating off-target variants comprise a small fraction of genetic heterogeneity in Cas9-edited mice. These findings will inform future design and use of Cas9-edited animal models as well as provide context for evaluating off-target potential in genetically diverse patient populations

    Alterations in ALK/ROS1/NTRK/MET drive a group of infantile hemispheric gliomas

    Get PDF
    © The Author(s) 2019. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.Infant gliomas have paradoxical clinical behavior compared to those in children and adults: low-grade tumors have a higher mortality rate, while high-grade tumors have a better outcome. However, we have little understanding of their biology and therefore cannot explain this behavior nor what constitutes optimal clinical management. Here we report a comprehensive genetic analysis of an international cohort of clinically annotated infant gliomas, revealing 3 clinical subgroups. Group 1 tumors arise in the cerebral hemispheres and harbor alterations in the receptor tyrosine kinases ALK, ROS1, NTRK and MET. These are typically single-events and confer an intermediate outcome. Groups 2 and 3 gliomas harbor RAS/MAPK pathway mutations and arise in the hemispheres and midline, respectively. Group 2 tumors have excellent long-term survival, while group 3 tumors progress rapidly and do not respond well to chemoradiation. We conclude that infant gliomas comprise 3 subgroups, justifying the need for specialized therapeutic strategies.info:eu-repo/semantics/publishedVersio

    2005 Ramani et Volume al. 6, Issue 5, Article r40 Open Access

    No full text
    Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactom
    corecore