64 research outputs found

    Untitled

    Get PDF

    A standardized framing for reporting protein identifications in mzIdentML 1.2

    Get PDF
    Inferring which protein species have been detected in bottom-up proteomics experiments has been a challenging problem for which solutions have been maturing over the past decade. While many inference approaches now function well in isolation, comparing and reconciling the results generated across different tools remains difficult. It presently stands as one of the greatest barriers in collaborative efforts such as the Human Proteome Project and public repositories such as the PRoteomics IDEntifications (PRIDE) database. Here we present a framework for reporting protein identifications that seeks to improve capabilities for comparing results generated by different inference tools. This framework standardizes the terminology for describing protein identification results, associated with the HUPO-Proteomics Standards Initiative (PSI) mzIdentML standard, while still allowing for differing methodologies to reach that final state. It is proposed that developers of software for reporting identification results will adopt this terminology in their outputs. While the new terminology does not require any changes to the core mzIdentML model, it represents a significant change in practice, and, as such, the rules will be released via a new version of the mzIdentML specification (version 1.2) so that consumers of files are able to determine whether the new guidelines have been adopted by export software

    Ten Simple Rules for Taking Advantage of Git and GitHub.

    Get PDF
    Bioinformatics is a broad discipline in which one common denominator is the need to produce and/or use software that can be applied to biological data in different contexts. To enable and ensure the replicability and traceability of scientific claims, it is essential that the scientific publication, the corresponding datasets, and the data analysis are made publicly available [1,2]. All software used for the analysis should be either carefully documented (e.g., for commercial software) or, better yet, openly shared and directly accessible to others [3,4]. The rise of openly available software and source code alongside concomitant collaborative development is facilitated by the existence of several code repository services such as SourceForge, Bitbucket, GitLab, and GitHub, among others. These resources are also essential for collaborative software projects because they enable the organization and sharing of programming tasks between different remote contributors. Here, we introduce the main features of GitHub, a popular web-based platform that offers a free and integrated environment for hosting the source code, documentation, and project-related web content for open-source projects. GitHub also offers paid plans for private repositories (see Box 1) for individuals and businesses as well as free plans including private repositories for research and educational use.Biotechnology and Biological Sciences Research CouncilThis is the final version of the article. It first appeared from Public Library of Science via https://doi.org/10.1371/journal.pcbi.1004947

    The PRIDE database and related tools and resources in 2019: improving support for quantification data

    No full text
    The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas

    New insights into the protein aggregation pathology in myotilinopathy by combined proteomic and immunolocalization analyses

    Get PDF
    Introduction: Myofibrillar myopathies are characterized by progressive muscle weakness and impressive abnormal protein aggregation in muscle fibers. In about 10 % of patients, the disease is caused by mutations in the MYOT gene encoding myotilin. The aim of our study was to decipher the composition of protein deposits in myotilinopathy to get new information about aggregate pathology. Results: Skeletal muscle samples from 15 myotilinopathy patients were included in the study. Aggregate and control samples were collected from muscle sections by laser microdissection and subsequently analyzed by a highly sensitive proteomic approach that enables a relative protein quantification. In total 1002 different proteins were detected. Seventy-six proteins showed a significant over-representation in aggregate samples including 66 newly identified aggregate proteins. Z-disc-associated proteins were the most abundant aggregate components, followed by sarcolemmal and extracellular matrix proteins, proteins involved in protein quality control and degradation, and proteins with a function in actin dynamics or cytoskeletal transport. Forty over-represented proteins were evaluated by immunolocalization studies. These analyses validated our mass spectrometric data and revealed different regions of protein accumulation in abnormal muscle fibers. Comparison of data from our proteomic analysis in myotilinopathy with findings in other myofibrillar myopathy subtypes indicates a characteristic basic pattern of aggregate composition and resulted in identification of a highly sensitive and specific diagnostic marker for myotilinopathy. Conclusions: Our findings i) indicate that main protein components of aggregates belong to a network of interacting proteins, ii) provide new insights into the complex regulation of protein degradation in myotilinopathy that may be relevant for new treatment strategies, iii) imply a combination of a toxic gain-of-function leading to myotilin-positive protein aggregates and a loss-of-function caused by a shift in subcellular distribution with a deficiency of myotilin at Z-discs that impairs the integrity of myofibrils, and iv) demonstrate that proteomic analysis can be helpful in differential diagnosis of protein aggregate myopathies

    Differential proteomic analysis of abnormal intramyoplasmic aggregates in desminopathy

    Get PDF
    Desminopathy is a subtype of myofibrillar myopathy caused by desmin mutations and characterized by protein aggregates accumulating in muscle fibers. The aim of this study was to assess the protein composition of these aggregates. Aggregates and intact myofiber sections were obtained from skeletal muscle biopsies of five desminopathy patients by laser microdissection and analyzed by a label-free spectral count-based proteomic approach. We identified 397 proteins with 22 showing significantly higher spectral indices in aggregates (ratio >1.8, p <0.05). Fifteen of these proteins not previously reported as specific aggregate components provide new insights regarding pathomechanisms of desminopathy. Results of proteomic analysis were supported by immunolocalization studies and parallel reaction monitoring. Three mutant desmin variants were detected directly on the protein level as components of the aggregates, suggesting their direct involvement in aggregate-formation and demonstrating for the first time that proteomic analysis can be used for direct identification of a disease-causing mutation in myofibrillar myopathy. Comparison of the proteomic results in desminopathy with our previous analysis of aggregate composition in filaminopathy, another myofibrillar myopathy subtype, allows to determine subtype-specific proteomic profile that facilitates identification of the specific disorder. Biological significance Our proteomic analysis provides essential new insights in the composition of pathological protein aggregates in skeletal muscle fibers of desminopathy patients. The results contribute to a better understanding of pathomechanisms in myofibrillar myopathies and provide the basis for hypothesis-driven studies. The detection of specific proteomic profiles in different myofibrillar myopathy subtypes indicates that proteomic analysis may become a useful tool in differential diagnosis of protein aggregate myopathies. This article is part of a Special Issue entitled: From Genome to Proteome: Open Innovations. (C) 2013 Elsevier B.V. All rights reserved

    Detecting modification of biomedical events using a deep parsing approach

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. <it>analysis of IkappaBalpha phosphorylation</it>, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. <it>inhibition of IkappaBalpha phosphorylation</it>, where phosphorylation did <it>not </it>occur). The data comes from a standard dataset created for the BioNLP 2009 Shared Task. The system uses a machine-learning approach, where the features used for classification are a combination of shallow features derived from the words of the sentences and more complex features based on the semantic outputs produced by a deep parser.</p> <p>Method</p> <p>To detect event modification, we use a Maximum Entropy learner with features extracted from the data relative to the trigger words of the events. The shallow features are bag-of-words features based on a small sliding context window of 3-4 tokens on either side of the trigger word. The deep parser features are derived from parses produced by the English Resource Grammar and the <it>RASP </it>parser. The outputs of these parsers are converted into the Minimal Recursion Semantics formalism, and from this, we extract features motivated by linguistics and the data itself. All of these features are combined to create training or test data for the machine learning algorithm.</p> <p>Results</p> <p>Over the test data, our methods produce approximately a 4% absolute increase in F-score for detection of event modification compared to a baseline based only on the shallow bag-of-words features.</p> <p>Conclusions</p> <p>Our results indicate that grammar-based techniques can enhance the accuracy of methods for detecting event modification.</p

    New insights into the protein aggregation pathology in myotilinopathy by combined proteomic and immunolocalization analyses

    Get PDF
    Introduction: Myofibrillar myopathies are characterized by progressive muscle weakness and impressive abnormal protein aggregation in muscle fibers. In about 10 % of patients, the disease is caused by mutations in the MYOT gene encoding myotilin. The aim of our study was to decipher the composition of protein deposits in myotilinopathy to get new information about aggregate pathology. Results: Skeletal muscle samples from 15 myotilinopathy patients were included in the study. Aggregate and control samples were collected from muscle sections by laser microdissection and subsequently analyzed by a highly sensitive proteomic approach that enables a relative protein quantification. In total 1002 different proteins were detected. Seventy-six proteins showed a significant over-representation in aggregate samples including 66 newly identified aggregate proteins. Z-disc-associated proteins were the most abundant aggregate components, followed by sarcolemmal and extracellular matrix proteins, proteins involved in protein quality control and degradation, and proteins with a function in actin dynamics or cytoskeletal transport. Forty over-represented proteins were evaluated by immunolocalization studies. These analyses validated our mass spectrometric data and revealed different regions of protein accumulation in abnormal muscle fibers. Comparison of data from our proteomic analysis in myotilinopathy with findings in other myofibrillar myopathy subtypes indicates a characteristic basic pattern of aggregate composition and resulted in identification of a highly sensitive and specific diagnostic marker for myotilinopathy. Conclusions: Our findings i) indicate that main protein components of aggregates belong to a network of interacting proteins, ii) provide new insights into the complex regulation of protein degradation in myotilinopathy that may be relevant for new treatment strategies, iii) imply a combination of a toxic gain-of-function leading to myotilin-positive protein aggregates and a loss-of-function caused by a shift in subcellular distribution with a deficiency of myotilin at Z-discs that impairs the integrity of myofibrils, and iv) demonstrate that proteomic analysis can be helpful in differential diagnosis of protein aggregate myopathies

    Proteomics of rimmed vacuoles define new risk allele in inclusion body myositis

    Get PDF
    OBJECTIVE: Sporadic inclusion body myositis (sIBM) pathogenesis is unknown; however, rimmed vacuoles (RVs) are a constant feature. We propose to identify proteins that accumulate within RVs. METHODS: RVs and intact myofibers were laser microdissected from skeletal muscle of 18 sIBM patients and analyzed by a sensitive mass spectrometry approach using label-free spectral count-based relative protein quantification. Whole exome sequencing was performed on 62 sIBM patients. Immunofluorescence was performed on patient and mouse skeletal muscle. RESULTS: 213 proteins were enriched by >1.5X in RVs compared to controls and included proteins previously reported to accumulate in sIBM tissue or when mutated cause myopathies with RVs. Proteins associated with protein folding and autophagy were the largest group represented. One autophagic adaptor protein not previously identified in sIBM was FYCO1. Rare missense coding FYCO1 variants were present in 11.3% of sIBM patients compared with 2.6% of controls (p=0.003). FYCO1 co-localized at RVs with autophagic proteins such as MAP1LC3 and SQSTM1 in sIBM and other RV myopathies. One FYCO1 variant protein had reduced co-localization with MAP1LC3 when expressed in mouse muscle. INTERPRETATION: This study used an unbiased proteomic approach to identify RV proteins in sIBM that included a novel protein involved in sIBM pathogenesis. FYCO1 accumulates at RVs and rare missense variants in FYCO1 are overrepresented in sIBM patients. These FYCO1 variants may impair autophagic function leading to RV formation in sIBM patient muscle. FYCO1 functionally connects autophagic and endocytic pathways supporting the hypothesis that impaired endolysosmal degradation underlies the pathogenesis of sIBM
    corecore