2 research outputs found

    Quantifying Missing (Phospho)Proteome Regions with the Broad-Specificity Protease Subtilisin

    No full text
    Despite huge efforts to map the human proteome using mass spectrometry the overall sequence coverage achieved to date is still below 50%. Reasons for missing areas of the proteome comprise protease-resistant domains including the lack/excess of enzymatic cleavage sites, nonunique peptide sequences, impaired peptide ionization/separation and low expression levels. To access novel areas of the proteome the beneficial use of enzymes complementary to trypsin, such as Glu-C, Asp-N, Lys-N, Arg-C, LysargiNase has been reported. Here, we present how the broad-specificity protease subtilisin enables mapping of previously hidden areas of the proteome. We systematically evaluated its digestion efficiency and reproducibility and compared it to the gold standard in the field, trypsin. Notably, subtilisin allows reproducible near-complete digestion of cells lysates in 1–5 min. As expected from its broad specificity the generation of overlapping peptide sequences reduces the number of identified proteins compared to trypsin (8363 vs 6807; 1% protein FDR). However, subtilisin considerably improved the coverage of missing and particularly proline-rich areas of the proteome. Along 14 628 high confidence phosphorylation sites identified in total, only 33% were shared between both enzymes, while 37% were exclusive to subtilisin. Notably, 926 of these were not even accessible by additional in silico digestion with either Asp-N, Arg-C, Glu-C, Lys-C, or Lys-N. Thus, subtilisin might be particularly beneficial for system-wide profiling of post-translational modification sites. Finally, we demonstrate that subtilisin can be used for reporter-ion based in-depth quantification, providing a precision comparable to trypsindespite broad specificity and fast digestion that may increase technical variance

    Standardized analysis and sharing of genome-phenome data for neuromuscular and rare disease research through the RD-Connect platform

    No full text
    <b>Abstract: </b><div>RD-Connect (rd-connect.eu) is an EU-funded project building an integrated platform to narrow the gaps in rare disease research, where patient populations, clinical expertise and research communities are small in number and highly fragmented. Guided by the needs of rare disease researchers and with neuromuscular and neurodegenerative researchers as its original collaborators, the RD-Connect platform securely integrates multiple types of omics data (genomics, proteomics and transcriptomics) with biosample and clinical information – at the level of an individual patient, a family or a whole cohort, providing not only a centralized data repository but also a sophisticated and user-friendly online analysis system. Whole-genome, exome or gene panel NGS datasets from individuals with rare diseases and family members are deposited at the European Genome-phenome Archive, a longstanding archiving system designed for long-term storage of these large datasets. The raw data is then processed by RD-Connect's standardised analysis and annotation pipeline to make data from different sequencing providers more comparable. The corresponding clinical information from each individual is recorded in a connected PhenoTips instance, a software solution that simplifies the capture of clinical data using the Human Phenotype Ontology, OMIM and Orpha codes. The results are made available to authorised users through the highly configurable online platform (platform.rd-connect.eu), which runs on a Hadoop cluster and uses ElasticSearch – technologies designed to handle big data at high speeds. The user-friendly interface enables filtering and prioritization of variants using the most common quality, genomic location, effect, pathogenicity and population frequency annotations, enabling users from clinical labs without extensive bioinformatics support to do their primary genomic analysis of their own patients online and compare them with other submitted cohorts. Additional tools, such as Exomiser, DiseaseCard, Alamut Functional Annotation (ALFA) and UMD Predictor (umd-predictor.eu) are integrated at several levels. The RD-Connect platform is designed to enable data sharing at various levels depending on user permissions. At the most basic level (“does this specific variant exist in any individual in this cohort?”) it has lit a Beacon within the Global Alliance for Genomics and Health’s Beacon Network (www.beacon-network.org). At the next stage of sharing – finding similarities between patients in different databases with a matching phenotype and a candidate variant in the same gene – it is actively involved in the development of Matchmaker Exchange (www.matchmakerexchange.org), allowing users of different systems to securely exchange information to find confirmatory cases. And finally, since all patients within the system have been consented for data sharing, users of the system, after validation and authorization, are able to access datasets from other centres, providing an instant means of gathering cohorts for cross-validation and further study. Although open to any rare disease, the platform is currently enriched for neuromuscular and neurodegenerative phenotypes and includes almost 1000 genomic datasets from the NeurOmics project (www.rd-neuromics.eu) with several other contributions in the pipeline, including 1000 limb-girdle muscular dystrophy index cases from the Myo-Seq project (www.myo-seq.org) and more. The platform is free of charge to use and is open for contributions of NGS and phenotypic data from research labs worldwide via [email protected] <p></p></div
    corecore