5,503 research outputs found

    Current challenges in software solutions for mass spectrometry-based quantitative proteomics

    Get PDF
    This work was in part supported by the PRIME-XS project, grant agreement number 262067, funded by the European Union seventh Framework Programme; The Netherlands Proteomics Centre, embedded in The Netherlands Genomics Initiative; The Netherlands Bioinformatics Centre; and the Centre for Biomedical Genetics (to S.C., B.B. and A.J.R.H); by NIH grants NCRR RR001614 and RR019934 (to the UCSF Mass Spectrometry Facility, director: A.L. Burlingame, P.B.); and by grants from the MRC, CR-UK, BBSRC and Barts and the London Charity (to P.C.

    Differential neuroproteomic and systems biology analysis of spinal cord injury

    Get PDF
    Acute spinal cord injury (SCI) is a devastating condition with many consequences and no known effective treatment. Although it is quite easy to diagnose traumatic SCI, the assessment of injury severity and projection of disease progression or recovery are often challenging, as no consensus biomarkers have been clearly identified. Here rats were subjected to experimental moderate or severe thoracic SCI. At 24h and 7d postinjury, spinal cord segment caudal to injury center versus sham samples was harvested and subjected to differential proteomic analysis. Cationic/anionic-exchange chromatography, followed by 1D polyacrylamide gel electrophoresis, was used to reduce protein complexity. A reverse phase liquid chromatography-tandem mass spectrometry proteomic platform was then utilized to identify proteome changes associated with SCI. Twenty-two and 22 proteins were up-regulated at 24 h and 7 day after SCI, respectively; whereas 19 and 16 proteins are down-regulated at 24 h and 7 day after SCI, respectively, when compared with sham control. A subset of 12 proteins were identified as candidate SCI biomarkers - TF (Transferrin), FASN (Fatty acid synthase), NME1 (Nucleoside diphosphate kinase 1), STMN1 (Stathmin 1), EEF2 (Eukaryotic translation elongation factor 2), CTSD (Cathepsin D), ANXA1 (Annexin A1), ANXA2 (Annexin A2), PGM1 (Phosphoglucomutase 1), PEA15 (Phosphoprotein enriched in astrocytes 15), GOT2 (Glutamic-oxaloacetic transaminase 2), and TPI-1 (Triosephosphate isomerase 1), data are available via ProteomeXchange with identifier PXD003473. In addition, Transferrin, Cathepsin D, and TPI-1 and PEA15 were further verified in rat spinal cord tissue and/or CSF samples after SCI and in human CSF samples from moderate/severe SCI patients. Lastly, a systems biology approach was utilized to determine the critical biochemical pathways and interactome in the pathogenesis of SCI. Thus, SCI candidate biomarkers identified can be used to correlate with disease progression or to identify potential SCI therapeutic targets

    A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondii and Neospora caninum

    Get PDF
    Proteomics data can supplement genome annotation efforts, for example being used to confirm gene models or correct gene annotation errors. Here, we present a large‐scale proteogenomics study of two important apicomplexan pathogens: Toxoplasma gondii and Neospora caninum. We queried proteomics data against a panel of official and alternate gene models generated directly from RNASeq data, using several newly generated and some previously published MS datasets for this meta‐analysis. We identified a total of 201 996 and 39 953 peptide‐spectrum matches for T. gondii and N. caninum, respectively, at a 1% peptide FDR threshold. This equated to the identification of 30 494 distinct peptide sequences and 2921 proteins (matches to official gene models) for T. gondii, and 8911 peptides/1273 proteins for N. caninum following stringent protein‐level thresholding. We have also identified 289 and 140 loci for T. gondii and N. caninum, respectively, which mapped to RNA‐Seq‐derived gene models used in our analysis and apparently absent from the official annotation (release 10 from EuPathDB) of these species. We present several examples in our study where the RNA‐Seq evidence can help in correction of the current gene model and can help in discovery of potential new genes

    Machine Learning and Integrative Analysis of Biomedical Big Data.

    Get PDF
    Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges as well as exacerbates the ones associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues
    corecore