354 research outputs found

    Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification

    Get PDF
    Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms

    Initial recommendations for performing, benchmarking, and reporting single-cell proteomics experiments

    Full text link
    Analyzing proteins from single cells by tandem mass spectrometry (MS) has become technically feasible. While such analysis has the potential to accurately quantify thousands of proteins across thousands of single cells, the accuracy and reproducibility of the results may be undermined by numerous factors affecting experimental design, sample preparation, data acquisition, and data analysis. Broadly accepted community guidelines and standardized metrics will enhance rigor, data quality, and alignment between laboratories. Here we propose best practices, quality controls, and data reporting recommendations to assist in the broad adoption of reliable quantitative workflows for single-cell proteomics.Comment: Supporting website: https://single-cell.net/guideline

    Improved quality control processing of peptide-centric LC-MS proteomics data

    Get PDF
    Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values

    Current challenges in software solutions for mass spectrometry-based quantitative proteomics

    Get PDF
    This work was in part supported by the PRIME-XS project, grant agreement number 262067, funded by the European Union seventh Framework Programme; The Netherlands Proteomics Centre, embedded in The Netherlands Genomics Initiative; The Netherlands Bioinformatics Centre; and the Centre for Biomedical Genetics (to S.C., B.B. and A.J.R.H); by NIH grants NCRR RR001614 and RR019934 (to the UCSF Mass Spectrometry Facility, director: A.L. Burlingame, P.B.); and by grants from the MRC, CR-UK, BBSRC and Barts and the London Charity (to P.C.

    A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has become one of the most used tools in mass spectrometry based proteomics. Various algorithms have since been developed to automate the process for modern high-throughput LC-MS/MS experiments.</p> <p>Results</p> <p>A probability based statistical scoring model for assessing peptide and protein matches in tandem MS database search was derived. The statistical scores in the model represent the probability that a peptide match is a random occurrence based on the number or the total abundance of matched product ions in the experimental spectrum. The model also calculates probability based scores to assess protein matches. Thus the protein scores in the model reflect the significance of protein matches and can be used to differentiate true from random protein matches.</p> <p>Conclusion</p> <p>The model is sensitive to high mass accuracy and implicitly takes mass accuracy into account during scoring. High mass accuracy will not only reduce false positives, but also improves the scores of true positive matches. The algorithm is incorporated in an automated database search program MassMatrix.</p

    Low density lipoprotein and liposome mediated uptake and cytotoxic effect of N4-octadecyl-1-β-D-arabinofuranosylcytosine in Daudi lymphoma cells

    Get PDF
    Low density lipoprotein (LDL) receptor-mediated uptake and cytotoxic effects of N4-octadecyl-1-beta-D-arabinofuranosylcytosine (NOAC) were studied in Daudi lymphoma cells. NOAC was either incorporated into LDL or liposomes to compare specific and unspecific uptake mechanisms. Binding of LDL to Daudi cells was not altered after NOAC incorporation (K(D) 60 nM). Binding of liposomal NOAC was not saturable with increasing concentrations. Specific binding of NOAC-LDL to Daudi cells was five times higher than to human lymphocytes. LDL receptor binding could be blocked and up- or down-regulated. Co-incubation with colchicine reduced NOAC-LDL uptake by 36%. These results suggested that NOAC-LDL is taken up via the LDL receptor pathway. In an in vitro cytotoxicity test, the IC50 of NOAC-LDL was about 160 microM, whereas with liposomal NOAC the IC50 was 40 microM. Blocking the LDL receptors with empty LDL protected 50% of the cells from NOAC cytotoxicity. The cellular distribution of NOAC-LDL or NOAC-liposomes differed only in the membrane and nuclei fraction with 13% and 6% respectively. Although it is more convenient to prepare NOAC-liposomes as compared to the loading of LDL particles with the drug, the receptor-mediated uptake of NOAC-LDL provides an interesting rationale for the specific delivery of the drug to tumours that express elevated numbers of LDL receptors

    Analysis of Bovine Viral Diarrhea Viruses-infected monocytes: identification of cytopathic and non-cytopathic biotype differences

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Bovine Viral Diarrhea Virus (BVDV) infection is widespread in cattle worldwide, causing important economic losses. Pathogenesis of the disease caused by BVDV is complex, as each BVDV strain has two biotypes: non-cytopathic (ncp) and cytopathic (cp). BVDV can cause a persistent latent infection and immune suppression if animals are infected with an ncp biotype during early gestation, followed by a subsequent infection of the cp biotype. The molecular mechanisms that underscore the complex disease etiology leading to immune suppression in cattle caused by BVDV are not well understood.</p> <p>Results</p> <p>Using proteomics, we evaluated the effect of cp and ncp BVDV infection of bovine monocytes to determine their role in viral immune suppression and uncontrolled inflammation. Proteins were isolated by differential detergent fractionation and identified by 2D-LC ESI MS/MS. We identified 137 and 228 significantly altered bovine proteins due to ncp and cp BVDV infection, respectively. Functional analysis of these proteins using the Gene Ontology (GO) showed multiple under- and over- represented GO functions in molecular function, biological process and cellular component between the two BVDV biotypes. Analysis of the top immunological pathways affected by BVDV infection revealed that pathways representing macropinocytosis signalling, virus entry via endocytic pathway, integrin signalling and primary immunodeficiency signalling were identified only in ncp BVDV-infected monocytes. In contrast, pathways like actin cytoskeleton signalling, RhoA signalling, clathrin-mediated endocytosis signalling and interferon signalling were identified only in cp BDVD-infected cells. Of the six common pathways involved in cp and ncp BVDV infection, acute phase response signalling was the most significant for both BVDV biotypes. Although, most shared altered host proteins between both BVDV biotypes showed the same type of change, integrin alpha 2b (ITGA2B) and integrin beta 3 (ITGB3) were down- regulated by ncp BVDV and up- regulated by cp BVDV infection.</p> <p>Conclusions</p> <p>This study shows that, as we expected, there are significant functional differences in the host proteins that respond to cp or ncp BVDV infection. The combined use of GO and systems biology network modelling facilitated a better understanding of host-pathogen interactions.</p

    LC-MSsim – a simulation software for liquid chromatography mass spectrometry data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Mass Spectrometry coupled to Liquid Chromatography (LC-MS) is commonly used to analyze the protein content of biological samples in large scale studies. The data resulting from an LC-MS experiment is huge, highly complex and noisy. Accordingly, it has sparked new developments in Bioinformatics, especially in the fields of algorithm development, statistics and software engineering. In a quantitative label-free mass spectrometry experiment, crucial steps are the detection of peptide features in the mass spectra and the alignment of samples by correcting for shifts in retention time. At the moment, it is difficult to compare the plethora of algorithms for these tasks. So far, curated benchmark data exists only for peptide identification algorithms but no data that represents a ground truth for the evaluation of feature detection, alignment and filtering algorithms.</p> <p>Results</p> <p>We present <it>LC-MSsim</it>, a simulation software for LC-ESI-MS experiments. It simulates ESI spectra on the MS level. It reads a list of proteins from a FASTA file and digests the protein mixture using a user-defined enzyme. The software creates an LC-MS data set using a predictor for the retention time of the peptides and a model for peak shapes and elution profiles of the mass spectral peaks. Our software also offers the possibility to add contaminants, to change the background noise level and includes a model for the detectability of peptides in mass spectra. After the simulation, <it>LC-MSsim </it>writes the simulated data to mzData, a public XML format. The software also stores the positions (monoisotopic m/z and retention time) and ion counts of the simulated ions in separate files.</p> <p>Conclusion</p> <p><it>LC-MSsim </it>generates simulated LC-MS data sets and incorporates models for peak shapes and contaminations. Algorithm developers can match the results of feature detection and alignment algorithms against the simulated ion lists and meaningful error rates can be computed. We anticipate that <it>LC-MSsim </it>will be useful to the wider community to perform benchmark studies and comparisons between computational tools.</p
    corecore