48 research outputs found

    A parallel method for enumerating amino acid compositions and masses of all theoretical peptides

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Enumeration of all theoretically possible amino acid compositions is an important problem in several proteomics workflows, including peptide mass fingerprinting, mass defect labeling, mass defect filtering, and de novo peptide sequencing. Because of the high computational complexity of this task, reported methods for peptide enumeration were restricted to cover limited mass ranges (below 2 kDa). In addition, implementation details of these methods as well as their computational performance have not been provided. The increasing availability of parallel (multi-core) computers in all fields of research makes the development of parallel methods for peptide enumeration a timely topic.</p> <p>Results</p> <p>We describe a parallel method for enumerating all amino acid compositions up to a given length. We present recursive procedures which are at the core of the method, and show that a single task of enumeration of all peptide compositions can be divided into smaller subtasks that can be executed in parallel. The computational complexity of the subtasks is compared with the computational complexity of the whole task. Pseudocodes of processes (a master and workers) that are used to execute the enumerating procedure in parallel are given. We present computational times for our method executed on a computer cluster with 12 Intel Xeon X5650 CPUs (72 cores) running Windows HPC Server. Our method has been implemented as a 32- and 64-bit Windows application using Microsoft Visual C++ and the Message Passing Interface. It is available for download at <url>https://ispace.utmb.edu/users/rgsadygo/Proteomics/ParallelMethod</url>.</p> <p>Conclusion</p> <p>We describe implementation of a parallel method for generating mass distributions of all theoretically possible amino acid compositions.</p

    Altered Retinoic Acid Metabolism in Diabetic Mouse Kidney Identified by 18O Isotopic Labeling and 2D Mass Spectrometry

    Get PDF
    Numerous metabolic pathways have been implicated in diabetes-induced renal injury, yet few studies have utilized unbiased systems biology approaches for mapping the interconnectivity of diabetes-dysregulated proteins that are involved. We utilized a global, quantitative, differential proteomic approach to identify a novel retinoic acid hub in renal cortical protein networks dysregulated by type 2 diabetes.Total proteins were extracted from renal cortex of control and db/db mice at 20 weeks of age (after 12 weeks of hyperglycemia in the diabetic mice). Following trypsinization, (18)O- and (16)O-labeled control and diabetic peptides, respectively, were pooled and separated by two dimensional liquid chromatography (strong cation exchange creating 60 fractions further separated by nano-HPLC), followed by peptide identification and quantification using mass spectrometry. Proteomic analysis identified 53 proteins with fold change >or=1.5 and p<or=0.05 after Benjamini-Hochberg adjustment (out of 1,806 proteins identified), including alcohol dehydrogenase (ADH) and retinaldehyde dehydrogenase (RALDH1/ALDH1A1). Ingenuity Pathway Analysis identified altered retinoic acid as a key signaling hub that was altered in the diabetic renal cortical proteome. Western blotting and real-time PCR confirmed diabetes-induced upregulation of RALDH1, which was localized by immunofluorescence predominantly to the proximal tubule in the diabetic renal cortex, while PCR confirmed the downregulation of ADH identified with mass spectrometry. Despite increased renal cortical tissue levels of retinol and RALDH1 in db/db versus control mice, all-trans-retinoic acid was significantly decreased in association with a significant decrease in PPARbeta/delta mRNA.Our results indicate that retinoic acid metabolism is significantly dysregulated in diabetic kidneys, and suggest that a shift in all-trans-retinoic acid metabolism is a novel feature in type 2 diabetic renal disease. Our observations provide novel insights into potential links between altered lipid metabolism and other gene networks controlled by retinoic acid in the diabetic kidney, and demonstrate the utility of using systems biology to gain new insights into diabetic nephropathy

    MassIVE MSV000090148 - Heavy water labeled murine liver LC-MS data

    No full text

    Poisson Model To Generate Isotope Distribution for Biomolecules

    No full text
    We introduce a simplified computational algorithm for computing isotope distributions (relative abundances and masses) of biomolecules. The algorithm is based on Poisson approximation to binomial and multinomial distributions. It leads to a small number of arithmetic operations to compute isotope distributions of molecules. The approach uses three embedded loops to compute the isotope distributions, as compared with the eight embedded loops in exact calculations. The speed improvement is about 3-fold compared to the fast Fourier transformation-based isotope calculations, often termed as ultrafast isotope calculation. The approach naturally incorporates the determination of the masses of each molecular isotopomer. It is applicable to high mass accuracy and resolution mass spectrometry data. The application to tryptic peptides in a UniProt protein database revealed that the mass accuracy of the computed isotopomers is better than 1 ppm. Even better mass accuracy (below 1 ppm) is achievable when the method is paired with the exact calculations, which we term a hybrid approach. The algorithms have been implemented in a freely available C/C++ code

    Predicting the protein half-life in tissue from its cellular properties.

    No full text
    Protein half-life is an important feature of protein homeostasis (proteostasis). The increasing number of in vivo and in vitro studies using high throughput proteomics provide estimates of the protein half-lives in tissues and cells. However, protein half-lives in cells and tissues are different. Due to the resource requirements for researching tissues, more data is available from cellular studies than tissues. We have designed a multivariate linear model for predicting protein half-life in tissue from its cellular properties. Inputs to the model are cellular half-life, abundance, intrinsically disordered sequences, and transcriptional and translational rates. Before the modeling, we determined substructures in the data using the relative distance from the regression line of the protein half-lives in tissues and cells, identifying three separate clusters. The model was trained on and applied to predict protein half-lives from murine liver, brain and heart tissues. In each tissue type we observed similar prediction patterns of protein half-lives. We found that the model provides the best results when there is a strong correlation between tissue and cell culture protein half-lives. Additionally, we clustered the protein half-lives to determine variations in correlation coefficients between the protein half-lives in the tissue versus in cell culture. The clusters identify strongly and weakly correlated protein half-lives, further improves the overall prediction and identifies sub groupings which exhibit specific characteristics. The model described herein, is generalizable to other data sets and has been implemented in a freely available R code

    Software Tool for Visualization and Validation of Protein Turnover Rates Using Heavy Water Metabolic Labeling and LC-MS

    No full text
    Metabolic stable isotope labeling followed by liquid chromatography coupled with mass spectrometry (LC-MS) is a powerful tool for in vivo protein turnover studies of individual proteins on a large scale and with high throughput. Turnover rates of thousands of proteins from dozens of time course experiments are determined by data processing tools, which are essential components of the workflows for automated extraction of turnover rates. The development of sophisticated algorithms for estimating protein turnover has been emphasized. However, the visualization and annotation of the time series data are no less important. The visualization tools help to validate the quality of the model fits, their goodness-of-fit characteristics, mass spectral features of peptides, and consistency of peptide identifications, among others. Here, we describe a graphical user interface (GUI) to visualize the results from the protein turnover analysis tool, d2ome, which determines protein turnover rates from metabolic D2O labeling followed by LC-MS. We emphasize the specific features of the time series data and their visualization in the GUI. The time series data visualized by the GUI can be saved in JPEG format for storage and further dissemination

    Coefficients of the linear regression between the common liver protein half-life in the tissue and cell of each cluster (Fig 4).

    No full text
    <p>Coefficients of the linear regression between the common liver protein half-life in the tissue and cell of each cluster (<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0180428#pone.0180428.g004" target="_blank">Fig 4</a>).</p

    Uncommon protein half-life prediction with noise.

    No full text
    <p>Uncommon protein half-life prediction with noise.</p
    corecore