236 research outputs found

    Proteomics Discovery of Disease Biomarkers

    Get PDF
    Recent technological developments in proteomics have shown promising initiatives in identifying novel biomarkers of various diseases. Such technologies are capable of investigating multiple samples and generating large amount of data end-points. Examples of two promising proteomics technologies are mass spectrometry, including an instrument based on surface enhanced laser desorption/ionization, and protein microarrays. Proteomics data must, however, undergo analytical processing using bioinformatics. Due to limitations in proteomics tools including shortcomings in bioinformatics analysis, predictive bioinformatics can be utilized as an alternative strategy prior to performing elaborate, high-throughput proteomics procedures. This review describes mass spectrometry, protein microarrays, and bioinformatics and their roles in biomarker discovery, and highlights the significance of integration between proteomics and bioinformatics

    Using Decision Forest to Classify Prostate Cancer Samples on the Basis of SELDI-TOF MS Data: Assessing Chance Correlation and Prediction Confidence

    Get PDF
    Class prediction using ā€œomicsā€ data is playing an increasing role in toxicogenomics, diagnosis/prognosis, and risk assessment. These data are usually noisy and represented by relatively few samples and a very large number of predictor variables (e.g., genes of DNA microarray data or m/z peaks of mass spectrometry data). These characteristics manifest the importance of assessing potential random correlation and overfitting of noise for a classification model based on omics data. We present a novel classification method, decision forest (DF), for class prediction using omics data. DF combines the results of multiple heterogeneous but comparable decision tree (DT) models to produce a consensus prediction. The method is less prone to overfitting of noise and chance correlation. A DF model was developed to predict presence of prostate cancer using a proteomic data set generated from surface-enhanced laser deposition/ionization time-of-flight mass spectrometry (SELDI-TOF MS). The degree of chance correlation and prediction confidence of the model was rigorously assessed by extensive cross-validation and randomization testing. Comparison of model prediction with imposed random correlation demonstrated biologic relevance of the model and the reduction of overfitting in DF. Furthermore, two confidence levels (high and low confidences) were assigned to each prediction, where most misclassifications were associated with the low-confidence region. For the high-confidence prediction, the model achieved 99.2% sensitivity and 98.2% specificity. The model also identified a list of significant peaks that could be useful for biomarker identification. DF should be equally applicable to other omics data such as gene expression data or metabolomic data. The DF algorithm is available upon request

    A biological approach to computational models of proteomic networks.

    Get PDF
    Computational modeling is useful as a means to assemble and test what we know about proteins and networks. Models can help address key questions about the measurement, definition and function of proteomic networks. Here, we place these biological questions at the forefront in reviewing the computational strategies that are available to analyze proteomic networks. Recent examples illustrate how models can extract more information from proteomic data, test possible interactions between network proteins and link networks to cellular behavior. No single model can achieve all these goals, however, which is why it is critical to prioritize biological questions before specifying a particular modeling approach. Introduction Our current understanding of the proteins, interactions and pathways that comprise signaling networks is detailed, yet it remains incomplete. Recent experimental techniques for unraveling intricate signaling networks have become increasingly quantitative and multiplex. New approaches are now needed to compile the existing quantitative biological knowledge and to maximize the information extracted from large-scale signaling and proteomic datasets. Computational models formalize a complex biological or experimental process mathematically, which can be useful for assembling and analyzing quantitative data. Modeling is thus critical for fields such as proteomics, genomics and systems biology. As a discipline, biology thrives on clarity through consensus (take, for instance, the central dogma). To model biological networks, however, we and others have argued against a consensus 'one size fits all' philosophy, favoring instead a spectrum of computational techniques Anchoring model sophistication with experimental data Proteomics research is clearly directed at uncovering more biological detail within networks -new proteins, new interactions, new complexes How does increasing the level of model detail decrease believability? With model detail come parameters. In a model, these parameters might define a signaling protein's starting concentration, rate of turnover or diffusivity through the cytoplasm. Model parameters are frequently unknown and must therefore be estimated from data, which reduces believability. Importantly, the number of required parameters multiplies as more biological detail is added Current Opinion in Chemical Biology Network-measurement models Modeling has played an increasingly important role in the measurement of proteomes and networks. Measurement models are a useful way to condense different methodological considerations into a single quantitative description of an experiment In the field of 'global' proteomics, no other experimental method has had as significant an impact as mass spectrometry (MS) Network definition 1a, 1b, 2b, 3c Current Opinion in Chemical Biology Two distinct perspectives on computational models of proteomic networks. sence of proteins in a complex starting mixture [11], which used prior MS measurements as training data to fit an initial frequency distribution of peptides whose assignments were correct and incorrect. The initial distributions were used as prior information in a model that calculates the probability of correct peptide assignment given an MS spectrum. Running the model through all of the spectra in an MS experiment generates a new distribution of (probably) correct and (probably) incorrect peptides, which can update the prior information for the next iteration through the model. In this way, the model learns the most likely peptide assignments from the spectra itself, with initialization provided by a high-quality training dataset. The resulting peptide probabilities can then be fed into downstream models that calculate protein assignments from a set of likely peptides Measurement models are also useful for analyzing data quality itself. Often, quality is synonymous with information [14 ] selected for high-quality proteomic data by calculating the intersection of large-scale phenotypic, transcriptional and interaction datasets in Caenorhabditis elegans. Using the overlap among the measured networks, the Gunsalus et al. model was shown to be enriched in proteins sharing common biological functions. Gaudet et al. [15 ] used the predictive ability of a model to quantify network information content directly from a proteomic measurement set. A key conclusion from this work was the importance of measurement combinations. Different types of assays (kinase activity assays, quantitative western blotting, etc.) used over a range of time points were critical to accurately predicting the response of cells treated with multiple experimental stimuli. As quantitative MS-based experiments evolve Network-definition models An important goal for computational models is to define mathematically the proteins and pathways that constitute a signaling network. Modeling strategies for addressing the question of network definition can be subdivided into two categories: reconstruction models, which build networks from previously reported mechanisms; and inference models, which deduce network structure from large-scale datasets Model class Current Opinion in Chemical Biology What can be learned from these complex models founded on highly parameterized systems of differential equations Many biological networks lack the in-depth mechanistic understanding needed for a plausible network reconstruction. With these networks, inferential modeling approaches can be used to suggest connections between molecules Network-function models Proteomic networks are important because they ultimately control cellular functions. Diverse extracellular stimuli converge upon a common intracellular network, which can mediate an array of cellular responses [38 ] used decision-tree modeling (Box 1) to characterize cell migration based on the phosphorylation levels of five key intracellular proteins. The resulting 'branches' of the decision tree identified the sequence of conditional molecular statements that best predicted low, medium or high cellular speed -for instance, IF extracellular-regulated kinase phosphorylation is low AND IF myosin light-chain phosphorylation is high THEN migration speed is high. It would be interesting to use this approach in larger networks while constraining decision-tree branchpoints based on the approximate positions of molecules in the network (first membrane transducers, then initiators, then effectors, etc.). Prediction of cellular functions can also be achieved more quantitatively by training models on measurements of the upstream signaling network. We have used partial least squares modeling to predict 12 measured apoptotic responses from 19 time-dependent signaling profiles [39 ]. This particular modeling approach calculates the most informative combinations of signals that together predict cellular functions. The combinations of stress, prodeath and prosurvival signals identified by the model were consistent with known mechanisms but could not have been predicted by inspection. Focusing on intracellular signals with recognized but complex roles in cell death thus allowed the model to identify new mechanisms of apoptosis control within the currently understood network. Very recently, we have found that this approach to modeling network function could effectively capture cytokine-induced apoptotic responses that differ between diverse cell types (K Miller-Jensen, KA Janes, DA Lauffenburger, unpublished data). This suggests that different cell types might share a common network that converts signals into cellular responses. If true, then reconstruction models Conclusions The measurement, definition and function of proteomic networks must be addressed in such a way that models complement experiment. These networks remain too unconstrained to study by mathematics alone yet have become too complex to understand completely by intuition. We believe that insights here will come about by approaching models through biological questions. In line with this view, our review has focused on less detailed models with strong foundations in data Update Recent work has provided new examples of how existing reconstruction models can be further explored and refined to aid biological discovery. Cheong et al. [53 ] used an NF-kB signaling mode

    Charge Transport Phenomena in Peptide Molecular Junctions

    Get PDF
    Inelastic electron tunneling spectroscopy (IETS) is a valuable in situ spectroscopic analysis technique that provides a direct portrait of the electron transport properties of a molecular species. In the past, IETS has been applied to small molecules. Using self-assembled nanoelectronic junctions, IETS was performed for the first time on a large polypeptide protein peptide in the phosphorylated and native form, yielding interpretable spectra. A reproducible 10-fold shift of the I/V characteristics of the peptide was observed upon phosphorylation. Phosphorylation can be utilized as a site-specific modification to alter peptide structure and thereby influence electron transport in peptide molecular junctions. It is envisioned that kinases and phosphatases may be used to create tunable systems for molecular electronics applications, such as biosensors and memory devices

    Systems analysis of the NCI-60 cancer cell lines by alignment of protein pathway activation modules with "-OMIC" data fields and therapeutic response signatures

    Get PDF
    The NCI-60 cell line set is likely the most molecularly profiled set of human tumor cell lines in the world. However, a critical missing component of previous analyses has been the inability to place the massive amounts of "-omic" data in the context of functional protein signaling networks, which often contain many of the drug targets for new targeted therapeutics. We used reverse-phase protein array (RPPA) analysis to measure the activation/phosphorylation state of 135 proteins, with a total analysis of nearly 200 key protein isoforms involved in cell proliferation, survival, migration, adhesion, etc., in all 60 cell lines. We aggregated the signaling data into biochemical modules of interconnected kinase substrates for 6 key cancer signaling pathways: AKT, mTOR, EGF receptor (EGFR), insulin-like growth factor-1 receptor (IGF-1R), integrin, and apoptosis signaling. The net activation state of these protein network modules was correlated to available individual protein, phosphoprotein, mutational, metabolomic, miRNA, transcriptional, and drug sensitivity data. Pathway activation mapping identified reproducible and distinct signaling cohorts that transcended organ-type distinctions. Direct correlations with the protein network modules involved largely protein phosphorylation data but we also identified direct correlations of signaling networks with metabolites, miRNA, and DNA data. The integration of protein activation measurements into biochemically interconnected modules provided a novel means to align the functional protein architecture with multiple "-omic" data sets and therapeutic response correlations. This approach may provide a deeper understanding of how cellular biochemistry defines therapeutic response. Such "-omic" portraits could inform rational anticancer agent screenings and drive personalized therapeutic approaches. ƂĀ© 2013 American Association for Cancer Research

    Phosphoproteomic Landscaping Identifies Non-canonical cKIT Signaling in Polycythemia Vera Erythroid Progenitors

    Get PDF
    Although stem cell factor (SCF)/cKIT interaction plays key functions in erythropoiesis, cKIT signaling in human erythroid cells is still poorly defined. To provide new insights into cKIT-mediated erythroid expansion in development and disease, we performed phosphoproteomic profiling of primary erythroid progenitors from adult blood (AB), cord blood (CB), and Polycythemia Vera (PV) at steady-state and upon SCF stimulation. While AB and CB, respectively, activated transient or sustained canonical cKIT-signaling, PV showed a non-canonical signaling including increased mTOR and ERK1 and decreased DEPTOR. Accordingly, screening of FDA-approved compounds showed increased PV sensitivity to JAK, cKIT, and MEK inhibitors. Moreover, differently from AB and CB, in PV the mature 145kDa-cKIT constitutively associated with the tetraspanin CD63 and was not endocytosed upon SCF stimulation, contributing to unrestrained cKIT signaling. These results identify a clinically exploitable variegation of cKIT signaling/metabolism that may contribute to the great erythroid output occurring during development and in PV
    • ā€¦
    corecore