48 research outputs found

    Using a spike-in experiment to evaluate analysis of LC-MS data

    Get PDF
    BACKGROUND: Recent advances in liquid chromatography-mass spectrometry (LC-MS) technology have led to more effective approaches for measuring changes in peptide/protein abundances in biological samples. Label-free LC-MS methods have been used for extraction of quantitative information and for detection of differentially abundant peptides/proteins. However, difference detection by analysis of data derived from label-free LC-MS methods requires various preprocessing steps including filtering, baseline correction, peak detection, alignment, and normalization. Although several specialized tools have been developed to analyze LC-MS data, determining the most appropriate computational pipeline remains challenging partly due to lack of established gold standards. RESULTS: The work in this paper is an initial study to develop a simple model with "presence" or "absence" condition using spike-in experiments and to be able to identify these "true differences" using available software tools. In addition to the preprocessing pipelines, choosing appropriate statistical tests and determining critical values are important. We observe that individual statistical tests could lead to different results due to different assumptions and employed metrics. It is therefore preferable to incorporate several statistical tests for either exploration or confirmation purpose. CONCLUSIONS: The LC-MS data from our spike-in experiment can be used for developing and optimizing LC-MS data preprocessing algorithms and to evaluate workflows implemented in existing software tools. Our current work is a stepping stone towards optimizing LC-MS data acquisition and testing the accuracy and validity of computational tools for difference detection in future studies that will be focused on spiking peptides of diverse physicochemical properties in different concentrations to better represent biomarker discovery of differentially abundant peptides/proteins

    Pathway and Network Approaches for Identification of Cancer Signature Markers from Omics Data

    Full text link
    The advancement of high throughput omic technologies during the past few years has made it possible to perform many complex assays in a much shorter time than the traditional approaches. The rapid accumulation and wide availability of omic data generated by these technologies offer great opportunities to unravel disease mechanisms, but also presents significant challenges to extract knowledge from such massive data and to evaluate the findings. To address these challenges, a number of pathway and network based approaches have been introduced. This review article evaluates these methods and discusses their application in cancer biomarker discovery using hepatocellular carcinoma (HCC) as an example

    A Bayesian two-step integrative procedure incorporating prior knowledge for the identification of miRNA-mRNAs involved in hepatocellular carcinoma

    No full text
    Recent studies have confirmed the role of miRNA regulation of gene expression in oncogenesis for various cancers. In parallel, prior knowledge about relationships between miRNA and mRNA have been accumulated from biological experiments or statistical analyses. Improved identification of disease-associated miRNA-mRNA pairs may be achieved by incorporating prior knowledge into integrative genomic analyses. In this study we focus on 39 patients with hepatocellular carcinoma (HCC) and 25 patients with liver cirrhosis and use a flexible Bayesian two-step integrative method. We found 66 significant miRNA-mRNA pairs, several of which contain molecules that have previously been identified as potential biomarkers. These results demonstrate the utility of the proposed approach in providing a better understanding of relationships between different biological levels, thereby giving insights into the biological mechanisms underlying the diseases, while providing a better selection of biomarkers that may serve as diagnostic, prognostic, or therapeutic biomarker candidates

    Using a spike-in experiment to evaluate analysis of LC-MS data

    No full text
    Abstract Background Recent advances in liquid chromatography-mass spectrometry (LC-MS) technology have led to more effective approaches for measuring changes in peptide/protein abundances in biological samples. Label-free LC-MS methods have been used for extraction of quantitative information and for detection of differentially abundant peptides/proteins. However, difference detection by analysis of data derived from label-free LC-MS methods requires various preprocessing steps including filtering, baseline correction, peak detection, alignment, and normalization. Although several specialized tools have been developed to analyze LC-MS data, determining the most appropriate computational pipeline remains challenging partly due to lack of established gold standards. Results The work in this paper is an initial study to develop a simple model with "presence" or "absence" condition using spike-in experiments and to be able to identify these "true differences" using available software tools. In addition to the preprocessing pipelines, choosing appropriate statistical tests and determining critical values are important. We observe that individual statistical tests could lead to different results due to different assumptions and employed metrics. It is therefore preferable to incorporate several statistical tests for either exploration or confirmation purpose. Conclusions The LC-MS data from our spike-in experiment can be used for developing and optimizing LC-MS data preprocessing algorithms and to evaluate workflows implemented in existing software tools. Our current work is a stepping stone towards optimizing LC-MS data acquisition and testing the accuracy and validity of computational tools for difference detection in future studies that will be focused on spiking peptides of diverse physicochemical properties in different concentrations to better represent biomarker discovery of differentially abundant peptides/proteins.</p

    Convolutional Neural Network-Based Compound Fingerprint Prediction for Metabolite Annotation

    No full text
    Metabolite annotation has been a challenging issue especially in untargeted metabolomics studies by liquid chromatography coupled with mass spectrometry (LC-MS). This is in part due to the limitations of publicly available spectral libraries, which consist of tandem mass spectrometry (MS/MS) data acquired from just a fraction of known metabolites. Machine learning provides the opportunity to predict molecular fingerprints based on MS/MS data. The predicted molecular fingerprints can then be used to help rank putative metabolite IDs obtained by using either the precursor mass or the formula of the unknown metabolite. This method is particularly useful to help annotate metabolites whose corresponding MS/MS spectra are missing or cannot be matched with those in accessible spectral libraries. We investigated a convolutional neural network (CNN) for molecular fingerprint prediction based on data acquired by MS/MS. We used more than 680,000 MS/MS spectra obtained from the MoNA repository and NIST 20, representing about 36,000 compounds for training and testing our CNN model. The trained CNN model is implemented as a python package, MetFID. The package is available on GitHub for users to enter their MS/MS spectra and corresponding putative metabolite IDs to obtain ranked lists of metabolites. Better performance is achieved by MetFID in ranking putative metabolite IDs using the CASMI 2016 benchmark dataset compared to two other machine learning-based tools (CSI:FingerID and ChemDistiller)

    Acquisition of estrogen independence induces TOB1-related mechanisms supporting breast cancer cell proliferation.

    No full text
    Resistance to therapies targeting the estrogen pathway remains a challenge in the treatment of estrogen receptor-positive breast cancer. To address this challenge, a systems biology approach was used. A library of small interfering RNAs targeting an estrogen receptor (ER)- and aromatase-centered network identified 46 genes that are dispensable in estrogen-dependent MCF7 cells, but are selectively required for the survival of estrogen-independent MCF7-derived cells and multiple additional estrogen-independent breast cancer cell lines. Integration of this information identified a tumor suppressor gene TOB1 as a critical determinant of estrogen-independent ER-positive breast cell survival. Depletion of TOB1 selectively promoted G1 phase arrest and sensitivity to AKT and mammalian target of rapmycin (mTOR) inhibitors in estrogen-independent cells but not in estrogen-dependent cells. Phosphoproteomic profiles from reverse-phase protein array analysis supported by mRNA profiling identified a significant signaling network reprogramming by TOB1 that differed in estrogen-sensitive and estrogen-resistant cell lines. These data support a novel function for TOB1 in mediating survival of estrogen-independent breast cancers. These studies also provide evidence for combining TOB1 inhibition and AKT/mTOR inhibition as a therapeutic strategy, with potential translational significance for the management of patients with ER-positive breast cancers
    corecore