5,138 research outputs found

    Mining whole sample mass spectrometry proteomics data for biomarkers: an overview

    No full text
    In this paper we aim to provide a concise overview of designing and conducting an MS proteomics experiment in such a way as to allow statistical analysis that may lead to the discovery of novel biomarkers. We provide a summary of the various stages that make up such an experiment, highlighting the need for experimental goals to be decided upon in advance. We discuss issues in experimental design at the sample collection stage, and good practise for standardising protocols within the proteomics laboratory. We then describe approaches to the data mining stage of the experiment, including the processing steps that transform a raw mass spectrum into a useable form. We propose a permutation-based procedure for determining the significance of reported error rates. Finally, because of its general advantages in speed and cost, we suggest that MS proteomics may be a good candidate for an early primary screening approach to disease diagnosis, identifying areas of risk and making referrals for more specific tests without necessarily making a diagnosis in its own right. Our discussion is illustrated with examples drawn from experiments on bovine blood serum conducted in the Centre for Proteomic Research (CPR) at Southampton University

    Multicentric validation of proteomic biomarkers in urine specific for diabetic nephropathy

    Get PDF
    Background: Urine proteome analysis is rapidly emerging as a tool for diagnosis and prognosis in disease states. For diagnosis of diabetic nephropathy (DN), urinary proteome analysis was successfully applied in a pilot study. The validity of the previously established proteomic biomarkers with respect to the diagnostic and prognostic potential was assessed on a separate set of patients recruited at three different European centers. In this case-control study of 148 Caucasian patients with diabetes mellitus type 2 and duration >= 5 years, cases of DN were defined as albuminuria >300 mg/d and diabetic retinopathy (n = 66). Controls were matched for gender and diabetes duration (n = 82). Methodology/Principal Findings: Proteome analysis was performed blinded using high-resolution capillary electrophoresis coupled with mass spectrometry (CE-MS). Data were evaluated employing the previously developed model for DN. Upon unblinding, the model for DN showed 93.8% sensitivity and 91.4% specificity, with an AUC of 0.948 (95% CI 0.898-0.978). Of 65 previously identified peptides, 60 were significantly different between cases and controls of this study. In <10% of cases and controls classification by proteome analysis not entirely resulted in the expected clinical outcome. Analysis of patient's subsequent clinical course revealed later progression to DN in some of the false positive classified DN control patients. Conclusions: These data provide the first independent confirmation that profiling of the urinary proteome by CE-MS can adequately identify subjects with DN, supporting the generalizability of this approach. The data further establish urinary collagen fragments as biomarkers for diabetes-induced renal damage that may serve as earlier and more specific biomarkers than the currently used urinary albumin

    Informed baseline subtraction of proteomic mass spectrometry data aided by a novel sliding window algorithm

    Get PDF
    Proteomic matrix-assisted laser desorption/ionisation (MALDI) linear time-of-flight (TOF) mass spectrometry (MS) may be used to produce protein profiles from biological samples with the aim of discovering biomarkers for disease. However, the raw protein profiles suffer from several sources of bias or systematic variation which need to be removed via pre-processing before meaningful downstream analysis of the data can be undertaken. Baseline subtraction, an early pre-processing step that removes the non-peptide signal from the spectra, is complicated by the following: (i) each spectrum has, on average, wider peaks for peptides with higher mass-to-charge ratios (m/z), and (ii) the time-consuming and error-prone trial-and-error process for optimising the baseline subtraction input arguments. With reference to the aforementioned complications, we present an automated pipeline that includes (i) a novel `continuous' line segment algorithm that efficiently operates over data with a transformed m/z-axis to remove the relationship between peptide mass and peak width, and (ii) an input-free algorithm to estimate peak widths on the transformed m/z scale. The automated baseline subtraction method was deployed on six publicly available proteomic MS datasets using six different m/z-axis transformations. Optimality of the automated baseline subtraction pipeline was assessed quantitatively using the mean absolute scaled error (MASE) when compared to a gold-standard baseline subtracted signal. Near-optimal baseline subtraction was achieved using the automated pipeline. The advantages of the proposed pipeline include informed and data specific input arguments for baseline subtraction methods, the avoidance of time-intensive and subjective piecewise baseline subtraction, and the ability to automate baseline subtraction completely. Moreover, individual steps can be adopted as stand-alone routines.Comment: 50 pages, 19 figure

    Programmed cell death 6 interacting protein (PDCD6IP) and Rabenosyn-5 (ZFYVE20) are potential urinary biomarkers for upper gastrointestinal cancer

    Get PDF
    PURPOSE: Cancer of the upper digestive tract (uGI) is a major contributor to cancer-related death worldwide. Due to a rise in occurrence, together with poor survival rates and a lack of diagnostic or prognostic clinical assays, there is a clear need to establish molecular biomarkers. EXPERIMENTAL DESIGN: Initial assessment was performed on urine samples from 60 control and 60 uGI cancer patients using MS to establish a peak pattern or fingerprint model, which was validated by a further set of 59 samples. RESULTS: We detected 86 cluster peaks by MS above frequency and detection thresholds. Statistical testing and model building resulted in a peak profiling model of five relevant peaks with 88% overall sensitivity and 91% specificity, and overall correctness of 90%. High-resolution MS of 40 samples in the 2-10 kDa range resulted in 646 identified proteins, and pattern matching identified four of the five model peaks within significant parameters, namely programmed cell death 6 interacting protein (PDCD6IP/Alix/AIP1), Rabenosyn-5 (ZFYVE20), protein S100A8, and protein S100A9, of which the first two were validated by Western blotting. CONCLUSIONS AND CLINICAL RELEVANCE: We demonstrate that MS analysis of human urine can identify lead biomarker candidates in uGI cancers, which makes this technique potentially useful in defining and consolidating biomarker patterns for uGI cancer screening

    Proteomic-biostatistic integrated approach for finding the underlying molecular determinants of hypertension in human plasma

    Get PDF
    Despite advancements in lowering blood pressure, the best approach to lower it remains controversial because of the lack of information on the molecular basis of hypertension. We, therefore, performed plasma proteomics of plasma from patients with hypertension to identify molecular determinants detectable in these subjects but not in controls and vice versa. Plasma samples from hypertensive subjects (cases; n=118) and controls (n=85) from the InGenious HyperCare cohort were used for this study and performed mass spectrometric analysis. Using biostatistical methods, plasma peptides specific for hypertension were identified, and a model was developed using least absolute shrinkage and selection operator logistic regression. The underlying peptides were identified and sequenced off-line using matrix-assisted laser desorption ionization orbitrap mass spectrometry. By comparison of the molecular composition of the plasma samples, 27 molecular determinants were identified differently expressed in cases from controls. Seventy percent of the molecular determinants selected were found to occur less likely in hypertensive patients. In cross-validation, the overall R(2) was 0.434, and the area under the curve was 0.891 with 95% confidence interval 0.8482 to 0.9349, P<0.0001. The mean values of the cross-validated proteomic score of normotensive and hypertensive patients were found to be -2.007±0.3568 and 3.383±0.2643, respectively, P<0.0001. The molecular determinants were successfully identified, and the proteomic model developed shows an excellent discriminatory ability between hypertensives and normotensives. The identified molecular determinants may be the starting point for further studies to clarify the molecular causes of hypertension

    Integrative analysis of the colorectal cancer proteome : potential clinical impact

    Get PDF
    Peer reviewedPostprin

    Lumican is overexpressed in lung adenocarcinoma pleural effusions.

    Get PDF
    Adenocarcinoma (AdC) is the most common lung cancer subtype and is often associated with pleural effusion (PE). Its poor prognosis is attributable to diagnostic delay and lack of effective treatments and there is a pressing need in discovering new biomarkers for early diagnosis or targeted therapies. To date, little is known about lung AdC proteome. We investigated protein expression of lung AdC in PE using the isobaric Tags for Relative and Absolute Quantification (iTRAQ) approach to identify possible novel diagnostic/prognostic biomarkers. This provided the identification of 109 of lung AdC-related proteins. We further analyzed lumican, one of the overexpressed proteins, in 88 resected lung AdCs and in 23 malignant PE cell-blocks (13 lung AdCs and 10 non-lung cancers) using immunohistochemistry. In AdC surgical samples, lumican expression was low in cancer cells, whereas it was strong and diffuse in the stroma surrounding the tumor. However, lumican expression was not associated with tumor grade, stage, and vascular/pleural invasion. None of the lung cancer cell-blocks showed lumican immunoreaction, whereas those of all the other tumors were strongly positive. Finally, immunoblotting analysis showed lumican expression in both cell lysate and conditioned medium of a fibroblast culture but not in those of A549 lung cancer cell line. PE is a valid source of information for proteomic analysis without many of the restrictions of plasma. The high lumican levels characterizing AdC PEs are probably due to its release by the fibroblasts surrounding the tumor. Despite the role of lumican in lung AdC is still elusive, it could be of diagnostic value

    Addressing the needs of traumatic brain injury with clinical proteomics.

    Get PDF
    BackgroundNeurotrauma or injuries to the central nervous system (CNS) are a serious public health problem worldwide. Approximately 75% of all traumatic brain injuries (TBIs) are concussions or other mild TBI (mTBI) forms. Evaluation of concussion injury today is limited to an assessment of behavioral symptoms, often with delay and subject to motivation. Hence, there is an urgent need for an accurate chemical measure in biofluids to serve as a diagnostic tool for invisible brain wounds, to monitor severe patient trajectories, and to predict survival chances. Although a number of neurotrauma marker candidates have been reported, the broad spectrum of TBI limits the significance of small cohort studies. Specificity and sensitivity issues compound the development of a conclusive diagnostic assay, especially for concussion patients. Thus, the neurotrauma field currently has no diagnostic biofluid test in clinical use.ContentWe discuss the challenges of discovering new and validating identified neurotrauma marker candidates using proteomics-based strategies, including targeting, selection strategies and the application of mass spectrometry (MS) technologies and their potential impact to the neurotrauma field.SummaryMany studies use TBI marker candidates based on literature reports, yet progress in genomics and proteomics have started to provide neurotrauma protein profiles. Choosing meaningful marker candidates from such 'long lists' is still pending, as only few can be taken through the process of preclinical verification and large scale translational validation. Quantitative mass spectrometry targeting specific molecules rather than random sampling of the whole proteome, e.g., multiple reaction monitoring (MRM), offers an efficient and effective means to multiplex the measurement of several candidates in patient samples, thereby omitting the need for antibodies prior to clinical assay design. Sample preparation challenges specific to TBI are addressed. A tailored selection strategy combined with a multiplex screening approach is helping to arrive at diagnostically suitable candidates for clinical assay development. A surrogate marker test will be instrumental for critical decisions of TBI patient care and protection of concussion victims from repeated exposures that could result in lasting neurological deficits

    Potential protein biomarkers for burning mouth syndrome discovered by quantitative proteomics.

    Get PDF
    Burning mouth syndrome (BMS) is a chronic pain disorder characterized by severe burning sensation in normal looking oral mucosa. Diagnosis of BMS remains to be a challenge to oral healthcare professionals because the method for definite diagnosis is still uncertain. In this study, a quantitative saliva proteomic analysis was performed in order to identify target proteins in BMS patients' saliva that may be used as biomarkers for simple, non-invasive detection of the disease. By using isobaric tags for relative and absolute quantitation labeling and liquid chromatography-tandem mass spectrometry to quantify 1130 saliva proteins between BMS patients and healthy control subjects, we found that 50 proteins were significantly changed in the BMS patients when compared to the healthy control subjects ( p ≤ 0.05, 39 up-regulated and 11 down-regulated). Four candidates, alpha-enolase, interleukin-18 (IL-18), kallikrein-13 (KLK13), and cathepsin G, were selected for further validation. Based on enzyme-linked immunosorbent assay measurements, three potential biomarkers, alpha-enolase, IL-18, and KLK13, were successfully validated. The fold changes for alpha-enolase, IL-18, and KLK13 were determined as 3.6, 2.9, and 2.2 (burning mouth syndrome vs. control), and corresponding receiver operating characteristic values were determined as 0.78, 0.83, and 0.68, respectively. Our findings indicate that testing of the identified protein biomarkers in saliva might be a valuable clinical tool for BMS detection. Further validation studies of the identified biomarkers or additional candidate biomarkers are needed to achieve a multi-marker prediction model for improved detection of BMS with high sensitivity and specificity

    A proteomic atlas of senescence-associated secretomes for aging biomarker development.

    Get PDF
    The senescence-associated secretory phenotype (SASP) has recently emerged as a driver of and promising therapeutic target for multiple age-related conditions, ranging from neurodegeneration to cancer. The complexity of the SASP, typically assessed by a few dozen secreted proteins, has been greatly underestimated, and a small set of factors cannot explain the diverse phenotypes it produces in vivo. Here, we present the "SASP Atlas," a comprehensive proteomic database of soluble proteins and exosomal cargo SASP factors originating from multiple senescence inducers and cell types. Each profile consists of hundreds of largely distinct proteins but also includes a subset of proteins elevated in all SASPs. Our analyses identify several candidate biomarkers of cellular senescence that overlap with aging markers in human plasma, including Growth/differentiation factor 15 (GDF15), stanniocalcin 1 (STC1), and serine protease inhibitors (SERPINs), which significantly correlated with age in plasma from a human cohort, the Baltimore Longitudinal Study of Aging (BLSA). Our findings will facilitate the identification of proteins characteristic of senescence-associated phenotypes and catalog potential senescence biomarkers to assess the burden, originating stimulus, and tissue of origin of senescent cells in vivo
    • …
    corecore