24 research outputs found

    Stable Feature Selection for Biomarker Discovery

    Full text link
    Feature selection techniques have been used as the workhorse in biomarker discovery applications for a long time. Surprisingly, the stability of feature selection with respect to sampling variations has long been under-considered. It is only until recently that this issue has received more and more attention. In this article, we review existing stable feature selection methods for biomarker discovery using a generic hierarchal framework. We have two objectives: (1) providing an overview on this new yet fast growing topic for a convenient reference; (2) categorizing existing methods under an expandable framework for future research and development

    Shotgun Proteomics of Isolated Urinary Extracellular Vesicles for Investigating Respiratory Impedance in Healthy Preschoolers

    Get PDF
    Urine proteomic applications in children suggested their potential in discriminating between healthy subjects from those with respiratory diseases. The aim of the current study was to combine protein fractionation, by urinary extracellular vesicle isolation, and proteomics analysis in order to establish whether different patterns of respiratory impedance in healthy preschoolers can be characterized from a protein fingerprint. Twenty-one 3-5-yr-old healthy children, representative of 66 recruited subjects, were selected: 12 late preterm (LP) and 9 full-term (T) born. Children underwent measurement of respiratory impedance through Forced Oscillation Technique (FOT) and no significant differences between LP and T were found. Unbiased clustering, based on proteomic signatures, stratified three groups of children (A, B, C) with significantly different patterns of respiratory impedance, which was slightly worse in group A than in groups B and C. Six proteins (Tripeptidyl peptidase I (TPP1), Cubilin (CUBN), SerpinA4, SerpinF1, Thy-1 membrane glycoprotein (THY1) and Angiopoietin-related protein 2 (ANGPTL2)) were identified in order to type the membership of subjects to the three groups. The differential levels of the six proteins in groups A, B and C suggest that proteomic-based profiles of urinary fractionated exosomes could represent a link between respiratory impedance and underlying biological profiles in healthy preschool children

    Multivariate meta-analysis of proteomics data from human prostate and colon tumours

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>There is a vast need to find clinically applicable protein biomarkers as support in cancer diagnosis and tumour classification. In proteomics research, a number of methods can be used to obtain systemic information on protein and pathway level on cells and tissues. One fundamental tool in analysing protein expression has been two-dimensional gel electrophoresis (2DE). Several cancer 2DE studies have reported partially redundant lists of differently expressed proteins. To be able to further extract valuable information from existing 2DE data, the power of a multivariate meta-analysis will be evaluated in this work.</p> <p>Results</p> <p>We here demonstrate a multivariate meta-analysis of 2DE proteomics data from human prostate and colon tumours. We developed a bioinformatic workflow for identifying common patterns over two tumour types. This included dealing with pre-processing of data and handling of missing values followed by the development of a multivariate Partial Least Squares (PLS) model for prediction and variable selection. The variable selection was based on the variables performance in the PLS model in combination with stability in the validation. The PLS model development and variable selection was rigorously evaluated using a double cross-validation scheme. The most stable variables from a bootstrap validation gave a mean prediction success of 93% when predicting left out test sets on models discriminating between normal and tumour tissue, common for the two tumour types. The analysis conducted in this study identified 14 proteins with a common trend between the tumour types prostate and colon, i.e. the same expression profile between normal and tumour samples.</p> <p>Conclusions</p> <p>The workflow for meta-analysis developed in this study enabled the finding of a common protein profile for two malign tumour types, which was not possible to identify when analysing the data sets separately.</p

    Computational diagnosis and risk evaluation for canine lymphoma

    Full text link
    The canine lymphoma blood test detects the levels of two biomarkers, the acute phase proteins (C-Reactive Protein and Haptoglobin). This test can be used for diagnostics, for screening, and for remission monitoring as well. We analyze clinical data, test various machine learning methods and select the best approach to these problems. Three family of methods, decision trees, kNN (including advanced and adaptive kNN) and probability density evaluation with radial basis functions, are used for classification and risk estimation. Several pre-processing approaches were implemented and compared. The best of them are used to create the diagnostic system. For the differential diagnosis the best solution gives the sensitivity and specificity of 83.5% and 77%, respectively (using three input features, CRP, Haptoglobin and standard clinical symptom). For the screening task, the decision tree method provides the best result, with sensitivity and specificity of 81.4% and >99%, respectively (using the same input features). If the clinical symptoms (Lymphadenopathy) are considered as unknown then a decision tree with CRP and Hapt only provides sensitivity 69% and specificity 83.5%. The lymphoma risk evaluation problem is formulated and solved. The best models are selected as the system for computational lymphoma diagnosis and evaluation the risk of lymphoma as well. These methods are implemented into a special web-accessed software and are applied to problem of monitoring dogs with lymphoma after treatment. It detects recurrence of lymphoma up to two months prior to the appearance of clinical signs. The risk map visualisation provides a friendly tool for explanatory data analysis.Comment: 24 pages, 86 references in the bibliography, Significantly extended version with review of lymphoma biomarkers and data mining methods (Three new sections are added: 1.1. Biomarkers for canine lymphoma, 1.2. Acute phase proteins as lymphoma biomarkers and 3.1. Data mining methods for biomarker cancer diagnosis. Flowcharts of data analysis are included as supplementary material (20 pages

    Blood-based biomarkers for Alzheimer disease: mapping the road to the clinic.

    Get PDF
    Biomarker discovery and development for clinical research, diagnostics and therapy monitoring in clinical trials have advanced rapidly in key areas of medicine - most notably, oncology and cardiovascular diseases - allowing rapid early detection and supporting the evolution of biomarker-guided, precision-medicine-based targeted therapies. In Alzheimer disease (AD), breakthroughs in biomarker identification and validation include cerebrospinal fluid and PET markers of amyloid-β and tau proteins, which are highly accurate in detecting the presence of AD-associated pathophysiological and neuropathological changes. However, the high cost, insufficient accessibility and/or invasiveness of these assays limit their use as viable first-line tools for detecting patterns of pathophysiology. Therefore, a multistage, tiered approach is needed, prioritizing development of an initial screen to exclude from these tests the high numbers of people with cognitive deficits who do not demonstrate evidence of underlying AD pathophysiology. This Review summarizes the efforts of an international working group that aimed to survey the current landscape of blood-based AD biomarkers and outlines operational steps for an effective academic-industry co-development pathway from identification and assay development to validation for clinical use.I recieved an honorarium from Roche Diagnostics for my participation in the advisory panel meeting leading to this pape
    corecore