    Computational Models for Transplant Biomarker Discovery

    Translational medicine holds rich promise for improved diagnostics and drug discovery in transplantation research, where unmet diagnostic and therapeutic needs persist. The recent advent of genomic and proteomic profiling, collectively called "omics", provides new resources for developing novel biomarkers for routine clinical use. Establishing such a marker system depends heavily on the appropriate application of computational algorithms and software, which are built on mathematical theories and models. Understanding these theories helps in choosing appropriate algorithms and thus in building successful biomarker systems. Here, we review the key advances in theories and mathematical models relevant to transplant biomarker development. The advantages and limitations inherent in these models are discussed. The principles of key computational approaches for efficiently selecting the best subset of biomarkers from high-dimensional omics data are highlighted. Prediction models are also introduced, and the integration of multi-microarray data is discussed. Appreciating these key advances would help to accelerate the development of clinically reliable biomarker systems.

    Integrative Analysis To Select Cancer Candidate Biomarkers To Targeted Validation

    Targeted proteomics has flourished as the method of choice for prospecting for and validating potential candidate biomarkers in many diseases. However, challenges remain due to the lack of standardized routines that can prioritize a limited number of proteins for further validation in human samples. To help researchers identify candidate biomarkers that best characterize their samples under study, a well-designed integrative analysis pipeline, comprising MS-based discovery, feature selection methods, clustering techniques, bioinformatic analyses and targeted approaches, was performed using discovery-based proteomic data from the secretomes of three classes of human cell lines (carcinoma, melanoma and non-cancerous). Three feature selection algorithms, namely Beta-binomial, Nearest Shrunken Centroids (NSC), and Support Vector Machine-Recursive Features Elimination (SVM-RFE), indicated a panel of 137 candidate biomarkers for carcinoma and 271 for melanoma, which were differentially abundant between the tumor classes. We further tested the strength of the pipeline in selecting candidate biomarkers by immunoblotting, human tissue microarrays, label-free targeted MS and functional experiments. In conclusion, the proposed integrative analysis was able to pre-qualify and prioritize candidate biomarkers from discovery-based proteomics to targeted MS.

    Proteomics in cardiovascular disease: recent progress and clinical implication and implementation

    Introduction: Although multiple efforts have been initiated to shed light on the molecular mechanisms underlying cardiovascular disease, it remains one of the major causes of death worldwide. Proteomic approaches are powerful tools that may provide a deeper understanding of the molecular mechanisms associated with cardiovascular disease and improve its management. Areas covered: Cardiovascular proteomics is an emerging field, and significant progress has been made during the past few years with the aim of defining novel candidate biomarkers and obtaining insight into molecular pathophysiology. To summarize the recent progress in the field, a literature search was conducted in PubMed and Web of Science. As a result, 704 studies from PubMed and 320 studies from Web of Science were retrieved. Findings from original research articles using proteomics technologies for the discovery of biomarkers for cardiovascular disease in humans are summarized in this review. Expert commentary: Proteins associated with cardiovascular disease represent pathways in inflammation, wound healing and coagulation, proteolysis and extracellular matrix organization, and handling of cholesterol and LDL. Future research in the field should aim to increase proteome coverage and to integrate proteomics with other omics data to facilitate both drug development and clinical implementation of findings.

    Integrative analysis of multi-omics data reveals links between human diseases and the gut microbiota

    The gut microbiota plays a critical role in human diseases, including type 2 diabetes (T2D) and osteoporosis. In particular, probiotics have been suggested as potential intervention strategies for improving human health. This thesis focuses on elucidating the interrelationships between the gut microbiota, probiotics and human diseases by integrative analysis of plasma metabolomics and gut metagenomics, using machine learning (ML) and genome-scale metabolic models (GEMs). This work is structured into two parts, each a systematic investigation of: (I) associations between the gut microbiota and T2D, (II) the effects of the probiotic Lactobacillus reuteri ATCC PTA 6475 on bone metabolism of the elderly. For the first part, a derivative of phenylalanine was identified as a potential link between the gut microbiota and T2D. It was associated with insulin resistance and might contribute to the metabolic imbalance of (pre)diabetes. By performing a systematic analysis of four metagenomic datasets, several short-chain fatty acid (SCFA)-producing bacteria and metabolic reactions were consistently identified as important for predicting T2D status across different studies. For the second part, this work revealed that supplementation with L. reuteri ATCC PTA 6475 prevented detrimental alterations in the metabolism of both the gut microbiota and the elderly and increased the microbial gene richness, which might link the beneficial effects of probiotic L. reuteri ATCC PTA 6475 to bone metabolism. In addition, it was demonstrated that ML and GEMs have the potential to identify key disease-related metabolic signatures of a single L. reuteri strain, the entire gut microbiota, or the human host, based on the metabolomics and metagenomics data. Taken together, this work provides novel insights into links between the gut microbiota and human diseases, as well as the positive effects of L. reuteri ATCC PTA 6475 on bone metabolism, by integrating omics data using ML and GEMs.

    The promise of multi-omics approaches to discover biological alterations with clinical relevance in Alzheimer's disease

    Beyond the core features of Alzheimer's disease (AD) pathology, i.e. amyloid pathology, tau-related neurodegeneration and microglia response, multiple other molecular alterations and pathway dysregulations have been observed in AD. However, their inter-individual variations, complex interactions and relevance for clinical manifestation and disease progression remain poorly understood. Heterogeneity at both pathophysiological and clinical levels complicates diagnosis, prognosis, treatment, and drug design and testing. High-throughput "omics" comprise unbiased and untargeted data-driven methods which allow the exploration of a wide spectrum of disease-related changes at different endophenotype levels without focussing a priori on specific molecular pathways or molecules. Crucially, new methodological and statistical advances now allow for the integrative analysis of data resulting from multiple and different omics methods. These multi-omics approaches offer the unique advantage of providing a more comprehensive characterisation of the AD endophenotype and of capturing molecular signatures and interactions spanning various biological levels. These new insights can then help decipher disease mechanisms more deeply. In this review, we describe the different multi-omics tools and approaches currently available and how they have been applied in AD research so far. We discuss how multi-omics can be used to explore molecular alterations related to core features of the AD pathologies and how they interact with comorbid pathological alterations. We further discuss whether the identified pathophysiological changes are relevant for the clinical manifestation of AD, in terms of both cognitive impairment and neuropsychiatric symptoms, and for clinical disease progression over time. Finally, we address the opportunities for multi-omics approaches to help discover novel biomarkers for diagnosis and monitoring of relevant pathophysiological processes, along with personalised intervention strategies in AD.

    Machine Learning and Integrative Analysis of Biomedical Big Data

    Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) are analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges and exacerbates those associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: the curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues.
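
    Of the five challenges listed above, class imbalance is perhaps the simplest to illustrate. The following is a minimal sketch, not a method taken from the review: it assumes the common inverse-frequency heuristic, in which each class is weighted by n_samples / (n_classes * class_count), so that rare classes contribute as much to a weighted loss as frequent ones. The function name balanced_class_weights and the toy labels are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return a weight per class, inversely proportional to its frequency.

    Assumed heuristic (not from the review):
        weight = n_samples / (n_classes * class_count)
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


# Hypothetical toy cohort: 8 controls and 2 cases.
labels = ["control"] * 8 + ["case"] * 2
weights = balanced_class_weights(labels)
```

    With these toy labels, the minority class "case" receives a weight four times that of "control", so the two classes carry equal total weight. Such weights could be passed to any learner that accepts per-class or per-sample weights; resampling is a common alternative.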