
    Machine Learning and Integrative Analysis of Biomedical Big Data.

    Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration both poses new computational challenges and exacerbates those associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance, and scalability issues.

    Pathway-Based Genomics Prediction using Generalized Elastic Net.

    We present a novel regularization scheme called the Generalized Elastic Net (GELnet) that incorporates gene pathway information into feature selection. The proposed formulation is applicable to a wide variety of problems in which the interpretation of predictive features using known molecular interactions is desired. The method naturally steers solutions toward sets of mechanistically interlinked genes. Using experiments on synthetic data, we demonstrate that pathway-guided results maintain, and often improve, the accuracy of predictors even in cases where the full gene network is unknown. We apply the method to predict the drug response of breast cancer cell lines. GELnet is able to reveal genetic determinants of sensitivity and resistance for several compounds. In particular, for an EGFR/HER2 inhibitor, it finds a possible trans-differentiation resistance mechanism missed by the corresponding pathway-agnostic approach.
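To make the idea of pathway-guided regularization concrete, the sketch below evaluates a network-guided elastic-net penalty of the form lam1 * sum_j d_j * |w_j| + (lam2 / 2) * w^T L w, where L is the graph Laplacian of a gene network. The abstract does not state the formula, so this form, the toy network, and every name used here (`laplacian_quadratic`, `network_penalty`, `lam1`, `lam2`, `d`) are assumptions for illustration and should not be read as the authors' implementation.

```python
def laplacian_quadratic(weights, edges):
    """Return w^T L w for an unweighted graph.

    For a graph Laplacian this equals the sum over edges of (w_i - w_j)^2,
    so no explicit matrix is needed.
    """
    return sum((weights[i] - weights[j]) ** 2 for i, j in edges)


def network_penalty(weights, edges, lam1=0.1, lam2=1.0, d=None):
    """Hypothetical network-guided elastic-net penalty.

    lam1 * sum_j d_j * |w_j|  +  (lam2 / 2) * w^T L w
    """
    if d is None:
        d = [1.0] * len(weights)  # uniform per-gene L1 weights (assumption)
    l1_term = sum(dj * abs(wj) for dj, wj in zip(d, weights))
    return lam1 * l1_term + 0.5 * lam2 * laplacian_quadratic(weights, edges)


# Toy network (assumption): four genes linked in a chain 0-1-2-3.
edges = [(0, 1), (1, 2), (2, 3)]

# Two models with the same L1 norm: one places its weight on adjacent genes,
# the other spreads it over genes that are not directly linked.
coherent = [1.0, 1.0, 0.0, 0.0]
scattered = [1.0, 0.0, 1.0, 0.0]

print(network_penalty(coherent, edges) < network_penalty(scattered, edges))
```

Because the quadratic term grows with the squared difference between weights on linked genes, a model that selects neighbouring genes is penalized less than one with the same L1 norm spread across unlinked genes. This is one way a penalty can steer solutions toward interlinked gene sets.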

    Pharmacometabolomic mapping of early biochemical changes induced by sertraline and placebo.

    In this study, we characterized early biochemical changes associated with sertraline and placebo administration and changes associated with a reduction in depressive symptoms in patients with major depressive disorder (MDD). MDD patients received sertraline or placebo in a double-blind 4-week trial; serum samples taken at baseline, 1 week, and 4 weeks were profiled using a gas chromatography time-of-flight mass spectrometry metabolomics platform. Intermediates of TCA and urea cycles, fatty acids and intermediates of lipid biosynthesis, amino acids, sugars and gut-derived metabolites were changed after 1 and 4 weeks of treatment. Some of the changes were common to the sertraline- and placebo-treated groups. Changes after 4 weeks of treatment in both groups were more extensive. Pathway analysis in the sertraline group suggested an effect of the drug on ABC and solute transporters, fatty acid receptors and transporters, G signaling molecules and regulation of lipid metabolism. Correlation between biochemical changes and treatment outcomes in the sertraline group suggested a strong association with changes in levels of branched chain amino acids (BCAAs): lower BCAA levels correlated with better treatment outcomes, and pathway analysis in this group revealed that methionine and tyrosine correlated with BCAAs. Lower levels of lactic acid, higher levels of TCA/urea cycle intermediates, and 3-hydroxybutanoic acid correlated with better treatment outcomes in the placebo group. Results of this study indicate that biochemical changes induced by the drug continue to evolve over 4 weeks of treatment, which might partially explain the delayed response. Response to drug and response to placebo share common pathways, but some pathways are more affected by drug treatment. BCAAs seem to be implicated in mechanisms of recovery from a depressed state following sertraline treatment.

    Application of pharmacogenomics and bioinformatics to exemplify the utility of human ex vivo organoculture models in the field of precision medicine

    Here we describe a collaboration between industry, the National Health Service (NHS) and academia that sought to demonstrate how early understanding of both pharmacology and genomics can improve strategies for the development of precision medicines. Diseased tissue ethically acquired from patients suffering from chronic obstructive pulmonary disease (COPD) was used to investigate inter-patient variability in drug efficacy using ex vivo organocultures of fresh lung tissue as the test system. The reduction in inflammatory cytokines in the presence of various test drugs was used as the measure of drug efficacy, and the individual patient responses were then matched against genotype and microRNA profiles in an attempt to identify unique predictors of drug responsiveness. Our findings suggest that genetic variation in CYP2E1 and SMAD3 genes may partly explain the observed variation in drug response.

    Identification of a selective G1-phase benzimidazolone inhibitor by a senescence-targeted virtual screen using artificial neural networks

    Cellular senescence is a barrier to tumorigenesis in normal cells, and tumour cells undergo senescence responses to genotoxic stimuli, which is a potential target phenotype for cancer therapy. However, in this setting, mixed-mode responses are common, with apoptosis the dominant effect. Hence, more selective senescence inducers are required. Here we report a machine learning-based in silico screen to identify potential senescence agonists. We built profiles of differentially affected biological process networks from expression data obtained under induced telomere dysfunction conditions in colorectal cancer cells and matched these to a panel of 17 protein targets with confirmatory screening data in PubChem. We trained a neural network using 3517 compounds identified as active or inactive against these targets. The resulting classification model was used to screen a virtual library of ~2M lead-like compounds. 147 virtual hits were acquired for validation in growth inhibition and senescence-associated β-galactosidase (SA-β-gal) assays. Among the hits, a benzimidazolone compound, CB-20903630, had low micromolar IC50 for growth inhibition of HCT116 cells and selectively induced SA-β-gal activity in the entire treated cell population without cytotoxicity or apoptosis induction. Growth suppression was mediated by G1 blockade involving increased p21 expression and suppressed cyclin B1, CDK1 and CDC25C. Additionally, the compound inhibited growth of multicellular spheroids and caused severe retardation of population kinetics in long-term treatments. Preliminary structure-activity and structure clustering analyses are reported, and expression analysis of CB-20903630 against other cell cycle suppressor compounds suggested a PI3K/AKT-inhibitor-like profile in normal cells, with different pathways affected in cancer cells.

    Spectral analysis of gene expression profiles using gene networks

    Microarrays have become extremely useful for analysing genetic phenomena, but establishing a relation between microarray analysis results (typically a list of genes) and their biological significance is often difficult. Currently, the standard approach is to map a posteriori the results onto gene networks to elucidate the functions perturbed at the level of pathways. However, integrating a priori knowledge of the gene networks could help in the statistical analysis of gene expression data and in their biological interpretation. Here we propose a method to integrate a priori the knowledge of a gene network in the analysis of gene expression data. The approach is based on the spectral decomposition of gene expression profiles with respect to the eigenfunctions of the graph, resulting in an attenuation of the high-frequency components of the expression profiles with respect to the topology of the graph. We show how to derive unsupervised and supervised classification algorithms of expression profiles, resulting in classifiers with biological relevance. We applied the method to the analysis of a set of expression profiles from irradiated and non-irradiated yeast strains. It performed at least as well as the usual classification but provided much more biologically relevant results and allowed a direct biological interpretation.
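The attenuation of high-frequency components on a gene network can be illustrated without computing eigenvectors. The sketch below is not the authors' algorithm: it swaps in an iterative low-pass filter, f <- f - alpha * L f, where L is the graph Laplacian. In the Laplacian eigenbasis, each step scales the component with eigenvalue lambda by (1 - alpha * lambda), so components that change sharply across edges shrink fastest. The toy network, the expression values, and all function names are assumptions made for illustration.

```python
def laplacian_apply(profile, edges):
    """Return L f for an unweighted, undirected graph given as an edge list."""
    out = [0.0] * len(profile)
    for i, j in edges:
        diff = profile[i] - profile[j]
        out[i] += diff
        out[j] -= diff
    return out


def roughness(profile, edges):
    """f^T L f: large when connected genes have very different expression."""
    return sum((profile[i] - profile[j]) ** 2 for i, j in edges)


def low_pass(profile, edges, steps=5):
    """Attenuate high-frequency components by repeating f <- f - alpha * L f.

    Assumes the graph has at least one edge.
    """
    degree = [0] * len(profile)
    for i, j in edges:
        degree[i] += 1
        degree[j] += 1
    # The largest Laplacian eigenvalue is at most 2 * max degree, so this
    # step size keeps every factor (1 - alpha * lambda) within [0, 1].
    alpha = 1.0 / (2 * max(degree))
    smoothed = list(profile)
    for _ in range(steps):
        lf = laplacian_apply(smoothed, edges)
        smoothed = [f - alpha * g for f, g in zip(smoothed, lf)]
    return smoothed


# Toy network (assumption): four genes in a chain, with an expression
# profile that alternates in sign between neighbours (high frequency).
edges = [(0, 1), (1, 2), (2, 3)]
profile = [1.0, -1.0, 1.0, -1.0]

smoothed = low_pass(profile, edges, steps=1)
print(roughness(profile, edges), roughness(smoothed, edges))
```

The filter leaves the constant (zero-frequency) component untouched, so the mean expression over a connected network is preserved while differences between neighbouring genes are damped. An explicit eigendecomposition, as the abstract describes, would allow each frequency to be weighted individually instead.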

    Using stratified medicine to understand, diagnose, and treat neuropathic pain

    Neuropathic pain (NeuP) is defined as pain arising from a lesion or disease of the somatosensory nervous system. NeuP is common, affecting approximately 6-8% of the general population, and current treatment is inadequate due to both poor drug efficacy and tolerability. Many different types of injury can cause neuropathic pain, including genetic (e.g. SCN9A gain of function variants), metabolic (e.g. diabetic polyneuropathy), infective (e.g. HIV associated neuropathy, hepatitis), traumatic and toxic (e.g. chemotherapy induced neuropathy) causes. Such injurious events can impact on anatomically distinct regions of the somatosensory nervous system ranging from the terminals of nociceptive afferents (in small fiber neuropathy) to the thalamus (in post-stroke pain). Classification of neuropathic pain using etiology and location remains an important aspect of routine clinical practice; however, pain medicine is coming to the realization that we need more precision in this classification. The hope is that improved classification will lead to better understanding of risk, prognosis and optimal treatment of NeuP.