8 research outputs found

    Protein domain-based prediction of drug/compound–target interactions and experimental validation on LIM kinases

    Get PDF
    Predictive approaches such as virtual screening have been used in drug discovery with the objective of reducing developmental time and costs. Current machine learning and network-based approaches have issues related to generalization, usability, or model interpretability, especially due to the complexity of target proteins’ structure/function, and bias in system training datasets. Here, we propose a new method “DRUIDom” (DRUg Interacting Domain prediction) to identify bio-interactions between drug candidate compounds and targets by utilizing the domain modularity of proteins, to overcome problems associated with current approaches. DRUIDom is composed of two methodological steps. First, ligands/compounds are statistically mapped to structural domains of their target proteins, with the aim of identifying their interactions. As such, other proteins containing the same mapped domain or domain pair become new candidate targets for the corresponding compounds. Next, a million-scale dataset of small molecule compounds, including those mapped to domains in the previous step, are clustered based on their molecular similarities, and their domain associations are propagated to other compounds within the same clusters. Experimentally verified bioactivity data points, obtained from public databases, are meticulously filtered to construct datasets of active/interacting and inactive/non-interacting drug/compound–target pairs (~2.9M data points), and used as training data for calculating parameters of compound–domain mappings, which led to 27,032 high-confidence associations between 250 domains and 8,165 compounds, and a finalized output of ~5 million new compound–protein interactions. DRUIDom is experimentally validated by syntheses and bioactivity analyses of compounds predicted to target LIM-kinase proteins, which play critical roles in the regulation of cell motility, cell cycle progression, and differentiation through actin filament dynamics. We showed that LIMK-inhibitor-2 and its derivatives significantly block the cancer cell migration through inhibition of LIMK phosphorylation and the downstream protein cofilin. One of the derivative compounds (LIMKi-2d) was identified as a promising candidate due to its action on resistant Mahlavu liver cancer cells. The results demonstrated that DRUIDom can be exploited to identify drug candidate compounds for intended targets and to predict new target proteins based on the defined compound–domain relationships. Datasets, results, and the source code of DRUIDom are fully-available at: https://github.com/cansyl/DRUIDom

    CROssBAR: comprehensive resource of biomedical relations with knowledge graph representations

    Get PDF
    Systemic analysis of available large-scale biological/biomedical data is critical for studying biological mechanisms, and developing novel and effective treatment approaches against diseases. However, different layers of the available data are produced using different technologies and scattered across individual computational resources without any explicit connections to each other, which hinders extensive and integrative multi-omics-based analysis. We aimed to address this issue by developing a new data integration/representation methodology and its application by constructing a biological data resource. CROssBAR is a comprehensive system that integrates large-scale biological/biomedical data from various resources and stores them in a NoSQL database. CROssBAR is enriched with the deep-learning-based prediction of relationships between numerous data entries, which is followed by the rigorous analysis of the enriched data to obtain biologically meaningful modules. These complex sets of entities and relationships are displayed to users via easy-tointerpret, interactive knowledge graphs within an open-access service. CROssBAR knowledge graphs incorporate relevant genes-proteins, molecular interactions, pathways, phenotypes, diseases, as well as known/predicted drugs and bioactive compounds, and they are constructed on-the-fly based on simple non-programmatic user queries. These intensely processed heterogeneous networks are expected to aid systems-level research, especially to infer biological mechanisms in relation to genes, proteins, their ligands, and diseases

    Crowdsourced mapping of unexplored target space of kinase inhibitors

    Get PDF
    Despite decades of intensive search for compounds that modulate the activity of particular protein targets, a large proportion of the human kinome remains as yet undrugged. Effective approaches are therefore required to map the massive space of unexplored compound-kinase interactions for novel and potent activities. Here, we carry out a crowdsourced benchmarking of predictive algorithms for kinase inhibitor potencies across multiple kinase families tested on unpublished bioactivity data. We find the top-performing predictions are based on various models, including kernel learning, gradient boosting and deep learning, and their ensemble leads to a predictive accuracy exceeding that of single-dose kinase activity assays. We design experiments based on the model predictions and identify unexpected activities even for under-studied kinases, thereby accelerating experimental mapping efforts. The open-source prediction algorithms together with the bioactivities between 95 compounds and 295 kinases provide a resource for benchmarking prediction algorithms and for extending the druggable kinome. The IDG-DREAM Challenge carried out crowdsourced benchmarking of predictive algorithms for kinase inhibitor activities on unpublished data. This study provides a resource to compare emerging algorithms and prioritize new kinase activities to accelerate drug discovery and repurposing efforts

    Anti-Cancer Effects of Clofazimine As a Single Agent and in Combination with Cisplatin in Multiple Myeloma

    No full text
    Multiple Myeloma (MM) is a malignant neoplasm of bone marrow plasma B cells with high morbidity. Clofazimine (CLF) is an FDA-approved leprostatic, anti-tuberculosis, and anti-inflammatory drug that was previously shown to have growth suppression effect on various cancer types such as hepatocellular, lung, cervix, esophageal, colon, and breast cancer as well as melanoma, neuroblastoma, and leukemia. The objective of this study was to evaluate the anticancer effect and mechanism of CLF on U266 MM cell line. CLF (10μM, 24h) treatment resulted up to 72% growth suppression on a panel of hematological cell lines. Dose-response study conducted on U266 MM cell line revealed an IC50 value of 9.8±0.7μM. CLF also showed a synergistic inhibition effect in combination with cisplatin. In mechanistic assays, CLF treatment caused mitochondrial membrane depolarization, change in cell membrane asymmetry and increase in caspase-3 activity; indicating to an intrinsic apoptosis mechanism. This study provides new evidence for the anticancer effect of CLF on U266 cell line. Further in vivo and clinical studies are warranted to evaluate its therapeutic potential for MM treatment

    Data Centric Molecular Analysis and Evaluation of Hepatocellular Carcinoma Therapeutics Using Machine Intelligence-Based Tools

    No full text
    Purpose Computational approaches have been used at different stages of drug development with the purpose of decreasing the time and cost of conventional experimental procedures. Lately, techniques mainly developed and applied in the field of artificial intelligence (AI), have been transferred to different application domains such as biomedicine. Methods In this study, we conducted an investigative analysis via data-driven evaluation of potential hepatocellular carcinoma (HCC) therapeutics in the context of AI-assisted drug discovery/repurposing. First, we discussed basic concepts, computational approaches, databases, modeling approaches, and featurization techniques in drug discovery/repurposing. In the analysis part, we automatically integrated HCC-related biological entities such as genes/proteins, pathways, phenotypes, drugs/compounds, and other diseases with similar implications, and represented these heterogeneous relationships via a knowledge graph using the CROssBAR system. Results Following the system-level evaluation and selection of critical genes/proteins and pathways to target, our deep learning-based drug/compound-target protein interaction predictors DEEPScreen and MDeePred have been employed for predicting new bioactive drugs and compounds for these critical targets. Finally, we embedded ligands of selected HCC-associated proteins which had a significant enrichment with the CROssBAR system into a 2-D space to identify and repurpose small molecule inhibitors as potential drug candidates based on their molecular similarities to known HCC drugs. Conclusions We expect that these series of data-driven analyses can be used as a roadmap to propose early-stage potential inhibitors (from database-scale sets of compounds) to both HCC and other complex diseases, which may subsequently be analyzed with more targeted in silico and experimental approaches

    CROssBAR: Comprehensive Resource of Biomedical Relations with Network Representations and Deep Learning

    No full text
    Biomedical information is scattered across different biological data resources, which are biologically related but only loosely linked to each other in terms of data connections. This hinders the applications of integrative systems biology applications on data. We aim to develop a comprehensive resource, CROssBAR, to address these shortcomings by establishing relationships between relevant biological data sources to present a well-connected database, focusing on the fields of drug discovery and precision medicine. CROssBAR will contain 3 modules: (1) novel computational methods using graph theory and deep learning algorithms, to reveal unknown drug-target interactions and gene/protein-disease associations; (2) multi-partite biological networks where nodes will represent compounds/drugs, genes/proteins, pathways/systems and diseases, the edges will represent known and predicted pairwise relations in-between; and (3) an open access database and web-service to provide access to the resultant networks with its components. We have developed data pipelines for the heavy lifting of data from different data sources like UniProt, ChEMBL, PubChem, Drugbank and EFO persisting only specific data attributes for biomedical entity networks. The database is hosted in self-sufficient collections in MongoDB. The CROssBAR resource should help researchers in the interpretation of biomedical data by observing biological entities together with their relations

    Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases

    No full text
    corecore