17 research outputs found

    Causal reasoning over knowledge graphs leveraging drug-perturbed and disease-specific transcriptomic signatures for drug discovery

    Get PDF
    Network-based approaches are becoming increasingly popular for drug discovery as they provide a systems-level overview of the mechanisms underlying disease pathophysiology. They have demonstrated significant early promise over other methods of biological data representation, such as in target discovery, side effect prediction and drug repurposing. In parallel, an explosion of -omics data for the deep characterization of biological systems routinely uncovers molecular signatures of disease for similar applications. Here, we present RPath, a novel algorithm that prioritizes drugs for a given disease by reasoning over causal paths in a knowledge graph (KG), guided by both drug-perturbed as well as disease-specific transcriptomic signatures. First, our approach identifies the causal paths that connect a drug to a particular disease. Next, it reasons over these paths to identify those that correlate with the transcriptional signatures observed in a drug-perturbation experiment, and anti-correlate to signatures observed in the disease of interest. The paths which match this signature profile are then proposed to represent the mechanism of action of the drug. We demonstrate how RPath consistently prioritizes clinically investigated drug-disease pairs on multiple datasets and KGs, achieving better performance over other similar methodologies. Furthermore, we present two case studies showing how one can deconvolute the predictions made by RPath as well as predict novel targets.DDF, YG, AP, CWD, BBM, DH, JR, and VC have been funded by Enveda Biosciences. This work has been funded by Enveda Biosciences (https://www.envedabio.com/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. SM and DRB received no specific funding for this work.Peer ReviewedPostprint (author's final draft

    Comprehensive Fragment Screening of the SARS-CoV-2 Proteome Explores Novel Chemical Space for Drug Development

    Get PDF
    12 pags., 4 figs., 3 tabs.SARS-CoV-2 (SCoV2) and its variants of concern pose serious challenges to the public health. The variants increased challenges to vaccines, thus necessitating for development of new intervention strategies including anti-virals. Within the international Covid19-NMR consortium, we have identified binders targeting the RNA genome of SCoV2. We established protocols for the production and NMR characterization of more than 80 % of all SCoV2 proteins. Here, we performed an NMR screening using a fragment library for binding to 25 SCoV2 proteins and identified hits also against previously unexplored SCoV2 proteins. Computational mapping was used to predict binding sites and identify functional moieties (chemotypes) of the ligands occupying these pockets. Striking consensus was observed between NMR-detected binding sites of the main protease and the computational procedure. Our investigation provides novel structural and chemical space for structure-based drug design against the SCoV2 proteome.Work at BMRZ is supported by the state of Hesse. Work in Covid19-NMR was supported by the Goethe Corona Funds, by the IWBEFRE-program 20007375 of state of Hesse, the DFG through CRC902: “Molecular Principles of RNA-based regulation.” and through infrastructure funds (project numbers: 277478796, 277479031, 392682309, 452632086, 70653611) and by European Union’s Horizon 2020 research and innovation program iNEXT-discovery under grant agreement No 871037. BY-COVID receives funding from the European Union’s Horizon Europe Research and Innovation Programme under grant agreement number 101046203. “INSPIRED” (MIS 5002550) project, implemented under the Action “Reinforcement of the Research and Innovation Infrastructure,” funded by the Operational Program “Competitiveness, Entrepreneurship and Innovation” (NSRF 2014–2020) and co-financed by Greece and the EU (European Regional Development Fund) and the FP7 REGPOT CT-2011-285950—“SEE-DRUG” project (purchase of UPAT’s 700 MHz NMR equipment). The support of the CERM/CIRMMP center of Instruct-ERIC is gratefully acknowledged. This work has been funded in part by a grant of the Italian Ministry of University and Research (FISR2020IP_02112, ID-COVID) and by Fondazione CR Firenze. A.S. is supported by the Deutsche Forschungsgemeinschaft [SFB902/B16, SCHL2062/2-1] and the Johanna Quandt Young Academy at Goethe [2019/AS01]. M.H. and C.F. thank SFB902 and the Stiftung Polytechnische Gesellschaft for the Scholarship. L.L. work was supported by the French National Research Agency (ANR, NMR-SCoV2-ORF8), the Fondation de la Recherche Médicale (FRM, NMR-SCoV2-ORF8), FINOVI and the IR-RMN-THC Fr3050 CNRS. Work at UConn Health was supported by grants from the US National Institutes of Health (R01 GM135592 to B.H., P41 GM111135 and R01 GM123249 to J.C.H.) and the US National Science Foundation (DBI 2030601 to J.C.H.). Latvian Council of Science Grant No. VPP-COVID-2020/1-0014. National Science Foundation EAGER MCB-2031269. This work was supported by the grant Krebsliga KFS-4903-08-2019 and SNF-311030_192646 to J.O. P.G. (ITMP) The EOSC Future project is co-funded by the European Union Horizon Programme call INFRAEOSC-03-2020—Grant Agreement Number 101017536. Open Access funding enabled and organized by Projekt DEALPeer reviewe

    cthoyt/chembl-downloader: v0.4.3

    No full text
    What's Changed Refresh datasets from Cortés-Ciriano by @cthoyt in https://github.com/cthoyt/chembl-downloader/pull/9 Update README.md by @YojanaGadiya in https://github.com/cthoyt/chembl-downloader/pull/5 Update cortes-ciriano-refresh.ipynb by @rflameiro in https://github.com/cthoyt/chembl-downloader/pull/12 New Contributors @YojanaGadiya made their first contribution in https://github.com/cthoyt/chembl-downloader/pull/5 @rflameiro made their first contribution in https://github.com/cthoyt/chembl-downloader/pull/12 Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.4.2...v0.4.

    cthoyt/chembl-downloader: v0.4.2

    No full text
    Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.4.1...v0.4.

    Pharmacophore-based ML model to predict ligand selectivity for E3 ligase binders

    No full text
    E3 ligases are enzymes that play a critical role in ubiquitin-mediated protein degradation and are involved in various cellular processes. Pharmacophore analysis is a useful approach for predicting E3 ligase binding selectivity, which involves identifying key chemical features necessary for a ligand to interact with a specific protein target cavity. While pharmacophore analysis is not always sufficient to accurately predict ligand binding affinity, it can be a valuable tool for filtering and/or designing focused libraries for screening campaigns. In this study, we present a fast and inexpensive approach using a pharmacophore fingerprinting scheme known as ErG, which is used in a multiclass machine learning classification model. This model can assign the correct E3 ligase binder to its known E3 ligase and predict the probability of each molecule to bind to different E3 ligases. Practical applications of this approach are demonstrated on commercial libraries for rational design of E3 ligase binders

    Pharmaceutical patent landscaping: A novel approach to understand patents from the drug discovery perspective

    No full text
    Patents play a crucial role in the drug discovery process by providing legal protection for discoveries and incentivising investments in research and development. By identifying patterns within patent data resources, researchers can gain insight into the market trends and priorities of the pharmaceutical and biotechnology industries, as well as provide additional perspectives on more fundamental aspects such as the emergence of potential new drug targets. In this paper, we used the patent enrichment tool, PEMT, to extract, integrate, and analyse patent literature for rare diseases (RD) and Alzheimer's disease (AD). This is followed by a systematic review of the underlying patent landscape to decipher trends and applications in patents for these diseases. To do so, we discuss prominent organisations involved in drug discovery research in AD and RD. This allows us to gain an understanding of the importance of AD and RD from specific organisational (pharmaceutical or university) perspectives. Next, we analyse the historical focus of patents in relation to individual therapeutic targets and correlate them with market scenarios allowing the identification of prominent targets for a disease. Lastly, we identified drug repurposing activities within the two diseases with the help of patents. This resulted in identifying existing repurposed drugs and novel potential therapeutic approaches applicable to the indication areas. The study demonstrates the expanded applicability of patent documents from legal to drug discovery, design, and research, thus, providing a valuable resource for future drug discovery efforts. Moreover, this study is an attempt towards understanding the importance of data underlying patent documents and raising the need for preparing the data for machine learning-based applications

    BioDataFuse: Enhancing Data Interoperability through Modular Queries and Knowledge Graph Construction

    No full text
    In biological research, integrating experimental data with publicly available resources is pivotal for understanding complex biological mechanisms. However, this process is often intricate and time-consuming due to the complexity and diversity of data. Furthermore, the lack of consistent harmonization across different data types complicates the management of disparate data formats and sources. Addressing this, we introduce BioDataFuse, a query-based Python tool for seamless integration of biomedical data resources. BioDataFuse establishes a modular framework for efficient data wrangling, enabling context-specific knowledge graph creation and supporting graph-based analyses. With a user-friendly interface, it enables users to dynamically create knowledge graphs from their input experimental data. Supported by a robust Python package, pyBiodatafuse, this tool excels in data harmonization, aggregating diverse sources through modular queries. Moreover, BioDataFuse provides plugin capabilities for Cytoscape and Neo4j, allowing local graph hosting. Ongoing refinements enhance the graph utility through tasks like link prediction, making BioDataFuse a versatile solution for efficient and effective biological data integration

    Modern drug discovery using ethnobotany: A large-scale cross-cultural analysis of traditional medicine reveals common therapeutic uses

    No full text
    Summary: For millennia, numerous cultures and civilizations have relied on traditional remedies derived from plants to treat a wide range of conditions and ailments. Here, we systematically analyzed ethnobotanical patterns across taxonomically related plants, demonstrating that congeneric medicinal plants are more likely to be used for treating similar indications. Next, we reconstructed the phytochemical space covered by medicinal plants to reveal that (i) taxonomically related medicinal plants cover a similar phytochemical space, and (ii) chemical similarity correlates with similar therapeutic usage. Lastly, we present several case scenarios illustrating how mining this information can be used for drug discovery applications, including: (i) investigating taxonomic hotspots around particular indications, (ii) exploring shared patterns of congeneric plants located in different geographic areas, but which have been used to treat the same indications, and (iii) showing the concordance between ethnobotanical patterns among non-taxonomically related plants and the presence of shared bioactive phytochemicals

    Linking COVID-19 and Heme-Driven Pathophysiologies: A Combined Computational–Experimental Approach

    No full text
    The SARS-CoV-2 outbreak was declared a worldwide pandemic in 2020. Infection triggers the respiratory tract disease COVID-19, which is accompanied by serious changes in clinical biomarkers such as hemoglobin and interleukins. The same parameters are altered during hemolysis, which is characterized by an increase in labile heme. We present two computational–experimental approaches aimed at analyzing a potential link between heme-related and COVID-19 pathophysiologies. Herein, we performed a detailed analysis of the common pathways induced by heme and SARS-CoV-2 by superimposition of knowledge graphs covering heme biology and COVID-19 pathophysiology. Focus was laid on inflammatory pathways and distinct biomarkers as the linking elements. In a second approach, four COVID-19-related proteins, the host cell proteins ACE2 and TMPRSS2 as well as the viral proteins 7a and S protein were computationally analyzed as potential heme-binding proteins with an experimental validation. The results contribute to the understanding of the progression of COVID-19 infections in patients with different clinical backgrounds and may allow for a more individual diagnosis and therapy in the future

    Selection of data sets for FAIRification in drug discovery and development: Which, why, and how?

    No full text
    Despite the intuitive value of adopting the Findable, Accessible, Interoperable, and Reusable (FAIR) principles in both academic and industrial sectors, challenges exist in resourcing, balancing long- versus short-term priorities, and achieving technical implementation. This situation is exacerbated by the unclear mechanisms by which costs and benefits can be assessed when decisions on FAIR are made. Scientific and research and development (R&D) leadership need reliable evidence of the potential benefits and information on effective implementation mechanisms and remediating strategies. In this article, we describe procedures for cost–benefit evaluation, and identify best-practice approaches to support the decision-making process involved in FAIR implementation
    corecore