43 research outputs found

    Development of an infrastructure for the prediction of biological endpoints in industrial environments. Lessons learned at the eTOX Project

    No full text
    In silico methods are increasingly being used for assessing the chemical safety of substances, as a part of integrated approaches involving in vitro and in vivo experiments. A paradigmatic example of these strategies is the eTOX project http://www.etoxproject.eu, funded by the European Innovative Medicines Initiative (IMI), which aimed at producing high quality predictions of in vivo toxicity of drug candidates and resulted in generating about 200 models for diverse endpoints of toxicological interest. In an industry-oriented project like eTOX, apart from the predictive quality, the models need to meet other quality parameters related to the procedures for their generation and their intended use. For example, when the models are used for predicting the properties of drug candidates, the prediction system must guarantee the complete confidentiality of the compound structures. The interface of the system must be designed to provide non-expert users all the information required to choose the models and appropriately interpret the results. Moreover, procedures like installation, maintenance, documentation, validation and versioning, which are common in software development, must be also implemented for the models and for the prediction platform in which they are implemented. In this article we describe our experience in the eTOX project and the lessons learned after 7 years of close collaboration between industrial and academic partners. We believe that some of the solutions found and the tools developed could be useful for supporting similar initiatives in the future.The eTOX project (Grant Agreement No. 115002), was developed under the Innovative Medicines Initiative Joint Undertaking (IMI), resources of which are composed of a financial contribution from the European Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contributions. The authors of this article are also involved in other related IMI projects, such as iPiE (no. 115735), TransQST (no. 116030) and eTRANSAFE (no. 777365), as well as the H2020 EU-ToxRisk project (no. 681002

    An automated tool for obtaining QSAR-ready series of compounds using semantic web technologies

    No full text
    SUMMARY: We describe an application (Collector) for obtaining series of compounds annotated with bioactivity data, ready to be used for the development of quantitative structure-activity relationships (QSAR) models. The tool extracts data from the 'Open Pharmacological Space' (OPS) developed by the Open PHACTS project, using as input a valid name of the biological target. Collector uses the OPS ontologies for expanding the query using all known target synonyms and extracts compounds with bioactivity data against the target from multiple sources. The extracted data can be filtered to retain only drug-like compounds and the bioactivities can be automatically summarised to assign a single value per compound, yielding data ready to be used for QSAR modeling. The data obtained is locally stored facilitating the traceability and auditability of the process. Collector was used successfully for the development of models for toxicity endpoints within the eTOX project. AVAILABILITY AND IMPLEMENTATION: The software is available at http://phi.upf.edu/collector. The source code is located at https://github.com/phi-grib/Collector and is free for use under the GPL3 license. The web version is hosted at http://collector.upf.edu/. CONTACT: [email protected]. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.This project was developed under the Innovative Medicines Initiative Joint Undertaking Open PHACTS Project, grant agreement number 115191 and eTOX project, grant agreement n° 115002, resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contribution

    Ensemble prediction of mitochondrial toxicity using machine learning technology

    No full text
    Mitochondria are intracellular organelles found in most eukaryotic cells. Mitochondrial function includes the generation of cellular energy, maintenance of cellular homeostasis, and metabolic processes. Mitochondrial impairment has increasingly been recognized as a contributor to drug-induced toxicity. We have developed predictive models using machine learning methods for the prediction of the in vitro outcome of the MMP and the GluGal assays. These models were built using the open-source software Flame, which supports the combination of models in ensembles and the extension of the training data with further experimental data, continuously improving the predictive power of the models Despite the large amount of available data for training of models, most compounds are evaluated as negative, resulting in an imbalanced class distribution. This paper demonstrates the application of a combination of equally distributed low-level models in an ensemble to account for the imbalance in the training set resulting in a classifier providing high sensitivity and specificity of 92 and 87% respectively. The model generated can further be integrated with mechanistic in vitro data for improved screening for mitochondrial toxicity.This research was funded from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreements eTRANSAFE (777365). This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation program and EFPIA companies in kind contribution

    Flame: an open source framework for model development, hosting, and usage in production environments

    No full text
    This article describes Flame, an open source software for building predictive models and supporting their use in production environments. Flame is a web application with a web-based graphic interface, which can be used as a desktop application or installed in a server receiving requests from multiple users. Models can be built starting from any collection of biologically annotated chemical structures since the software supports structural normalization, molecular descriptor calculation, and machine learning model generation using predefined workflows. The model building workflow can be customized from the graphic interface, selecting the type of normalization, molecular descriptors, and machine learning algorithm to be used from a panel of state-of-the-art methods implemented natively. Moreover, Flame implements a mechanism allowing to extend its source code, adding unlimited model customization. Models generated with Flame can be easily exported, facilitating collaborative model development. All models are stored in a model repository supporting model versioning. Models are identified by unique model IDs and include detailed documentation formatted using widely accepted standards. The current version is the result of nearly 3 years of development in collaboration with users from the pharmaceutical industry within the IMI eTRANSAFE project, which aims, among other objectives, to develop high-quality predictive models based on shared legacy data for assessing the safety of drug candidates.This work has received funding from the eTRANSAFE project (Grant Agreement No. 777365), developed under the Innovative Medicines Initiative Joint Undertaking (IMI2), resources of which are composed of a financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contributions. The authors of this article are also involved in other related IMI projects which contributed funding, such as TransQST (No. 116030) as well as the H2020 EU-ToxRisk project (No. 681002) and FAIRplus (No. 802750). The Research Programme on Biomedical Informatics (GRIB) is a member of the Spanish National Bioinformatics Institute (INB), funded by ISCIII and FEDER (PT17/0009/0014). The DCEXS is a ‘Unidad de Excelencia María de Maeztu’, funded by the AEI (CEX2018-000782-M). The GRIB is also supported by the Agùncia de Gestió d’Ajuts Universitaris i de Recerca (AGAUR), Generalitat de Catalunya (2017 SGR 00519)

    eTOXlab, an open source modeling framework for implementing predictive models in production environments.

    No full text
    BACKGROUND: Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical industry) for the early assessment of the biological properties of new compounds. However, most of the tools currently available for developing QSAR models are not well suited for supporting the whole QSAR model life cycle in production environments./nRESULTS: We have developed eTOXlab; an open source modeling framework designed to be used at the core of a self-contained virtual machine that can be easily deployed in production environments, providing predictions as web services. eTOXlab consists on a collection of object-oriented Python modules with methods mapping common tasks of standard modeling workflows. This framework allows building and validating QSAR models as well as predicting the properties of new compounds using either a command line interface or a graphic user interface (GUI). Simple models can be easily generated by setting a few parameters, while more complex models can be implemented by overriding pieces of the original source code. eTOXlab benefits from the object-oriented capabilities of Python for providing high flexibility: any model implemented using eTOXlab inherits the features implemented in the parent model, like common tools and services or the automatic exposure of the models as prediction web services. The particular eTOXlab architecture as a self-contained, portable prediction engine allows building models with confidential information within corporate facilities, which can be safely exported and used for prediction without disclosing the structures of the training series. CONCLUSIONS: The software presented here provides full support to the specific needs of users that want to develop, use and maintain predictive models in corporate environments. The technologies used by eTOXlab (web services, VM, object-oriented programming) provide an elegant solution to common practical issues; the system can be installed easily in heterogeneous environments and integrates well with other software. Moreover, the system provides a simple and safe solution for building models with confidential structures that can be shared without disclosing sensitive informationThe research leading to these results has received support from the Innovative Medicines Initiative (IMI) Joint Undertaking under grant agreement n° 115002 (eTOX), resources of which are composed of financial contribution from the/nEuropean Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contribution

    Development and validation of AMANDA, a new algorithm for selecting highly relevant regions in Molecular Interaction Fields

    No full text
    Descriptors based on Molecular Interaction Fields (MIF) are highly suitable for drug discovery, but their size (thousands of variables) often limits their application in practice. Here we describe a simple and fast computational method that extracts from a MIF a handful of highly informative points (hot spots) which summarize the most relevant information. The method was specifically developed for drug discovery, is fast, and does not require human supervision, being suitable for its application on very large series of compounds. The quality of the results has been tested by running the method on the ligand structure of a large number of ligand-receptor complexes and then comparing the position of the selected hot spots with actual atoms of the receptor. As an additional test, the hot spots obtained with the novel method were used to obtain GRIND-like molecular descriptors which were compared with the original GRIND. In both cases the results show that the novel method is highly suitable for describing ligand-receptor interactions and compares favorably with other state-of-the-art methods

    Suitability of GRIND-based principal properties for the description of molecular similarity and ligand-based virtual screening

    No full text
    The information provided by the alignment-independent GRid Independent Descriptors (GRIND) can be condensed by the application of principal component analysis, obtaining a small number of principal properties (GRIND-PP), which is more suitable for describing molecular similarity. The objective of the present study is to optimize diverse parameters involved in the obtention of the GRIND-PP and validate their suitability for applications, requiring a biologically relevant description of the molecular similarity. With this aim, GRIND-PP computed with a collection of diverse settings were used to carry out ligand-based virtual screening (LBVS) on standard conditions. The quality of the results obtained was remarkable and comparable with other LBVS methods, and their detailed statistical analysis allowed to identify the method settings more determinant for the quality of the results and their optimum. Remarkably, some of these optimum settings differ significantly from those used in previously published applications, revealing their unexplored potential. Their applicability in large compound database was also explored by comparing the equivalence of the results obtained using either computed or projected principal properties. In general, the results of the study confirm the suitability of the GRIND-PP for practical applications and provide useful hints about how they should be computed for obtaining optimum results

    Toward a unifying strategy for the structure-based prediction of toxicological endpoints

    No full text
    Most computational methods used for the prediction of toxicity endpoints are based on the assumption that similar compounds have similar biological properties. This principle can be exploited using computational methods like read across or quantitative structure-activity relationships. However, there is no general agreement about which method is the most appropriate for quantifying compound similarity neither for exploiting the similarity principle in order to obtain reliable estimations of the compound properties. Moreover, optimal similarity metrics and modeling methods might depend on the characteristics of the endpoints and training series used in each case. This study describes a comparative analysis of the predictive performance of diverse similarity metrics and modeling methods in toxicological applications. A collection of two quantitative (n = 660, n = 1114) and three qualitative (n = 447, n = 905, n = 1220) datasets representing very different endpoints of interest in drug safety evaluation and rigorous methods were used to estimate the external predictive ability in each case. The results confirm that no single approach produces the best results in all instances, and the best predictions were obtained using different tools in different situations. The trends observed in this study were exploited to propose a unifying strategy allowing the use of the most suitable method for every compound. A comparison of the quality of the predictions obtained by the unifying strategy with those obtained by standard prediction methods confirmed the usefulness of the proposed approach.The research leading to these results has received support from the Innovative Medicines Initiative Joint Undertaking, under Grant Agreement No. 115002 (eTOX), resources of which are composed of a financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contribution

    Hepatotoxicity prediction by systems biology modeling of disturbed metabolic pathways using gene expression data

    No full text
    The present study applies a systems biology approach for the in silico predictive modeling of drug toxicity on the basis of high-quality preclinical drug toxicity data with the aim of increasing the mechanistic understanding of toxic effects of compounds at different levels (pathway, cell, tissue, organ). The model development was carried out using 77 compounds for which gene expression data for treated primary human hepatocytes is available in the LINCS database and for which rodent in vivo hepatotoxicity information is available in the eTOX database. The data from LINCS were used to determine the type and number of pathways disturbed by each compound and to estimate the extent of disturbance (network perturbation elasticity), and were used to analyze the correspondence with the in vivo information from eTOX. Predictive models were developed through this integrative analysis, and their specificity and sensitivity were assessed. The quality of the predictions was determined on the basis of the area under the curve (AUC) of plots of true positive vs. false positive rates (ROC curves). The ROC AUC reached values of up to 0.9 (out of 1.0) for some hepatotoxicity endpoints. Moreover, the most frequently disturbed metabolic pathways were determined across the studied toxicants. They included, e.g., mitochondrial beta-oxidation of fatty acids and amino acid metabolism. The process was exemplified by successful predictions on various statins. In conclusion, an entirely new approach linking gene expression alterations to the prediction of complex organ toxicity was developed and evaluated.The research leading to these results has received support from the Innovative Medicines Initiative (IMI) Joint Undertaking under grant agreement nÂș 115002 (eTOX), resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7/200-2013) and EFPIA companie's in kind contributions

    Induced effects of sodium ions on dopaminergic G-protein coupled receptors

    No full text
    G-protein coupled receptors, the largest family of proteins in the human genome, are involved in many complex signal transduction pathways, typically activated by orthosteric ligand binding and subject to allosteric modulation. Dopaminergic receptors, belonging to the class A family of G-protein coupled receptors, are known to be modulated by sodium ions from an allosteric binding site, although the details of sodium effects on the receptor have not yet been described. In an effort to understand these effects, we performed microsecond scale all-atom molecular dynamics simulations on the dopaminergic D2 receptor, finding that sodium ions enter the receptor from the extracellular side and bind at a deep allosteric site (Asp2.50). Remarkably, the presence of a sodium ion at this allosteric site induces a conformational change of the rotamer toggle switch Trp6.48 which locks in a conformation identical to the one found in the partially inactive state of the crystallized human ÎČ2 adrenergic receptor. This study provides detailed quantitative information about binding of sodium ions in the D2 receptor and reports a possibly important sodium-induced conformational change for modulation of D2 receptor function.We acknowledge partial support from the EU funded VPH Network of Excellence, La MARATO de TV3 Foundation (Ref.-No. 091010), the HERACLES (RD06/0009) and COMBIOMED (RD07/0067), the Spanish Ministerio de Educacion y Ciencia (SAF2009-13609-C04-04). GDF acknowledges support from the Ramon y Cajal scheme and by the Spanish Ministry of Science and Innovation (Ref. FIS2008-01040). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscrip
    corecore