9 research outputs found

    Development of Knowledge Within a Chemical-Toxicological Database to Formulate Novel Computational Approaches for Predicting Repeated Dose Toxicity of Cosmetics-Related Compounds

    The European Union (EU) Cosmetics Regulation established a ban on animal testing for cosmetics ingredients. This ban does not assume that all cosmetics ingredients are safe, but that non-testing procedures (in vitro and in silico) have to be applied for their safety assessment. To this end, the SEURAT-1 cluster was funded by the EU 7th Framework Programme and Cosmetics Europe. The COSMOS (Integrated In Silico Models for the Prediction of Human Repeated Dose Toxicity of COSMetics to Optimise Safety) project was initiated as one of the seven consortia of the cluster, with the purpose of facilitating the prediction of human repeated dose toxicity associated with exposure to cosmetics-related compounds through in silico approaches. A critical objective of COSMOS was to address the paucity of publicly available data for cosmetics ingredients and related chemicals. Therefore, a database was established containing (i) an inventory of cosmetics ingredients and related structures; (ii) skin permeability/absorption data (route of exposure relevant to cosmetics); and (iii) repeated dose toxicity data. This thesis describes the process of “knowledge discovery from the data”, including collation of the content of the COSMOS database and its subsequent application for developing tools to support the prediction of repeated dose toxicity of cosmetics and related compounds. A rigorous strategy of curation and quality control of chemical records was applied in developing the database (as documented in the Standard Operating Procedure, chapter 2). The chemical space of the cosmetics-related compounds was compared to food-related compounds from the U.S. FDA CFSAN PAFA database using a novel approach combining the analysis of structural features (ToxPrint chemotypes) and physicochemical properties. Cosmetics- and food-specific structural classes, related to particular use functions and manifested by distinct physicochemical properties, were identified (chapter 3). 
The novel COSMOS Skin Permeability Database containing in vivo and in vitro skin permeability/absorption data was developed by integrating existing databases and enriching them with new data for cosmetics harvested from regulatory documents and scientific literature (chapter 4). Compounds with available data on human in vitro maximal flux (JMAX) were subsequently extracted from the developed database and analysed in terms of their structural features (ToxPrint chemotypes) and physicochemical properties. The profile of compounds exhibiting low or high skin permeability potential was determined. The results of this analysis can support rapid screening and classification of compounds without experimental data (chapter 5). The new COSMOS oral repeated dose toxicity database was established through consolidation of existing data sources and harvesting of new regulatory documents and scientific literature. The unique data structure of the COSMOS oRepeatToxDB allows capturing all toxicological effects observed at particular dose levels and sites, which are hierarchically differentiated as organs, tissues, and cells (chapter 6). This database design enabled the development of a liver toxicity ontology, followed by mechanistic mining of in vivo data (chapter 7). As a result, compounds associated with liver steatosis, steatohepatitis and fibrosis phenotypic effects were identified and further analysed. The probable mechanistic reasoning for toxicity (Peroxisome Proliferator-Activated Receptor gamma (PPARγ) activation) was formulated for two hepatotoxicants, namely 1,3-bis-(2,4-diaminophenoxy)-propane and piperonyl butoxide. Key outcomes of this thesis include an extensive curated database, Standard Operating Procedures, skin permeability potential classification rules, and the set of structural features associated with liver steatosis. 
Such knowledge is particularly important in light of 21st Century Toxicology (NRC, 2007) and the ongoing need to move away from animal toxicity testing to non-testing alternatives.

    Development and application of distributed computing tools for virtual screening of large compound libraries

    In the current drug discovery process, the identification of new target proteins and their potential ligands is tedious, expensive and time-consuming. 
Thus, the use of in silico techniques is of great importance and has proved to be a valuable strategy for detecting complex structural and bioactivity relationships. The growing demand for computational power in scientific fields and the timely analysis of the large volumes of data generated require innovative strategies for the efficient use of distributed computing resources in the form of computational grids. Such grids add a new aspect to the emerging information technology paradigm by providing and coordinating heterogeneous resources such as various organizations, people, computing, storage and networking facilities, as well as data, knowledge, software and workflows. The aim of this study was to develop a university-wide grid infrastructure, UVieCo (University of Vienna Condor pool), which can be used to implement standard structure- and ligand-based drug discovery applications using freely available academic software. Firewall and security issues were resolved with a virtual private network setup, whereas virtualization of computer hardware was done using the CoLinux concept so that Linux-executable jobs can run inside Windows machines. The effectiveness of the grid was assessed by performance measurement experiments using sequential and parallel tasks. Subsequently, the association of expression/sensitivity profiles of ABC transporters with activity profiles of anticancer compounds was analyzed by mining data from the NCI (National Cancer Institute). The datasets generated in this analysis were used with ligand-based computational methods such as shape similarity and classification algorithms to identify P-gp substrates and separate them from non-substrates. While developing predictive classification models, the problem of extremely imbalanced class distribution was addressed using the cost-sensitive bagging approach. 
An applicability domain experiment revealed that our model not only predicts NCI compounds well but can also be applied to drug-like molecules. The developed models were relatively simple but precise enough for virtual screening of large chemical libraries for the early identification of P-gp substrates, which can potentially be useful for removing compounds with poor ADMET properties in an early phase of drug discovery. Additionally, shape-similarity and self-organizing map techniques were used to screen an in-house database as well as a large vendor database to identify novel selective serotonin reuptake inhibitor (SSRI)-like compounds that can induce apoptosis. The retrieved hits possess novel chemical scaffolds and can be considered as starting points for lead optimization studies. The work described in this thesis will be useful for creating a distributed computing environment using available resources within an organization, and can be applied to various tasks such as efficient handling of imbalanced data classification problems or multistep virtual screening.
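
The abstract above names cost-sensitive bagging as the remedy for class imbalance but does not describe the implementation. As a rough illustration only, the sketch below shows one simple variant of the idea: each bootstrap sample is drawn with probability proportional to a per-class misclassification cost, so the rare class is better represented when each base learner is fitted. The one-dimensional threshold "stump", the function names, and the cost values are all assumptions made for this example and are not taken from the thesis.

```python
import random

def fit_stump(sample):
    """Hypothetical base learner: pick the threshold t that minimises
    training errors for the fixed rule 'x > t means class 1'."""
    xs = sorted({x for x, _ in sample})
    candidates = (
        [float("-inf")]
        + [(a + b) / 2 for a, b in zip(xs, xs[1:])]
        + [float("inf")]
    )

    def errors(t):
        return sum((x > t) != (y == 1) for x, y in sample)

    return min(candidates, key=errors)

def cost_sensitive_bagging(rows, cost, n_models=25, seed=0):
    """Fit n_models stumps, each on a bootstrap sample in which a row is
    drawn with probability proportional to the cost of its class."""
    rng = random.Random(seed)
    weights = [cost[y] for _, y in rows]
    return [
        fit_stump(rng.choices(rows, weights=weights, k=len(rows)))
        for _ in range(n_models)
    ]

def predict(thresholds, x):
    """Majority vote over the fitted stumps."""
    votes = sum(x > t for t in thresholds)
    return 1 if 2 * votes > len(thresholds) else 0
```

With rows given as `(feature, label)` tuples and a cost mapping such as `{0: 1, 1: 10}` (an illustrative value), the rare class 1 is sampled roughly ten times more often per row than class 0. A practical model would use a stronger base learner and multi-dimensional descriptors.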

    Methods for the Analysis of Matched Molecular Pairs and Chemical Space Representations

    Compound optimization is a complex process where different properties are optimized to increase the biological activity and therapeutic effects of a molecule. Frequently, the structure of molecules is modified in order to improve their property values. Therefore, computational analysis of the effects of structure modifications on property values is of great importance for the drug discovery process. It is also essential to analyze chemical space, i.e., the set of all chemically feasible molecules, in order to find subsets of molecules that display favorable property values. This thesis aims to expand the computational repertoire to analyze the effect of structure alterations and visualize chemical space. Matched molecular pairs are defined as pairs of compounds that share a large common substructure and only differ by a small chemical transformation. They have been frequently used to study property changes caused by structure modifications. These analyses are expanded in this thesis by studying the effect of chemical transformations on the ionization state and ligand efficiency, both measures of great importance in drug design. Additionally, novel matched molecular pairs based on retrosynthetic rules are developed to increase their utility for prospective use of chemical transformations in compound optimization. Further, new methods based on matched molecular pairs are described to obtain preliminary SAR information of screening hit compounds and predict the potency change caused by a chemical transformation. Visualizations of chemical space are introduced to aid compound optimization efforts. First, principal component plots are used to rationalize a matched molecular pair based multi-objective compound optimization procedure. Then, star coordinate and parallel coordinate plots are introduced to analyze drug-like subspaces, where compounds with favorable property values can be found. 
Finally, a novel network-based visualization of high-dimensional property space is developed. In conclusion, the applications developed in this thesis expand the methodological spectrum of computer-aided compound optimization.
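
The definition of matched molecular pairs given above (a shared core plus a small chemical transformation) can be illustrated with a toy sketch. It assumes each compound has already been split into a core and a single substituent. The record layout, the function names, and the example values are hypothetical and not taken from the thesis; a real workflow would fragment structures with a cheminformatics toolkit first.

```python
from collections import defaultdict
from itertools import combinations

def matched_pairs(records):
    """records: iterable of (name, core, substituent, property_value).
    Compounds sharing a core but differing in substituent form a pair."""
    by_core = defaultdict(list)
    for name, core, substituent, value in records:
        by_core[core].append((name, substituent, value))

    pairs = []
    for members in by_core.values():
        for (n1, s1, v1), (n2, s2, v2) in combinations(members, 2):
            if s1 != s2:
                # Transformation is written in input order: first >> second.
                pairs.append((n1, n2, f"{s1}>>{s2}", v2 - v1))
    return pairs

def mean_change_by_transformation(pairs):
    """Average property change associated with each transformation."""
    grouped = defaultdict(list)
    for _, _, transformation, delta in pairs:
        grouped[transformation].append(delta)
    return {t: sum(d) / len(d) for t, d in grouped.items()}
```

Grouping the pairs by transformation gives the average property shift for a given structural change across different cores, which is the kind of summary the abstract refers to when it speaks of studying property changes caused by structure modifications.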

    Animal Experimentation: Working Towards a Paradigm Change

    Animal experimentation has been one of the most controversial areas of animal use, mainly due to the intentional harms inflicted upon animals for the sake of hoped-for benefits in humans. Despite this rationale for continued animal experimentation, shortcomings of this practice have become increasingly apparent and well-documented. However, these limitations are not yet widely known or appreciated, and there is a danger that they may simply be ignored. The 51 experts who have contributed to Animal Experimentation: Working Towards a Paradigm Change critically review current animal use in science, present new and innovative non-animal approaches to address urgent scientific questions, and offer a roadmap towards an animal-free world of science.

    Designing molecules possessing desired physical property values

    Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Chemical Engineering, 1989. Includes bibliographical references (leaves 276-288). By Kevin G. Joback. Ph.D.