59 research outputs found
Virtual screening of potential bioactive substances using the support vector machine approach
Die vorliegende Dissertation stellt eine kumulative Arbeit dar, die in insgesamt acht wissenschaftlichen Publikationen (fünf publiziert, zwei eingerichtet und eine in Vorbereitung) dargelegt ist. In diesem Forschungsprojekt wurden Anwendungen von maschinellem Lernen für das virtuelle Screening von Moleküldatenbanken durchgeführt. Das Ziel war primär die Einführung und Überprüfung des Support-Vector-Machine (SVM) Ansatzes für das virtuelle Screening nach potentiellen Wirkstoffkandidaten. In der Einleitung der Arbeit ist die Rolle des virtuellen Screenings im Wirkstoffdesign beschrieben. Methoden des virtuellen Screenings können fast in jedem Bereich der gesamten pharmazeutischen Forschung angewendet werden. Maschinelles Lernen kann einen Einsatz finden von der Auswahl der ersten Moleküle, der Optimierung der Leitstrukturen bis hin zur Vorhersage von ADMET (Absorption, Distribution, Metabolism, Toxicity) Eigenschaften. In Abschnitt 4.2 werden möglichen Verfahren dargestellt, die zur Beschreibung von chemischen Strukturen eingesetzt werden können, um diese Strukturen in ein Format zu bringen (Deskriptoren), das man als Eingabe für maschinelle Lernverfahren wie Neuronale Netze oder SVM nutzen kann. Der Fokus ist dabei auf diejenigen Verfahren gerichtet, die in der vorliegenden Arbeit verwendet wurden. Die meisten Methoden berechnen Deskriptoren, die nur auf der zweidimensionalen (2D) Struktur basieren. Standard-Beispiele hierfür sind physikochemische Eigenschaften, Atom- und Bindungsanzahl etc. (Abschnitt 4.2.1). CATS Deskriptoren, ein topologisches Pharmakophorkonzept, sind ebenfalls 2D-basiert (Abschnitt 4.2.2). Ein anderer Typ von Deskriptoren beschreibt Eigenschaften, die aus einem dreidimensionalen (3D) Molekülmodell abgeleitet werden. Der Erfolg dieser Beschreibung hangt sehr stark davon ab, wie repräsentativ die 3D-Konformation ist, die für die Berechnung des Deskriptors angewendet wurde. Eine weitere Beschreibung, die wir in unserer Arbeit eingesetzt haben, waren Fingerprints. In unserem Fall waren die verwendeten Fingerprints ungeeignet zum Trainieren von Neuronale Netzen, da der Fingerprintvektor zu viele Dimensionen (~ 10 hoch 5) hatte. Im Gegensatz dazu hat das Training von SVM mit Fingerprints funktioniert. SVM hat den Vorteil im Vergleich zu anderen Methoden, dass sie in sehr hochdimensionalen Räumen gut klassifizieren kann. Dieser Zusammenhang zwischen SVM und Fingerprints war eine Neuheit, und wurde von uns erstmalig in die Chemieinformatik eingeführt. In Abschnitt 4.3 fokussiere ich mich auf die SVM-Methode. Für fast alle Klassifikationsaufgaben in dieser Arbeit wurde der SVM-Ansatz verwendet. Ein Schwerpunkt der Dissertation lag auf der SVM-Methode. Wegen Platzbeschränkungen wurde in den beigefügten Veröffentlichungen auf eine detaillierte Beschreibung der SVM verzichtet. Aus diesem Grund wird in Abschnitt 4.3 eine vollständige Einführung in SVM gegeben. Darin enthalten ist eine vollständige Diskussion der SVM Theorie: optimale Hyperfläche, Soft-Margin-Hyperfläche, quadratische Programmierung als Technik, um diese optimale Hyperfläche zu finden. Abschnitt 4.3 enthält auch eine Diskussion von Kernel-Funktionen, welche die genaue Form der optimalen Hyperfläche bestimmen. In Abschnitt 4.4 ist eine Einleitung in verschiede Methoden gegeben, die wir für die Auswahl von Deskriptoren genutzt haben. In diesem Abschnitt wird der Unterschied zwischen einer „Filter“- und der „Wrapper“-basierten Auswahl von Deskriptoren herausgearbeitet. In Veröffentlichung 3 (Abschnitt 7.3) haben wir die Vorteile und Nachteile von Filter- und Wrapper-basierten Methoden im virtuellen Screening vergleichend dargestellt. Abschnitt 7 besteht aus den Publikationen, die unsere Forschungsergebnisse enthalten. Unsere erste Publikation (Veröffentlichung 1) war ein Übersichtsartikel (Abschnitt 7.1). In diesem Artikel haben wir einen Gesamtüberblick der Anwendungen von SVM in der Bio- und Chemieinformatik gegeben. Wir diskutieren Anwendungen von SVM für die Gen-Chip-Analyse, die DNASequenzanalyse und die Vorhersage von Proteinstrukturen und Proteininteraktionen. Wir haben auch Beispiele beschrieben, wo SVM für die Vorhersage der Lokalisation von Proteinen in der Zelle genutzt wurden. Es wird dabei deutlich, dass SVM im Bereich des virtuellen Screenings noch nicht verbreitet war. Um den Einsatz von SVM als Hauptmethode unserer Forschung zu begründen, haben wir in unserer nächsten Publikation (Veröffentlichung 2) (Abschnitt 7.2) einen detaillierten Vergleich zwischen SVM und verschiedenen neuronalen Netzen, die sich als eine Standardmethode im virtuellen Screening etabliert haben, durchgeführt. Verglichen wurde die Trennung von wirstoffartigen und nicht-wirkstoffartigen Molekülen („Druglikeness“-Vorhersage). Die SVM konnte 82% aller Moleküle richtig klassifizieren. Die Klassifizierung war zudem robuster als mit dreilagigen feedforward-ANN bei der Verwendung verschiedener Anzahlen an Hidden-Neuronen. In diesem Projekt haben wir verschiedene Deskriptoren zur Beschreibung der Moleküle berechnet: Ghose-Crippen Fragmentdeskriptoren [86], physikochemische Eigenschaften [9] und topologische Pharmacophore (CATS) [10]. Die Entwicklung von weiteren Verfahren, die auf dem SVM-Konzept aufbauen, haben wir in den Publikationen in den Abschnitten 7.3 und 7.8 beschrieben. Veröffentlichung 3 stellt die Entwicklung einer neuen SVM-basierten Methode zur Auswahl von relevanten Deskriptoren für eine bestimmte Aktivität dar. Eingesetzt wurden die gleichen Deskriptoren wie in dem oben beschriebenen Projekt. Als charakteristische Molekülgruppen haben wir verschiedene Untermengen der COBRA Datenbank ausgewählt: 195 Thrombin Inhibitoren, 226 Kinase Inhibitoren und 227 Faktor Xa Inhibitoren. Es ist uns gelungen, die Anzahl der Deskriptoren von ursprünglich 407 auf ungefähr 50 zu verringern ohne signifikant an Klassifizierungsgenauigkeit zu verlieren. Unsere Methode haben wir mit einer Standardmethode für diese Anwendung verglichen, der Kolmogorov-Smirnov Statistik. Die SVM-basierte Methode erwies sich hierbei in jedem betrachteten Fall als besser als die Vergleichsmethoden hinsichtlich der Vorhersagegenauigkeit bei der gleichen Anzahl an Deskriptoren. Eine ausführliche Beschreibung ist in Abschnitt 4.4 gegeben. Dort sind auch verschiedene „Wrapper“ für die Deskriptoren-Auswahl beschrieben. Veröffentlichung 8 beschreibt die Anwendung von aktivem Lernen mit SVM. Die Idee des aktiven Lernens liegt in der Auswahl von Molekülen für das Lernverfahren aus dem Bereich an der Grenze der verschiedenen zu unterscheidenden Molekülklassen. Auf diese Weise kann die lokale Klassifikation verbessert werden. Die folgenden Gruppen von Moleküle wurden genutzt: ACE (Angiotensin converting enzyme), COX2 (Cyclooxygenase 2), CRF (Corticotropin releasing factor) Antagonisten, DPP (Dipeptidylpeptidase) IV, HIV (Human immunodeficiency virus) protease, Nuclear Receptors, NK (Neurokinin receptors), PPAR (peroxisome proliferator-activated receptor), Thrombin, GPCR und Matrix Metalloproteinasen. Aktives Lernen konnte die Leistungsfähigkeit des virtuellen Screenings verbessern, wie sich in dieser retrospektiven Studie zeigte. Es bleibt abzuwarten, ob sich das Verfahren durchsetzen wird, denn trotzt des Gewinns an Vorhersagegenauigkeit ist es aufgrund des mehrfachen SVMTrainings aufwändig. Die Publikationen aus den Abschnitten 7.5, 7.6 und 7.7 (Veröffentlichungen 5-7) zeigen praktische Anwendungen unserer SVM-Methoden im Wirkstoffdesign in Kombination mit anderen Verfahren, wie der Ähnlichkeitssuche und neuronalen Netzen zur Eigenschaftsvorhersage. In zwei Fällen haben wir mit dem Verfahren neuartige Liganden für COX-2 (cyclooxygenase 2) und dopamine D3/D2 Rezeptoren gefunden. Wir konnten somit klar zeigen, dass SVM-Methoden für das virtuelle Screening von Substanzdatensammlungen sinnvoll eingesetzt werden können. Es wurde im Rahmen der Arbeit auch ein schnelles Verfahren zur Erzeugung großer kombinatorischer Molekülbibliotheken entwickelt, welches auf der SMILES Notation aufbaut. Im frühen Stadium des Wirstoffdesigns ist es wichtig, eine möglichst „diverse“ Gruppe von Molekülen zu testen. Es gibt verschiedene etablierte Methoden, die eine solche Untermenge auswählen können. Wir haben eine neue Methode entwickelt, die genauer als die bekannte MaxMin-Methode sein sollte. Als erster Schritt wurde die „Probability Density Estimation“ (PDE) für die verfügbaren Moleküle berechnet. [78] Dafür haben wir jedes Molekül mit Deskriptoren beschrieben und die PDE im N-dimensionalen Deskriptorraum berechnet. Die Moleküle wurde mit dem Metropolis Algorithmus ausgewählt. [87] Die Idee liegt darin, wenige Moleküle aus den Bereichen mit hoher Dichte auszuwählen und mehr Moleküle aus den Bereichen mit niedriger Dichte. Die erhaltenen Ergebnisse wiesen jedoch auf zwei Nachteile hin. Erstens wurden Moleküle mit unrealistischen Deskriptorwerten ausgewählt und zweitens war unser Algorithmus zu langsam. Dieser Aspekt der Arbeit wurde daher nicht weiter verfolgt. In Veröffentlichung 6 (Abschnitt 7.6) haben wir in Zusammenarbeit mit der Molecular-Modeling Gruppe von Aventis-Pharma Deutschland (Frankfurt) einen SVM-basierten ADME Filter zur Früherkennung von CYP 2C9 Liganden entwickelt. Dieser nichtlineare SVM-Filter erreichte eine signifikant höhere Vorhersagegenauigkeit (q2 = 0.48) als ein auf den gleichen Daten entwickelten PLS-Modell (q2 = 0.34). Es wurden hierbei Dreipunkt-Pharmakophordeskriptoren eingesetzt, die auf einem dreidimensionalen Molekülmodell aufbauen. Eines der wichtigen Probleme im computerbasierten Wirkstoffdesign ist die Auswahl einer geeigneten Konformation für ein Molekül. Wir haben versucht, SVM auf dieses Problem anzuwenden. Der Trainingdatensatz wurde dazu mit jeweils mehreren Konformationen pro Molekül angereichert und ein SVM Modell gerechnet. Es wurden anschließend die Konformationen mit den am schlechtesten vorhergesagten IC50 Wert aussortiert. Die verbliebenen gemäß dem SVM-Modell bevorzugten Konformationen waren jedoch unrealistisch. Dieses Ergebnis zeigt Grenzen des SVM-Ansatzes auf. Wir glauben jedoch, dass weitere Forschung auf diesem Gebiet zu besseren Ergebnissen führen kann
A computational study of the substrate conversion and selective inhibition of aldosterone synthase
When a functional or structural impairment of cardiac output has occurred, the cardiovascular system will attempt to compensate for the reduced blood flow. Unfortunately, many of the resulting processes, such as the renin angiotensin aldosterone system, will progressively weaken the heart, resulting in the condition called heart failure. The renin angiotensin aldosterone regulatory system is currently targeted with medicine for heart failure. Many successes for the prolongation of patient age have been achieved by inhibition of angiotensin II synthesis and action. It has become apparent that this approach is suboptimal. Antagonists of aldosterone have provided better treatment options, however, side-effects are still observed. In the search for an alternative therapeutic application, we have studied a novel treatment involving the selective inhibition of aldosterone biosynthesis. The scope of this study has involved the in silico design and prediction of novel inhibitors, the synthesis of these inhibitors and analogues, and finally the in vitro measurement of their potency. The biosynthesis of aldosterone is performed by two cytochrome p450 enzymes, 11B1 and 11B2, denoted as CYP11B1 and CYP11B2, respectively. From these two family members, only CYP11B2 can perform the final synthesis step that converts 18-hydroxycorticosterone into aldosterone. CYP11B1 performs the synthesis of glucocorticoids that are responsible for metabolic, immunologic and homeostatic functions. Because these glucocorticoid actions should not be inhibited, the newly designed medicine must be CYP11B2 selective. Since CYP11B1 is highly homologous to CYP11B2, we have performed an in silico study that allows us to model the interactions of substrates and inhibitors in both the active sites of CYP11B1 and CYP11B2. Using comparative modelling, we have constructed models for the three dimensional architecture of both proteins. These models have been validated by investigating the torsional properties of the protein backbone and residue side chains, the overall protein packing and the dynamic behaviour of the protein models. Subsequently, the models have been used to evaluate the binding mechanisms and conversion mechanisms for the natural steroidal ligands of CYP11B1 and CYP11B2. A hypothetical binding mode has been proposed for 18-hydroxycorticosterone in CYP11B2, featuring the presence of stabilising hydrogen bonding interactions required for its conversion. Quantum mechanical analyses on the conversion of the steroids involved have shown a favourable conversion for this conformation, thereby supporting our hypothesis. In addition, the quantum mechanical analyses have provided insights on steroid conformations in the active sites during conversion. The suitability of the protein models for inhibitor design has been tested by subjecting the models to a case study with four known inhibitors of CYP11B1 and CYP11B2. Using molecular dynamics and molecular docking, the inhibitor potencies for CYP11B1 and CYP11B2 have been predicted, and their interactions with the proteins have been evaluated. The trends in inhibitor potency found by these computational methods have been confirmed by in vitro inhibition measurements. As a next step, the molecular docking study has been expanded to improve the confidence in the predictive power of the models. Using the protein states evaluated by the molecular dynamics study, the molecular docking results of inhibitor analogues have been investigated and the predictive power of the models has been qualitatively improved. In a final approach, we have performed a ligand-based investigation of the inhibitor analogues to determine which ligand characteristics are important for the potency for CYP11B1 and CYP11B2. To this end, we have conducted decision tree analyses on the physico-chemical properties of inhibitor substituents, resulting in a collection of descriptors that can be used for the prediction and design of novel inhibitors. We have shown that a combination of synthesis, molecular modelling and experimental measurements form a promising approach towards the design of potentially new inhibitors
Computational prediction of metabolism: sites, products, SAR, P450 enzyme dynamics, and mechanisms.
Metabolism of xenobiotics remains a central challenge for the discovery and development of drugs, cosmetics, nutritional supplements, and agrochemicals. Metabolic transformations are frequently related to the incidence of toxic effects that may result from the emergence of reactive species, the systemic accumulation of metabolites, or by induction of metabolic pathways. Experimental investigation of the metabolism of small organic molecules is particularly resource demanding; hence, computational methods are of considerable interest to complement experimental approaches. This review provides a broad overview of structure- and ligand-based computational methods for the prediction of xenobiotic metabolism. Current computational approaches to address xenobiotic metabolism are discussed from three major perspectives: (i) prediction of sites of metabolism (SOMs), (ii) elucidation of potential metabolites and their chemical structures, and (iii) prediction of direct and indirect effects of xenobiotics on metabolizing enzymes, where the focus is on the cytochrome P450 (CYP) superfamily of enzymes, the cardinal xenobiotics metabolizing enzymes. For each of these domains, a variety of approaches and their applications are systematically reviewed, including expert systems, data mining approaches, quantitative structure-activity relationships (QSARs), and machine learning-based methods, pharmacophore-based algorithms, shape-focused techniques, molecular interaction fields (MIFs), reactivity-focused techniques, protein-ligand docking, molecular dynamics (MD) simulations, and combinations of methods. Predictive metabolism is a developing area, and there is still enormous potential for improvement. However, it is clear that the combination of rapidly increasing amounts of available ligand- and structure-related experimental data (in particular, quantitative data) with novel and diverse simulation and modeling approaches is accelerating the development of effective tools for prediction of in vivo metabolism, which is reflected by the diverse and comprehensive data sources and methods for metabolism prediction reviewed here. This review attempts to survey the range and scope of computational methods applied to metabolism prediction and also to compare and contrast their applicability and performance.JK, MJW, JT, PJB, AB and RCG thank Unilever for funding
Modeling Chemical Interaction Profiles: II. Molecular Docking, Spectral Data-Activity Relationship, and Structure-Activity Relationship Models for Potent and Weak Inhibitors of Cytochrome P450 CYP3A4 Isozyme
Polypharmacy increasingly has become a topic of public health concern, particularly as the U.S. population ages. Drug labels often contain insufficient information to enable the clinician to safely use multiple drugs. Because many of the drugs are bio-transformed by cytochrome P450 (CYP) enzymes, inhibition of CYP activity has long been associated with potentially adverse health effects. In an attempt to reduce the uncertainty pertaining to CYP-mediated drug-drug/chemical interactions, an interagency collaborative group developed a consensus approach to prioritizing information concerning CYP inhibition. The consensus involved computational molecular docking, spectral data-activity relationship (SDAR), and structure-activity relationship (SAR) models that addressed the clinical potency of CYP inhibition. The models were built upon chemicals that were categorized as either potent or weak inhibitors of the CYP3A4 isozyme. The categorization was carried out using information from clinical trials because currently available in vitro high-throughput screening data were not fully representative of the in vivo potency of inhibition. During categorization it was found that compounds, which break the Lipinski rule of five by molecular weight, were about twice more likely to be inhibitors of CYP3A4 compared to those, which obey the rule. Similarly, among inhibitors that break the rule, potent inhibitors were 2–3 times more frequent. The molecular docking classification relied on logistic regression, by which the docking scores from different docking algorithms, CYP3A4 three-dimensional structures, and binding sites on them were combined in a unified probabilistic model. The SDAR models employed a multiple linear regression approach applied to binned 1D 13C-NMR and 1D 15N-NMR spectral descriptors. Structure-based and physical-chemical descriptors were used as the basis for developing SAR models by the decision forest method. Thirty-three potent inhibitors and 88 weak inhibitors of CYP3A4 were used to train the models. Using these models, a synthetic majority rules consensus classifier was implemented, while the confidence of estimation was assigned following the percent agreement strategy. The classifier was applied to a testing set of 120 inhibitors not included in the development of the models. Five compounds of the test set, including known strong inhibitors dalfopristin and tioconazole, were classified as probable potent inhibitors of CYP3A4. Other known strong inhibitors, such as lopinavir, oltipraz, quercetin, raloxifene, and troglitazone, were among 18 compounds classified as plausible potent inhibitors of CYP3A4. The consensus estimation of inhibition potency is expected to aid in the nomination of pharmaceuticals, dietary supplements, environmental pollutants, and occupational and other chemicals for in-depth evaluation of the CYP3A4 inhibitory activity. It may serve also as an estimate of chemical interactions via CYP3A4 metabolic pharmacokinetic pathways occurring through polypharmacy and nutritional and environmental exposures to chemical mixtures
Prioritizing Small Molecules for Drug Discovery or Chemical Safety Assessments using Ligand- and Structure-based Cheminformatics Approaches
Recent growth in the experimental data describing the effects of chemicals at the molecular, cellular, and organism level has triggered the development of novel computational approaches for the prediction of a chemical's effect on an organism. The studies described in this dissertation research predict chemical activity at three levels of biological complexity: binding of drugs to a single protein target, selective binding to a family of protein targets, and systemic toxicity. Optimizing cheminformatics methods that examine diverse sources of experimental data can lead to novel insight into the therapeutic use and toxicity of chemicals. In the first study, a combinatorial Quantitative Structure-Activity Relationship (QSAR) modeling workflow was successfully applied to the discovery of novel bioactive compound against one specific protein target: histone deacetylase inhibitors (HDACIs). Four candidate molecules were selected from the virtual screening hits to be tested experimentally, and three of them were confirmed active against HDAC. Next, a receptor-based protocol was established and applied to discover target-selective ligands within a family of proteins. This protocol extended the concept of protein/ligand interaction-guided pose selection by employing a binary classifier to discriminate poses of interest from a calibration set. The resulting virtual screening tools were applied for enriching beta2-adrenergic receptor (β2AR) ligands that are selective against other subtypes in the βAR family (i.e. β1AR and β3AR). Moreover, some computational 3D protein structures used in this study have exhibited comparative or even better performance in virtual screening than X-ray crystal structures of β2AR, and therefore computational tools that use these computational structures could complement tools utilizing experimental structures. Finally, a two-step hierarchical QSAR modeling approach was developed to estimate in vivo toxicity effects of small molecules. Besides the chemical structural descriptors, the developed models utilized additional biological information from in vitro bioassays. The derived models were more accurate than traditional QSAR models utilizing chemical descriptors only. Moreover, retrospective analysis of the developed models helped to identify the most informative bioassays, suggesting potential applicability of this methodology in guiding future toxicity experiments. These studies contribute to the development of computational strategies for comprehensive analysis of small molecules' biological properties, and have the potential to be integrated into existing methods for modern rational drug design and discovery.Doctor of Philosoph
Development of in silico models for the prediction of toxicity incorporating ADME information
Drug discovery is a process that requires a significant investment in both time and resources. Although recent developments have reduced the number of drugs failing at the later stages of development due to poor pharmacokinetic and/or toxicokinetic profiles, late stage attrition of drug candidates remains a problem. Additionally, there is a need to reduce animal testing for toxicological risk assessment for ethical and financial reasons. In silico methods offer an alternative that can address these challenges.
A variety of computational approaches have been developed in the last two decades, these must be evaluated to ensure confidence in their use. The research presented in this thesis has assessed a range of existing tools for the prediction of toxicity and absorption, distribution, metabolism and elimination (ADME) parameters with an emphasis on absorption and xenobiotic metabolism. These two ADME properties largely determine bioavailability of a drug and, in turn, also influence toxicity. In vitro (Caco-2 cells and the parallel artificial membrane permeation assay) and in silico approaches, such as various druglikeness filters, can be used to estimate human intestinal absorption; a comparison between different methods was performed to identify relative strengths and weaknesses of the approaches. In terms of xenobiotic metabolism it is not only important to predict metabolites correctly, but it is also crucial to identify those compounds that can be biotransformed into species that can covalently bind to biomolecules. Structural alerts are routinely used to screen for such potential reactive metabolites. The balance between sensitivity and specificity of such reactive metabolite alerts has been discussed in the context of correctly predicting reactive metabolites of pharmaceuticals (using data available from DrugBank). Off-target toxicity, exemplified by human Ether-à-go-go-Related Gene (hERG) channel inhibition, was also explored. A number of novel structural alerts for hERG toxicity were developed based on groups of structurally similar compounds. Finally, the importance of predicting potential ecotoxicological effects of drugs was also considered. The utility of zebrafish embryos to distinguish between baseline and excess toxicity was investigated. In evaluating this selection of existing tools, improvements to the methods have been proposed where possible
Lead optimization for new antimalarials and Successful lead identification for metalloproteinases: A Fragment-based approach Using Virtual Screening
Lead optimization for new antimalarials and Successful lead identification
for metalloproteinases: A Fragment-based approach Using Virtual Screening
Computer-aided drug design is an essential part of the modern medicinal
chemistry, and has led to the acceleration of many projects. The herein
described thesis presents examples for its application in the field of lead
optimization and lead identification for three metalloproteins.
DOXP-reductoisomerase (DXR) is a key enzyme of the mevalonate independent
isoprenoid biosynthesis. Structure-activity relationships for 43 DXR
inhibitors are established, derived from protein-based docking, ligand-based
3D QSAR and a combination of both approaches as realized by AFMoC. As part
of an effort to optimize the properties of the established inhibitor
Fosmidomycin, analogues have been synthesized and tested to gain further
insights into the primary determinants of structural affinity.
Unfortunately, these structures still leave the active Fosmidomycin
conformation and detailed reaction mechanism undetermined. This fact,
together with the small inhibitor data set provides a major challenge for
presently available docking programs and 3D QSAR tools. Using the recently
developed protein tailored scoring protocol AFMoC precise prediction of
binding affinities for related ligands as well as the capability to estimate
the affinities of structurally distinct inhibitors has been achieved.
Farnesyltransferase is a zinc-metallo enzyme that catalyzes the
posttranslational modification of numerous proteins involved in
intracellular signal transduction. The development of farnesyltransferase
inhibitors is directed towards the so-called non-thiol inhibitors because of
adverse drug effects connected to free thiols. A first step on the way to
non-thiol farnesyltransferase inhibitors was the development of an
CAAX-benzophenone peptidomimetic based on a pharmacophore model. On its
basis bisubstrate analogues were developed as one class of non-thiol
farnesyltransferase inhibitors. In further studies two aryl binding and two
distinct specificity sites were postulated. Flexible docking of model
compounds was applied to investigate the sub-pockets and design highly
active non-thiol farnesyltransferase inhibitor. In addition to affinity,
special attention was paid towards in vivo activity and species specificity.
The second part of this thesis describes a possible strategy for
computer-aided lead discovery. Assembling a complex ligand from simple
fragments has recently been introduced as an alternative to traditional HTS.
While frequently applied experimentally, only a few examples are known for
computational fragment-based approaches. Mostly, computational tools are
applied to compile the libraries and to finally assess the assembled
ligands. Using the metalloproteinase thermolysin (TLN) as a model target, a
computational fragment-based screening protocol has been established.
Starting with a data set of commercially available chemical compounds, a
fragment library has been compiled considering (1) fragment likeness and (2)
similarity to known drugs. The library is screened for target specificity,
resulting in 112 fragments to target the zinc binding area and 75 fragments
targeting the hydrophobic specificity pocket of the enzyme. After analyzing
the performance of multiple docking programs and scoring functions forand the most 14 candidates are selected for further analysis. Soaking
experiments were performed for reference fragment to derive a general
applicable crystallization protocol for TLN and subsequently for new
protein-fragment complex structures. 3-Methylsaspirin could be determined to
bind to TLN. Additional studies addressed a retrospective performance
analysis of the applied scoring functions and modification on the screening
hit. Curios about the differences of aspirin and 3-methylaspirin,
3-chloroaspirin has been synthesized and affinities could be determined to
be 2.42 mM; 1.73 mM und 522 μM respectively.
The results of the thesis show, that computer aided drug design approaches
could successfully support projects in lead optimization and lead
identification.
fragments in general, the fragments derived from the screening are docke
- …