10 research outputs found
Virtual screening for PPAR-gamma ligands using the ISOAK molecular graph kernel and gaussian processes
For a virtual screening study, we introduce a combination of machine learning techniques, employing a graph kernel, Gaussian process regression and clustered cross-validation. The aim was to find ligands of peroxisome-proliferator activated receptor gamma (PPAR-y). The receptors in the PPAR family belong to the steroid-thyroid-retinoid superfamily of nuclear receptors and act as transcription factors. They play a role in the regulation of lipid and glucose metabolism in vertebrates and are linked to various human processes and diseases. For this study, we used a dataset of 176 PPAR-y agonists published by Ruecker et al. ..
Kernel learning for ligand-based virtual screening: discovery of a new PPARgamma agonist
Poster presentation at 5th German Conference on Cheminformatics: 23. CIC-Workshop Goslar, Germany. 8-10 November 2009 We demonstrate the theoretical and practical application of modern kernel-based machine learning methods to ligand-based virtual screening by successful prospective screening for novel agonists of the peroxisome proliferator-activated receptor gamma (PPARgamma) [1]. PPARgamma is a nuclear receptor involved in lipid and glucose metabolism, and related to type-2 diabetes and dyslipidemia. Applied methods included a graph kernel designed for molecular similarity analysis [2], kernel principle component analysis [3], multiple kernel learning [4], and, Gaussian process regression [5]. In the machine learning approach to ligand-based virtual screening, one uses the similarity principle [6] to identify potentially active compounds based on their similarity to known reference ligands. Kernel-based machine learning [7] uses the "kernel trick", a systematic approach to the derivation of non-linear versions of linear algorithms like separating hyperplanes and regression. Prerequisites for kernel learning are similarity measures with the mathematical property of positive semidefiniteness (kernels). The iterative similarity optimal assignment graph kernel (ISOAK) [2] is defined directly on the annotated structure graph, and was designed specifically for the comparison of small molecules. In our virtual screening study, its use improved results, e.g., in principle component analysis-based visualization and Gaussian process regression. Following a thorough retrospective validation using a data set of 176 published PPARgamma agonists [8], we screened a vendor library for novel agonists. Subsequent testing of 15 compounds in a cell-based transactivation assay [9] yielded four active compounds. The most interesting hit, a natural product derivative with cyclobutane scaffold, is a full selective PPARgamma agonist (EC50 = 10 ± 0.2 microM, inactive on PPARalpha and PPARbeta/delta at 10 microM). We demonstrate how the interplay of several modern kernel-based machine learning approaches can successfully improve ligand-based virtual screening results
Robust brain-computer interfaces
A brain-computer interface (BCI) enables direct communication from the brain to devices, bypassing the traditional pathway of peripheral nerves and muscles. Current BCIs aimed at patients require that the user invests weeks, or even months, to learn the skill to intentionally modify their brain signals. This can be reduced to a calibration session of about half an hour per session if machine learning (ML) methods are used. The laborious recalibration is still needed due to inter-session differences in the statistical properties of the electroencephalography (EEG) signal. Further, the natural variability in spontaneous EEG violates basic assumptions made by the ML methods used to train the BCI classifier, and causes the classification accuracy to fluctuate unpredictably. These fluctuations make the current generation of BCIs unreliable. In this dissertation,we will investigate the nature of these variations in the EEG distributions, and introduce two new, complementary methods to overcome these two key issues. To confirm the problem of non-stationary brain signals, we first show that BCIs based on commonly used signal features are sensitive to changes in the mental state of the user. We proceed by describing a method aimed at removing these changes in signal feature distributions. We have devised a method that uses a second-order baseline (SOB) to specifically isolate these relative changes in neuronal firing synchrony. To the best of our knowledge this is the first BCI classifier that works on out-of-sample subjects without any loss of performance. Still, the assumption made by ML methods that the training data consists of samples that are independent and identically distributed (iid) is violated, because EEG samples nearby in time are highly correlated. Therefore we derived a generalization of the well-known support vector machine (SVM) classifier, that takes the resulting chronological structure of classification errors into account. Both on artificial data and real BCI data, overfitting is reduced with this dependent samples support vector machine (dSVM), leading to BCIs with an increased information throughput
GWK.
Die Autoren – Professoren und Mitarbeiter der GWK – stellen richtige und falsche Fragen, üben mit uns das Liken und Faven und duellieren Times und Helvetica. Sie bewerben Wirkungsfiktionen, verknüpfen das Gesagte mit dem Gemeinten und ... lesen Sie selbst! GWK001 bereist die Landschaft der Kommunikation, verhandelt Positionen, knüpft Verbindungen – erzählt manche Geschichte und fordert Sie auch mal zum Handeln auf! So wirkt GWK als Seismograph, Übersetzer und (Mit-) Gestalter von soziokulturellen und kommunikativen Wandlungen