Search CORE

160 research outputs found

A chemogenomics view on protein-ligand spaces

Author: A Bateman
A Golovin
A Henschel
A Lindström
AG Murzin
C McInnes
C O'Donovan
CA Lipinski
CM Dobson
D Dimitropoulos
D Rognan
DB Kitchen
DG Haylett
DS Wishart
G Wang
Gerard J Kleywegt
GJ Kleywegt
H Strömbergsson
H Strömbergsson
Helena Strömbergsson
HM Berman
I Dubchak
J Gasteiger
J Larsson
JES Wikberg
JR Bock
K Park
KP Seiler
L Eriksson
L Terfloth
LA Boothby
LH Greene
M Campillos
M Nayal
M Nervall
M Pastor
M Sandberg
P Rice
R Guha
RA Laskowski
RE Martin
SB Needleman
T Hastie
TA Eyre
TF Smith
TI Oprea
TR Hvidsten
U Pieper
W Guba
ZR Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Link Mining for Kernel-based Compound-Protein Interaction Predictions Using a Chemogenomics Approach

Author: A Lavecchia
ACA Nascimento
C-C Chang
D Rogers
H Ding
L Jacob
M Bouchard
M Gonen
M Hattori
MN Drwal
S Daminelli
T Laarhoven van
T Laarhoven van
TF Smith
Y Liu
Y Yamanishi
Publication venue
Publication date: 29/06/2017
Field of study

Virtual screening (VS) is widely used during computational drug discovery to reduce costs. Chemogenomics-based virtual screening (CGBVS) can be used to predict new compound-protein interactions (CPIs) from known CPI network data using several methods, including machine learning and data mining. Although CGBVS facilitates highly efficient and accurate CPI prediction, it has poor performance for prediction of new compounds for which CPIs are unknown. The pairwise kernel method (PKM) is a state-of-the-art CGBVS method and shows high accuracy for prediction of new compounds. In this study, on the basis of link mining, we improved the PKM by combining link indicator kernel (LIK) and chemical similarity and evaluated the accuracy of these methods. The proposed method obtained an average area under the precision-recall curve (AUPR) value of 0.562, which was higher than that achieved by the conventional Gaussian interaction profile (GIP) method (0.425), and the calculation time was only increased by a few percent

arXiv.org e-Print Archive

Crossref

Virtual screening of GPCRs: An in silico chemogenomics approach

Author: Hoffmann Brice
Jacob Laurent
Stoven Véronique
Vert Jean-Philippe
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

International audienceThe G-protein coupled receptor (GPCR) superfamily is currently the largest class of therapeutic targets. In silico prediction of interactions between GPCRs and small molecules in the transmembrane ligand-binding site is therefore a crucial step in the drug discovery process, which remains a daunting task due to the difficulty to characterize the 3D structure of most GPCRs, and to the limited amount of known ligands for some members of the superfamily. Chemogenomics, which attempts to characterize interactions between all members of a target class and all small molecules simultaneously, has recently been proposed as an interesting alternative to traditional docking or ligand-based virtual screening strategies

arXiv.org e-Print Archive

Machine Learning for In Silico Virtual Screening and Chemical Genomics: New Strategies

Author: Jacob Laurent
Vert Jean-Philippe
Publication venue: Bentham Science Publishers Ltd.
Publication date: 01/01/2008
Field of study

Support vector machines and kernel methods belong to the same class of machine learning algorithms that has recently become prominent in both computational biology and chemistry, although both fields have largely ignored each other. These methods are based on a sound mathematical and computationally efficient framework that implicitly embeds the data of interest, respectively proteins and small molecules, in high-dimensional feature spaces where various classification or regression tasks can be performed with linear algorithms. In this review, we present the main ideas underlying these approaches, survey how both the “biological” and the “chemical” spaces have been separately constructed using the same mathematical framework and tricks, and suggest different avenues to unify both spaces for the purpose of in silico chemogenomics

Protein-ligand interaction prediction: an improved chemogenomics approach

Author: Jacob Laurent
Vert Jean-Philippe
Publication venue: Oxford University Press
Publication date: 01/10/2008
Field of study

Motivation: Predicting interactions between small molecules and proteins is a crucial step to decipher many biological processes, and plays a critical role in drug discovery. When no detailed 3D structure of the protein target is available, ligand-based virtual screening allows the construction of predictive models by learning to discriminate known ligands from non-ligands. However, the accuracy of ligand-based models quickly degrades when the number of known ligands decreases, and in particular the approach is not applicable for orphan receptors with no known ligand

Benchmarking of protein descriptor sets in proteochemometric modeling (part 2): modeling performance of 13 amino acid descriptor sets.

Author: Bender A.
Cortes-Ciriano I.
IJzerman A.P.
Overington J.P.
Swier R.F.
Vlijmen H. van
Wegner J.K.
Westen G.J.P. van
Publication venue
Publication date: 01/01/2013
Field of study

Background While a large body of work exists on comparing and benchmarking descriptors of molecular structures, a similar comparison of protein descriptor sets is lacking. Hence, in the current work a total of 13 amino acid descriptor sets have been benchmarked with respect to their ability of establishing bioactivity models. The descriptor sets included in the study are Z-scales (3 variants), VHSE, T-scales, ST-scales, MS-WHIM, FASGAI, BLOSUM, a novel protein descriptor set (termed ProtFP (4 variants)), and in addition we created and benchmarked three pairs of descriptor combinations. Prediction performance was evaluated in seven structure-activity benchmarks which comprise Angiotensin Converting Enzyme (ACE) dipeptidic inhibitor data, and three proteochemometric data sets, namely (1) GPCR ligands modeled against a GPCR panel, (2) enzyme inhibitors (NNRTIs) with associated bioactivities against a set of HIV enzyme mutants, and (3) enzyme inhibitors (PIs) with associated bioactivities on a large set of HIV enzyme mutants. Results The amino acid descriptor sets compared here show similar performance ( 0.3 log units RMSE difference and >0.7 difference in MCC). Combining different descriptor sets generally leads to better modeling performance than utilizing individual sets. The best performers were Z-scales (3) combined with ProtFP (Feature), or Z-Scales (3) combined with an average Z-Scale value for each target, while ProtFP (PCA8), ST-Scales, and ProtFP (Feature) rank last. Conclusions While amino acid descriptor sets capture different aspects of amino acids their ability to be used for bioactivity modeling is still – on average – surprisingly similar. Still, combining sets describing complementary information consistently leads to small but consistent improvement in modeling performance (average MCC 0.01 better, average RMSE 0.01 log units lower). Finally, performance differences exist between the targets compared thereby underlining that choosing an appropriate descriptor set is of fundamental for bioactivity modeling, both from the ligand- as well as the protein side

Leiden University Scholary Publications

Systems biology approaches to a rational drug discovery paradigm

Author: Mizuguchi Kenji
Prathipati Philip
Publication venue: 'Bentham Science Publishers Ltd.'
Publication date: 25/11/2015
Field of study

The published manuscript is available at EurekaSelect via http://www.eurekaselect.com/openurl/content.php?genre=article&doi=10.2174/1568026615666150826114524.Prathipati P., Mizuguchi K.. Systems biology approaches to a rational drug discovery paradigm. Current Topics in Medicinal Chemistry, 16, 9, 1009. https://doi.org/10.2174/1568026615666150826114524

Osaka University Knowledge Archive

On the integration of in silico drug design methods for drug repurposing

Author: Chen Hongming
Engkvist Ola
MARCH VILA ERIC
PINZI LUCA
RASTELLI Giulio
Sturm Noã
TINIVELLA ANNACHIARA
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

Drug repurposing has become an important branch of drug discovery. Several computational approaches that help to uncover new repurposing opportunities and aid the discovery process have been put forward, or adapted from previous applications. A number of successful examples are now available. Overall, future developments will greatly benefit from integration of different methods, approaches and disciplines. Steps forward in this direction are expected to help to clarify, and therefore to rationally predict, new drug-target, target-disease, and ultimately drug-disease associations

ZENODO

Directory of Open Access Journals

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Exact and efficient top-K inference for multi-target prediction by querying separable linear relational models

Author: De Baets Bernard
Dembczynski Krzysztof
Stock Michiel
Waegeman Willem
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Many complex multi-target prediction problems that concern large target spaces are characterised by a need for efficient prediction strategies that avoid the computation of predictions for all targets explicitly. Examples of such problems emerge in several subfields of machine learning, such as collaborative filtering, multi-label classification, dyadic prediction and biological network inference. In this article we analyse efficient and exact algorithms for computing the top-

K

predictions in the above problem settings, using a general class of models that we refer to as separable linear relational models. We show how to use those inference algorithms, which are modifications of well-known information retrieval methods, in a variety of machine learning settings. Furthermore, we study the possibility of scoring items incompletely, while still retaining an exact top-K retrieval. Experimental results in several application domains reveal that the so-called threshold algorithm is very scalable, performing often many orders of magnitude more efficiently than the naive approach

arXiv.org e-Print Archive

Ghent University Academic Bibliography