Search CORE

138 research outputs found

Effect of missing data on multitask prediction methods

Author: A Anighoro
A Mayr
A Tropsha
Antonio de la Vega de León
AP Bento
B Chen
B Ramsundar
Beining Chen
D Fourches
D Rogers
D Weininger
G Harper
J Ma
J Simm
JG Moffat
KY Helal
L Breiman
M Glick
MR Berthold
S Kim
S Knapp
SL Kinnings
SM Wilhelm
T Unterthiner
TWH Backman
Valerie J. Gillet
Y LeCun
Y Wang
Y Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2018
Field of study

There has been a growing interest in multitask prediction in chemoinformatics, helped by the increasing use of deep neural networks in this field. This technique is applied to multitarget data sets, where compounds have been tested against different targets, with the aim of developing models to predict a profile of biological activities for a given compound. However, multitarget data sets tend to be sparse; i.e., not all compound-target combinations have experimental values. There has been little research on the effect of missing data on the performance of multitask methods. We have used two complete data sets to simulate sparseness by removing data from the training set. Different models to remove the data were compared. These sparse sets were used to train two different multitask methods, deep neural networks and Macau, which is a Bayesian probabilistic matrix factorization technique. Results from both methods were remarkably similar and showed that the performance decrease because of missing data is at first small before accelerating after large amounts of data are removed. This work provides a first approximation to assess how much data is required to produce good performance in multitask prediction exercises

Crossref

Directory of Open Access Journals

White Rose Research Online

Antigenic Complementarity in the Origins of Autoimmunity: A General Theory Illustrated With a Case Study of Idiopathic Thrombocytopenia Purpura

Author: Balint JP
Bar Meir E
Billingham MEJ
Blalock JE
Candelli M
Clancy R
Cruz MA
Emsley J
Esquivil PS
Fisgin T
Hamner DL
He R
He R
Hou M
Huang X
Humblot S
Ichiche M
Jacob Couturier
Kahane S
Kouwabunpat D
Krook A
Kurata Y
McGuire KL
McMillan R
Morrow WJW
Morrow WJW
Musaji A
Nardi M
Oyaizu N
Pearson WR
Pendergraft WF
Pendergraft WF
Plotz PH
Prakken BJ
Puram V
Rahal JJ
Reddy MM
Robert Root-Bernstein
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-Bernstein RS
Root-BernsteinRS
Sakata H
Shimizu A
Shoelson SE
Shoenfeld Y
Siemion IZ
Silvestris F
SonnabendJA
Stanojevic M
Stéphan F
Takahashi T
Takeuchi Y
Titani K
Trent RJ
Tropsha A
Ulmansky R
Van Spronsen DJ
Vicente V
Wadenvik H
Westall FC
Westall FC
Westall FC
Wright JF
Yang X-D
Zandman-Goddard G
Zhang ZY
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2006
Field of study

We describe a novel, testable theory of autoimmunity, outline novel predictions made by the theory, and illustrate its application to unravelling the possible causes of idiopathic thrombocytopenia purpura (ITP). Pairs of stereochemically complementary antigens induce complementary immune responses (antibody or T-cell) that create loss of regulation and civil war within the immune system itself. Antibodies attack antibodies creating circulating immune complexes; T-cells attack T-cells creating perivascular cuffing. This immunological civil war abrogates the self-nonself distinction. If at least one of the complementary antigens mimics a self antigen, then this unregulated immune response will target host tissues as well. Data demonstrating that complementary antigens are found in some animal models of autoimmunity and may be present in various human diseases, especially ITP, are reviewed. Specific mechanisms for preventing autoimmunity or suppressing existing autoimmunity are derived from the theory, and critical tests proposed. Finally, we argue that Koch's postulates are inadequate for establishing disease causation for multiple-antigen diseases and discuss the possibility that current research has failed to elucidate the causes of human autoimmune diseases because we are using the wrong criteria

Crossref

Directory of Open Access Journals

PubMed Central

Identification of Novel Functional Inhibitors of Acid Sphingomyelinase

Author: A Goede
A Guerra
A Haimovitz-Friedman
A Rebillard
A Sakata
A Schwarz
A Tropsha
AH Futerman
Astrid Friedl
BM Altura
C Andres
C de Duve
C De Simone
C Malaplate-Armand
C Williams
CA Lipinski
Christiane Mühle
D Amaratunga
D Canals
DA Konovalov
DG Altman
DL Streiner
DR Ragland
E Alpaydin
E Gulbins
E Gulbins
E Gulbins
E Schwarz
EB Roecker
EH Schuchman
EL Smith
Erich Gulbins
F Paris
FD Testai
G Dawson
G Gerebtzoff
G Pantaleo
GD Purvis 3rd
Gudrun M. Spitzer
H Fischer
H Grassmé
H Grassmé
H von der Voet
Howard Riezman
HP Deigner
HS Chung
I Guyon
I Kononenko
I Mierswa
IH Witten
J Kelder
J Kornhuber
J Kornhuber
J Kornhuber
J Kornhuber
J Kornhuber
J Riethmüller
JA Platts
Johannes Kornhuber
JP Hobson
K Kira
K Lanevskij
K Rose
K Thevissen
KA Becker
Klaus R. Liedl
L Breiman
L Zhang
LB Akella
Lothar Terfloth
M Feher
M Kölzer
M Kölzer
M Meloun
M Muehlbacher
M Pascual
M Zerara
MA Mikati
Markus Muehlbacher
Martin Reichel
MC Raff
MH Abraham
MK Cox
MN Ndengele
MW Bradbury
N Sakuragawa
O Cuvillier
P Garg
P Santana
P Willett
Philipp Tripal
PI Darroch
PP Van Veldhoven
R Bisiani
R Cecchelli
R Fluss
R Göggel
R Hurwitz
R Kohavi
R Kolesnick
R Liu
R Narayanan
R Todeschini
RE Toman
RJ Mintzer
RO Brady
RW Jenkins
S Albouz
S Cuzzocrea
S Elojeimy
S Jin
S Kirschnek
S Pandey
S Spiegel
S Spiegel
S Spiegel
S Trapp
S Vilar
S Wold
SR Mente
SS Castillo
Stefan Trapp
Stefanie Pechmann
T Ewing
Teja W. Groemer
U Norinder
V Svetnik
V Teichgräber
VA Levin
WJ Youden
WS Bush
X Han
X He
X-C Fu
Y Benjamini
Y Saeys
Y Wang
Y Yoshida
YA Hannun
ZF Yu
ZH Zhou
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

We describe a hitherto unknown feature for 27 small drug-like molecules, namely functional inhibition of acid sphingomyelinase (ASM). These entities named FIASMAs (Functional Inhibitors of Acid SphingoMyelinAse), therefore, can be potentially used to treat diseases associated with enhanced activity of ASM, such as Alzheimer's disease, major depression, radiation- and chemotherapy-induced apoptosis and endotoxic shock syndrome. Residual activity of ASM measured in the presence of 10 µM drug concentration shows a bimodal distribution; thus the tested drugs can be classified into two groups with lower and higher inhibitory activity. All FIASMAs share distinct physicochemical properties in showing lipophilic and weakly basic properties. Hierarchical clustering of Tanimoto coefficients revealed that FIASMAs occur among drugs of various chemical scaffolds. Moreover, FIASMAs more frequently violate Lipinski's Rule-of-Five than compounds without effect on ASM. Inhibition of ASM appears to be associated with good permeability across the blood-brain barrier. In the present investigation, we developed a novel structure-property-activity relationship by using a random forest-based binary classification learner. Virtual screening revealed that only six out of 768 (0.78%) compounds of natural products functionally inhibit ASM, whereas this inhibitory activity occurs in 135 out of 2028 (6.66%) drugs licensed for medical use in humans

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Online Research Database In Technology

Investigation of substituent effect of 1-(3,3-diphenylpropyl)-piperidinyl phenylacetamides on CCR5 binding affinity using QSAR and virtual screening techniques

Author: A. Atkinson
A. Golbraikh
A. Golbraikh
A. Tropsha
A.K. Debnath
A.O. Aptula
Antreas Afantitis
B. Efron
C. Hansch
D.W. Osten
F. Sellebjerg
G. Melagraki
Georgia Melagraki
Haralambos Sarimveis
J.N. Burrows
J.T. Leonard
John Markopoulos
K. Roy
M. Fischereder
M. Shen
M. Song
N. Pipitone
Olga Igglessi-Markopoulou
P.G. Andres
Panayiotis A. Koutentis
W. Kazmierski
W.P.A. Walters
Y. Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Predicting Binding Affinity of CSAR Ligands Using Both Structure-Based and Ligand-Based Approaches

Author: Alexander Tropsha
Artemenko A.
Artemenko A. G.
Biesiada J.
Breiman L.
Bursulaya B. D.
Chen Y.
Denis Fourches
Ding F.
Eugene Muratov
Feng Ding
Fourches D.
Hsieh J.-H.
Jain S. V
Kuz’min V. E.
Kuz’min V. E.
Maggiora G. M.
Muratov E. N.
Nikolay V. Dokholyan
Polishchuk P. G.
Scotti L.
Tropsha A.
Tropsha A.
Vapnik V. N.
Warren G. L.
Yin S.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2013
Field of study

We report on the prediction accuracy of ligand-based (2D QSAR) and structure-based (MedusaDock) methods used both independently and in consensus for ranking the congeneric series of ligands binding to three protein targets (UK, ERK2, and CHK1) from the CSAR 2011 benchmark exercise. An ensemble of predictive QSAR models was developed using known binders of these three targets extracted from the publicly-available ChEMBL database. Selected models were used to predict the binding affinity of CSAR compounds towards the corresponding targets and rank them accordingly; the overall ranking accuracy evaluated by Spearman correlation was as high as 0.78 for UK, 0.60 for ERK2, and 0.56 for CHK1, placing our predictions in top-10% among all the participants. In parallel, MedusaDock designed to predict reliable docking poses was also used for ranking the CSAR ligands according to their docking scores; the resulting accuracy (Spearman correlation) for UK, ERK2, and CHK1 were 0.76, 0.31, and 0.26, respectively. In addition, performance of several consensus approaches combining MedusaDock and QSAR predicted ranks altogether has been explored; the best approach yielded Spearman correlation coefficients for UK, ERK2, and CHK1 of 0.82, 0.50, and 0.45, respectively. This study shows that (i) externally validated 2D QSAR models were capable of ranking CSAR ligands at least as accurately as more computationally intensive structure-based approaches used both by us and by other groups and (ii) ligand-based QSAR models can complement structure-based approaches by boosting the prediction performances when used in consensus

Crossref

PubMed Central

Carolina Digital Repository

Application of Predictive QSAR Models to Database Mining: Identification and Experimental Validation of Novel Anticonvulsant Compounds

Author: Begley C. E.
Bialer M.
Burney R. G.
Cho S. J.
Choi D.
Cortes S.
Geurts M.
Golbraikh A.
Golbraikh A.
Güner O. F
Hoffman B. T.
Kohn H.
Kohn H.
Kovatcheva A.
Krall R. L.
Kurogi Y.
Lazar J.
LeTiran A.
Levy R. H.
Martin H.
Martin Y. C.
Mattson R. H.
MolConn Z
Nelson S. D.
Pellock J. M.
Rogawski M. A.
Sawamura M.
Shen M.
Sotaniemi E. A.
Tropsha A.
Tropsha A.
Tropsha A.
Tropsha A.
Tucher G. T
Unverferth K.
Zheng W.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

Layer-by-layer assembly of graphene on polyimide films via thermal imidization and synchronous reduction of graphene oxide

Author: A. Y. W. Sham
C. Cho
E. L. Cussler
G. Wang
H. Ito
M. C. Choi
M. Schneider
M.-H. Tsai
N. A. Kotov
R. K. Iler
R. Rajasekar
S. W. Keller
V. V. Tsukruk
W. S. Hummers
Y. G. Tropsha
Y. Long
Y. Lvov
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Receptor-guided 3D-QSAR approach for the discovery of c-kit tyrosine kinase inhibitors

Author: A Tropsha
Anna Maria Almerico
Antonino Lauria
D Ballabio
D Linnekin
G Bellone
J Marshall
J Zupan
KK Eklund
Marco Tutone
MC Heinrich
MH Potashman
RK Kunz
W Scherman
WL Wang
Y Dai
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Predictive QSAR workflow for the in silico identification and screening of novel HDAC inhibitors

Author: A Afantitis
A Afantitis
A Afantitis
A Afantitis
A Afantitis
A Golbraikh
A Tropsha
A Tropsha
A Tropsha
A Xie
AJM Ruijter de
AK Chakraborti
Antreas Afantitis
AT Balaban
B Liu
B Xia
CH Andrade
CW Yap
D Jaiswal
D-F Wang
DC Juvale
G Elaut
G Melagraki
G Melagraki
G Melagraki
G Melagraki
G Melagraki
G Melagraki
G Melagraki
George Kollias
Georgia Melagraki
H-F Chen
Haralambos Sarimveis
I Muegge
JJP Stewart
K Roy
KV Balakin
M Hewitt
M Jalali-Heravi
M Petitjean
MA Glozak
MS Castilho
NK Wagh
Olga Igglessi-Markopoulou
P Gallinari
P Ghosh
Panayiotis A. Koutentis
R Ragno
R Ragno
RVC Guido
RW Kennard
S Price
S Price
S Vadivelan
V Santini
W An
W Wu
WK Rasheed
Y Guo
Y-D Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Cheminformatics Meets Molecular Mechanics: A Combined Application of Knowledge-Based Pose Scoring and Physical Force Field-Based Hit Scoring Functions Improves the Accuracy of Structure-Based Virtual Screening

Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening, which is most frequently manifested in the scoring functions’ inability to discriminate between true ligands versus known non-binders (therefore designated as binding decoys). This deficiency leads to a large number of false positive hits resulting from virtual screening. We have hypothesized that filtering out or penalizing docking poses recognized as non-native (i.e., pose decoys) should improve the performance of virtual screening in terms of improved identification of true binders. Using several concepts from the field of cheminformatics, we have developed a novel approach to identifying pose decoys from an ensemble of poses generated by computational docking procedures. We demonstrate that the use of target-specific pose (-scoring) filter in combination with a physical force field-based scoring function (MedusaScore) leads to significant improvement of hit rates in virtual screening studies for 12 of the 13 benchmark sets from the clustered version of the Database of Useful Decoys (DUD). This new hybrid scoring function outperforms several conventional structure-based scoring functions, including XSCORE∷HMSCORE, ChemScore, PLP, and Chemgauss3, in six out of 13 data sets at early stage of VS (up 1% decoys of the screening database). We compare our hybrid method with several novel VS methods that were recently reported to have good performances on the same DUD data sets. We find that the retrieved ligands using our method are chemically more diverse in comparison with two ligand-based methods (FieldScreen and FLAP∷LBX). We also compare our method with FLAP∷RBLB, a high-performance VS method that also utilizes both the receptor and the cognate ligand structures. Interestingly, we find that the top ligands retrieved using our method are highly complementary to those retrieved using FLAP∷RBLB, hinting effective directions for best VS applications. We suggest that this integrative virtual screening approach combining cheminformatics and molecular mechanics methodologies may be applied to a broad variety of protein targets to improve the outcome of structure-based drug discovery studies

Crossref

PubMed Central

Carolina Digital Repository