6 research outputs found
Improving Classical Substructure-Based Virtual Screening to Handle Extrapolation Challenges
Target-oriented substructure-based virtual screening
(sSBVS) of
molecules is a promising approach in drug discovery. Yet, there are
doubts whether sSBVS is suitable also for extrapolation, that is,
for detecting molecules that are very different from those used for
training. Herein, we evaluate the predictive power of classic virtual
screening methods, namely, similarity searching using Tanimoto coefficient
(MTC) and Naive Bayes (NB). As could be expected, these classic methods
perform better in interpolation than in extrapolation tasks. Consequently,
to enhance the predictive ability for extrapolation tasks, we introduce
the Shadow approach, in which inclusion relations between substructures
are considered, as opposed to the classic sSBVS methods that assume
independence between substructures. Specifically, we discard contributions
from substructures included in (“shaded” by) others
which are, in turn, included in the molecule of interest. Indeed,
the Shadow classifier significantly outperforms both MTC (<i>pValue</i> = 3.1 × 10<sup>–16</sup>) and NB (<i>pValue</i> = 3.5 × 10<sup>–9</sup>) in detecting
hits sharing low similarity with the training active molecules
Allele distribution of the CGEN-40003 amplicon according to EULAR response.
<p>The Y-axis indicates percentage of patients. The X-axis indicates EULAR response (good, moderate, none). The colored boxes indicate the size (base pair) of the longest allele.</p
Association between genotype and EULAR good response versus EULAR moderate response/no response.
*<p>Odds ratio for EULAR good response being 512 positive,</p>***<p>after correction for dependency.</p
Association between genotype and EULAR good response versus EULAR no response.
*<p>Odds ratio for EULAR good response being 512 positive; #adjusted p-value;</p>**<p>Odds ratio for EULAR good response when both alleles are ≤280;</p>***<p>after correction for dependency.</p
Demographic and clinical characteristics at baseline.
<p>Values are given as median (range) or number (percentage of total).</p>#<p>3 patients had missing smoking status.</p>##<p>115 patients had missing anti-CCP values.</p
Possible tests imposing a 10% genotype group size condition in the 156 patients with either EULAR good response or no response.
<p>Possible tests imposing a 10% genotype group size condition in the 156 patients with either EULAR good response or no response.</p