16 research outputs found
Testing Biochemistry Revisited: How In Vivo Metabolism Can Be Understood from In Vitro Enzyme Kinetics
A decade ago, a team of biochemists including two of us, modeled yeast glycolysis and showed that one of the most studied biochemical pathways could not be quite understood in terms of the kinetic properties of the constituent enzymes as measured in cell extract. Moreover, when the same model was later applied to different experimental steady-state conditions, it often exhibited unrestrained metabolite accumulation
Benchmark Data Sets for Structure-Based Computational Target Prediction
Structure-based
computational target prediction methods identify
potential targets for a bioactive compound. Methods based on protein–ligand
docking so far face many challenges, where the greatest probably is
the ranking of true targets in a large data set of protein structures.
Currently, no standard data sets for evaluation exist, rendering comparison
and demonstration of improvements of methods cumbersome. Therefore,
we propose two data sets and evaluation strategies for a meaningful
evaluation of new target prediction methods, i.e., a small data set
consisting of three target classes for detailed proof-of-concept and
selectivity studies and a large data set consisting of 7992 protein
structures and 72 drug-like ligands allowing statistical evaluation
with performance metrics on a drug-like chemical space. Both data
sets are built from openly available resources, and any information
needed to perform the described experiments is reported. We describe
the composition of the data sets, the setup of screening experiments,
and the evaluation strategy. Performance metrics capable to measure
the early recognition of enrichments like AUC, BEDROC, and NSLR are
proposed. We apply a sequence-based target prediction method to the
large data set to analyze its content of nontrivial evaluation cases.
The proposed data sets are used for method evaluation of our new inverse
screening method <i>i</i>RAISE. The small data set reveals
the method’s capability and limitations to selectively distinguish
between rather similar protein structures. The large data set simulates
real target identification scenarios. <i>i</i>RAISE achieves
in 55% excellent or good enrichment a median AUC of 0.67 and RMSDs
below 2.0 Ã… for 74% and was able to predict the first true target
in 59 out of 72 cases in the top 2% of the protein data set of about
8000 structures
Discriminative Chemical Patterns: Automatic and Interactive Design
The
classification of molecules with respect to their inhibiting,
activating, or toxicological potential constitutes a central aspect
in the field of cheminformatics. Often, a discriminative feature is
needed to distinguish two different molecule sets. Besides physicochemical
properties, substructures and chemical patterns belong to the descriptors
most frequently applied for this purpose. As a commonly used example
of this descriptor class, SMARTS strings represent a powerful concept
for the representation and processing of abstract chemical patterns.
While their usage facilitates a convenient way to apply previously
derived classification rules on new molecule sets, the manual generation
of useful SMARTS patterns remains a complex and time-consuming process.
Here, we introduce SMARTSminer, a new algorithm for the automatic
derivation of discriminative SMARTS patterns from preclassified molecule
sets. Based on a specially adapted subgraph mining algorithm, SMARTSminer
identifies structural features that are frequent in only one of the
given molecule classes. In comparison to elemental substructures,
it also supports the consideration of general and specific SMARTS
features. Furthermore, SMARTSminer is integrated into an interactive
pattern editor named SMARTSeditor. This allows for an intuitive visualization
on the basis of the SMARTSviewer concept as well as interactive adaption
and further improvement of the generated patterns. Additionally, a
new molecular matching feature provides an immediate feedback on a
pattern’s matching behavior across the molecule sets. We demonstrate
the utility of the SMARTSminer functionality and its integration into
the SMARTSeditor software in several different classification scenarios
Fast Protein Binding Site Comparison via an Index-Based Screening Technology
We present TrixP, a new index-based method for fast protein
binding site comparison and function prediction. TrixP determines
binding site similarities based on the comparison of descriptors that
encode pharmacophoric and spatial features. Therefore, it adopts the
efficient core components of TrixX, a structure-based virtual screening
technology for large compound libraries. TrixP expands this technology
by new components in order to allow a screening of protein libraries.
TrixP accounts for the inherent flexibility of proteins employing
a partial shape matching routine. After the identification of structures
with matching pharmacophoric features and geometric shape, TrixP superimposes
the binding sites and, finally, assesses their similarity according
to the fit of pharmacophoric properties. TrixP is able to find analogies
between closely and distantly related binding sites. Recovery rates
of 81.8% for similar binding site pairs, assisted by rejecting rates
of 99.5% for dissimilar pairs on a test data set containing 1331 pairs,
confirm this ability. TrixP exclusively identifies members of the
same protein family on top ranking positions out of a library consisting
of 9802 binding sites. Furthermore, 30 predicted kinase binding sites
can almost perfectly be classified into their known subfamilies