97 research outputs found
Protein pocket and ligand shape comparison and its application in virtual screening
Understanding molecular recognition is one major requirement for drug discovery and design. Physicochemical and shape complementarity between two binding partners is the driving force during complex formation. In this study, the impact of shape within this process is analyzed. Protein binding pockets and co-crystallized ligands are represented by normalized principal moments of inertia ratios (NPRs). The corresponding descriptor space is triangular, with its corners occupied by spherical, discoid, and elongated shapes. An analysis of a selected set of sc-PDB complexes suggests that pockets and bound ligands avoid spherical shapes, which are, however, prevalent in small unoccupied pockets. Furthermore, a direct shape comparison confirms previous studies that on average only one third of a pocket is filled by its bound ligand, supplemented by a 50 % subpocket coverage. In this study, we found that shape complementary is expressed by low pairwise shape distances in NPR space, short distances between the centers-of-mass, and small deviations in the angle between the first principal ellipsoid axes. Furthermore, it is assessed how different binding pocket parameters are related to bioactivity and binding efficiency of the co-crystallized ligand. In addition, the performance of different shape and size parameters of pockets and ligands is evaluated in a virtual screening scenario performed on four representative target
Conformator: A Novel Method for the Generation of Conformer Ensembles
Computer-aided drug design methods such as docking, pharmacophore searching, 3D database searching, and the creation of 3D-QSAR models need conformational ensembles to handle the flexibility of small molecules. Here, we present Conformator, an accurate and effective knowledge-based algorithm for generating conformer ensembles. With 99.9% of all test molecules processed, Conformator stands out by its robustness with respect to input formats, molecular geometries, and the handling of macrocycles. With an extended set of rules for sampling torsion angles, a novel algorithm for macrocycle conformer generation, and a new clustering algorithm for the assembly of conformer ensembles, Conformator reaches a median minimum root-mean-square deviation (measured between protein-bound ligand conformations and ensembles of a maximum of 250 conformers) of 0.47 Ã… with no significant difference to the highest-ranked commercial algorithm OMEGA and significantly higher accuracy than seven free algorithms, including the RDKit DG algorithm. Conformator is freely available for noncommercial use and academic research.acceptedVersio
ProteinsPlus: a web portal for structure analysis of macromolecules
With currently more than 126 000 publicly available structures and an
increasing growth rate, the Protein Data Bank constitutes a rich data source
for structure-driven research in fields like drug discovery, crop science and
biotechnology in general. Typical workflows in these areas involve manifold
computational tools for the analysis and prediction of molecular functions.
Here, we present the ProteinsPlus web server that offers a unified easy-to-use
interface to a broad range of tools for the early phase of structure-based
molecular modeling. This includes solutions for commonly required pre-
processing tasks like structure quality assessment (EDIA), hydrogen placement
(Protoss) and the search for alternative conformations (SIENA). Beyond that,
it also addresses frequent problems as the generation of 2D-interaction
diagrams (PoseView), protein–protein interface classification (HyPPI) as well
as automatic pocket detection and druggablity assessment (DoGSiteScorer). The
unified ProteinsPlus interface covering all featured approaches provides
various facilities for intuitive input and result visualization, case-specific
parameterization and download options for further processing. Moreover, its
generalized workflow allows the user a quick familiarization with the
different tools. ProteinsPlus also stores the calculated results temporarily
for future request and thus facilitates convenient result communication and
re-access. The server is freely available at http://proteins.plus
Fast automated placement of polar hydrogen atoms in protein-ligand complexes
<p>Abstract</p> <p>Background</p> <p>Hydrogen bonds play a major role in the stabilization of protein-ligand complexes. The ability of a functional group to form them depends on the position of its hydrogen atoms. An accurate knowledge of the positions of hydrogen atoms in proteins is therefore important to correctly identify hydrogen bonds and their properties. The high mobility of hydrogen atoms introduces several degrees of freedom: Tautomeric states, where a hydrogen atom alters its binding partner, torsional changes where the position of the hydrogen atom is rotated around the last heavy-atom bond in a residue, and protonation states, where the number of hydrogen atoms at a functional group may change. Also, side-chain flips in glutamine and asparagine and histidine residues, which are common crystallographic ambiguities must be identified before structure-based calculations can be conducted.</p> <p>Results</p> <p>We have implemented a method to determine the most probable hydrogen atom positions in a given protein-ligand complex. Optimality of hydrogen bond geometries is determined by an empirical scoring function which is used in molecular docking. This allows to evaluate protein-ligand interactions with an established model. Also, our method allows to resolve common crystallographic ambiguities such as as flipped amide groups and histidine residues. To ensure high speed, we make use of a dynamic programming approach.</p> <p>Conclusion</p> <p>Our results were checked against selected high-resolution structures from an external dataset, for which the positions of the hydrogen atoms have been validated manually. The quality of our results is comparable to that of other programs, with the advantage of being fast enough to be applied on-the-fly for interactive usage or during score evaluation.</p
- …