11 research outputs found

    Detection of unrealistic molecular environments in protein structures based on expected electron densities

    Get PDF
    Understanding the relationship between protein structure and biological function is a central theme in structural biology. Advances are severely hampered by errors in experimentally determined protein structures. Detection and correction of such errors is therefore of utmost importance. Electron densities in molecular structures obey certain rules which depend on the molecular environment. Here we present and discuss a new approach that relates electron densities computed from a structural model to densities expected from prior observations on identical or closely related molecular environments. Strong deviations of computed from expected densities reveal unrealistic molecular structures. Most importantly, structure analysis and error detection are independent of experimental data and hence may be applied to any structural model. The comparison to state-of-the-art methods reveals that our approach is able to identify errors that formerly remained undetected. The new technique, called RefDens, is accessible as a public web service at http://refdens.services.came.sbg.ac.at

    SHIFTX2: significantly improved protein chemical shift prediction

    Get PDF
    A new computer program, called SHIFTX2, is described which is capable of rapidly and accurately calculating diamagnetic 1H, 13C and 15N chemical shifts from protein coordinate data. Compared to its predecessor (SHIFTX) and to other existing protein chemical shift prediction programs, SHIFTX2 is substantially more accurate (up to 26% better by correlation coefficient with an RMS error that is up to 3.3Ɨ smaller) than the next best performing program. It also provides significantly more coverage (up to 10% more), is significantly faster (up to 8.5Ɨ) and capable of calculating a wider variety of backbone and side chain chemical shifts (up to 6Ɨ) than many other shift predictors. In particular, SHIFTX2 is able to attain correlation coefficients between experimentally observed and predicted backbone chemical shifts of 0.9800 (15N), 0.9959 (13CĪ±), 0.9992 (13CĪ²), 0.9676 (13Cā€²), 0.9714 (1HN), 0.9744 (1HĪ±) and RMS errors of 1.1169, 0.4412, 0.5163, 0.5330, 0.1711, and 0.1231Ā ppm, respectively. The correlation between SHIFTX2ā€™s predicted and observed side chain chemical shifts is 0.9787 (13C) and 0.9482 (1H) with RMS errors of 0.9754 and 0.1723Ā ppm, respectively. SHIFTX2 is able to achieve such a high level of accuracy by using a large, high quality database of training proteins (>190), by utilizing advanced machine learning techniques, by incorporating many more features (Ļ‡2 and Ļ‡3 angles, solvent accessibility, H-bond geometry, pH, temperature), and by combining sequence-based with structure-based chemical shift prediction techniques. With this substantial improvement in accuracy we believe that SHIFTX2 will open the door to many long-anticipated applications of chemical shift prediction to protein structure determination, refinement and validation. SHIFTX2 is available both as a standalone program and as a web server (http://www.shiftx2.ca)

    SimShiftDB; local conformational restraints derived from chemical shift similarity searches on a large synthetic database

    Get PDF
    We present SimShiftDB, a new program to extract conformational data from protein chemical shifts using structural alignments. The alignments are obtained in searches of a large database containing 13,000 structures and corresponding back-calculated chemical shifts. SimShiftDB makes use of chemical shift data to provide accurate results even in the case of low sequence similarity, and with even coverage of the conformational search space. We compare SimShiftDB to HHSearch, a state-of-the-art sequence-based search tool, and to TALOS, the current standard tool for the task. We show that for a significant fraction of the predicted similarities, SimShiftDB outperforms the other two methods. Particularly, the high coverage afforded by the larger database often allows predictions to be made for residues not involved in canonical secondary structure, where TALOS predictions are both less frequent and more error prone. Thus SimShiftDB can be seen as a complement to currently available methods

    A 2-Approximation Algorithm for Sorting by Prefix Reversals c ā—‹ Springer-Verlag

    No full text
    Abstract. Sorting by Prefix Reversals, also known as Pancake Flipping, is the problem of transforming a given permutation into the identity permutation, where the only allowed operations are reversals of a prefix of the permutation. The problem complexity is still unknown, and no algorithm with an approximation ratio better than 3 is known. We present the first polynomial-time 2-approximation algorithm to solve this problem. Empirical tests suggest that the average performance is in fact better than 2.

    BIOINFORMATICS ORIGINAL PAPER Structural bioinformatics SimShift: Identifying structural similarities from NMR chemical shifts

    No full text
    Motivation: An important quantity that arises in NMR spectroscopy experiments is the chemical shift. The interpretation of these data is mostly done by human experts; to our knowledge there are no algorithms that predict protein structure from chemical shift sequences alone. One approach to facilitate this process could be to compare two such sequences, where the structure of one protein has already been resolved. Our claim is that similarity of chemical shifts thereby found implies structural similarity of the respective proteins. Results: We present an algorithm to identify structural similarities of proteins by aligning their associated chemical shift sequences. To evaluate the correctness of our predictions, we propose a benchmark set of protein pairs that have high structural similarity, but low sequence similarity (because with high sequence similarity the structural similarities could easily be detected by a sequence alignment algorithm). We compare our results with those of HHsearch and SSEA and show that our method outperforms both in>50 % of all cases. Contact

    Real Space Refinement of Crystal Structures with Canonical Distributions of Electrons

    Get PDF
    Recurring groups of atoms in molecules are surrounded by specific canonical distributions of electrons. Deviations from these distributions reveal unrealistic molecular geometries. Here, we show how canonical electron densities can be combined with classical electron densities derived from X-ray diffraction experiments to drive the real space refinement of crystal structures. The refinement process generally yields superior molecular models with reduced excess electron densities and improved stereochemistry without compromising the agreement between molecular models and experimental data
    corecore