300 research outputs found

    Statistical Properties of Contact Maps

    Full text link
    A contact map is a simple representation of the structure of proteins and other chain-like macromolecules. This representation is quite amenable to numerical studies of folding. We show that the number of contact maps corresponding to the possible configurations of a polypeptide chain of N amino acids, represented by (N-1)-step self avoiding walks on a lattice, grows exponentially with N for all dimensions D>1. We carry out exact enumerations in D=2 on the square and triangular lattices for walks of up to 20 steps and investigate various statistical properties of contact maps corresponding to such walks. We also study the exact statistics of contact maps generated by walks on a ladder.Comment: Latex file, 15 pages, 12 eps figures. To appear on Phys. Rev.

    Computation of protein geometry and its applications: Packing and function prediction

    Full text link
    This chapter discusses geometric models of biomolecules and geometric constructs, including the union of ball model, the weigthed Voronoi diagram, the weighted Delaunay triangulation, and the alpha shapes. These geometric constructs enable fast and analytical computaton of shapes of biomoleculres (including features such as voids and pockets) and metric properties (such as area and volume). The algorithms of Delaunay triangulation, computation of voids and pockets, as well volume/area computation are also described. In addition, applications in packing analysis of protein structures and protein function prediction are also discussed.Comment: 32 pages, 9 figure

    Molecular dynamics simulation of polymer helix formation using rigid-link methods

    Full text link
    Molecular dynamics simulations are used to study structure formation in simple model polymer chains that are subject to excluded volume and torsional interactions. The changing conformations exhibited by chains of different lengths under gradual cooling are followed until each reaches a state from which no further change is possible. The interactions are chosen so that the true ground state is a helix, and a high proportion of simulation runs succeed in reaching this state; the fraction that manage to form defect-free helices is a function of both chain length and cooling rate. In order to demonstrate behavior analogous to the formation of protein tertiary structure, additional attractive interactions are introduced into the model, leading to the appearance of aligned, antiparallel helix pairs. The simulations employ a computational approach that deals directly with the internal coordinates in a recursive manner; this representation is able to maintain constant bond lengths and angles without the necessity of treating them as an algebraic constraint problem supplementary to the equations of motion.Comment: 15 pages, 14 figure

    Semiclassical Mechanics of the Wigner 6j-Symbol

    Full text link
    The semiclassical mechanics of the Wigner 6j-symbol is examined from the standpoint of WKB theory for multidimensional, integrable systems, to explore the geometrical issues surrounding the Ponzano-Regge formula. The relations among the methods of Roberts and others for deriving the Ponzano-Regge formula are discussed, and a new approach, based on the recoupling of four angular momenta, is presented. A generalization of the Yutsis-type of spin network is developed for this purpose. Special attention is devoted to symplectic reduction, the reduced phase space of the 6j-symbol (the 2-sphere of Kapovich and Millson), and the reduction of Poisson bracket expressions for semiclassical amplitudes. General principles for the semiclassical study of arbitrary spin networks are laid down; some of these were used in our recent derivation of the asymptotic formula for the Wigner 9j-symbol.Comment: 64 pages, 50 figure

    Use of machine learning algorithms to classify binary protein sequences as highly-designable or poorly-designable

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>By using a standard Support Vector Machine (SVM) with a Sequential Minimal Optimization (SMO) method of training, Naïve Bayes and other machine learning algorithms we are able to distinguish between two classes of protein sequences: those folding to highly-designable conformations, or those folding to poorly- or non-designable conformations.</p> <p>Results</p> <p>First, we generate all possible compact lattice conformations for the specified shape (a hexagon or a triangle) on the 2D triangular lattice. Then we generate all possible binary hydrophobic/polar (H/P) sequences and by using a specified energy function, thread them through all of these compact conformations. If for a given sequence the lowest energy is obtained for a particular lattice conformation we assume that this sequence folds to that conformation. Highly-designable conformations have many H/P sequences folding to them, while poorly-designable conformations have few or no H/P sequences. We classify sequences as folding to either highly – or poorly-designable conformations. We have randomly selected subsets of the sequences belonging to highly-designable and poorly-designable conformations and used them to train several different standard machine learning algorithms.</p> <p>Conclusion</p> <p>By using these machine learning algorithms with ten-fold cross-validation we are able to classify the two classes of sequences with high accuracy – in some cases exceeding 95%.</p
    • 

    corecore