343,917 research outputs found

    Protein secondary structure: Entropy, correlations and prediction

    Get PDF
    Is protein secondary structure primarily determined by local interactions between residues closely spaced along the amino acid backbone, or by non-local tertiary interactions? To answer this question we have measured the entropy densities of primary structure and secondary structure sequences, and the local inter-sequence mutual information density. We find that the important inter-sequence interactions are short ranged, that correlations between neighboring amino acids are essentially uninformative, and that only 1/4 of the total information needed to determine the secondary structure is available from local inter-sequence correlations. Since the remaining information must come from non-local interactions, this observation supports the view that the majority of most proteins fold via a cooperative process where secondary and tertiary structure form concurrently. To provide a more direct comparison to existing secondary structure prediction methods, we construct a simple hidden Markov model (HMM) of the sequences. This HMM achieves a prediction accuracy comparable to other single sequence secondary structure prediction algorithms, and can extract almost all of the inter-sequence mutual information. This suggests that these algorithms are almost optimal, and that we should not expect a dramatic improvement in prediction accuracy. However, local correlations between secondary and primary structure are probably of under-appreciated importance in many tertiary structure prediction methods, such as threading.Comment: 8 pages, 5 figure

    Representing and comparing protein structures as paths in three-dimensional space

    Get PDF
    BACKGROUND: Most existing formulations of protein structure comparison are based on detailed atomic level descriptions of protein structures and bypass potential insights that arise from a higher-level abstraction. RESULTS: We propose a structure comparison approach based on a simplified representation of proteins that describes its three-dimensional path by local curvature along the generalized backbone of the polypeptide. We have implemented a dynamic programming procedure that aligns curvatures of proteins by optimizing a defined sum turning angle deviation measure. CONCLUSION: Although our procedure does not directly optimize global structural similarity as measured by RMSD, our benchmarking results indicate that it can surprisingly well recover the structural similarity defined by structure classification databases and traditional structure alignment programs. In addition, our program can recognize similarities between structures with extensive conformation changes that are beyond the ability of traditional structure alignment programs. We demonstrate the applications of procedure to several contexts of structure comparison. An implementation of our procedure, CURVE, is available as a public webserver

    AS2TS system for protein structure modeling and analysis

    Get PDF
    We present a set of programs and a website designed to facilitate protein structure comparison and protein structure modeling efforts. Our protein structure analysis and comparison services use the LGA (local-global alignment) program to search for regions of local similarity and to evaluate the level of structural similarity between compared protein structures. To facilitate the homology-based protein structure modeling process, our AL2TS service translates given sequence–structure alignment data into the standard Protein Data Bank (PDB) atom records (coordinates). For a given sequence of amino acids, the AS2TS (amino acid sequence to tertiary structure) system calculates (e.g. using PSI-BLAST PDB analysis) a list of the closest proteins from the PDB, and then a set of draft 3D models is automatically created. Web services are available at

    Binding Ligand Prediction for Proteins Using Partial Matching of Local Surface Patches

    Get PDF
    Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group

    Two-Dimensional Infrared Spectroscopy of Antiparallel β-Sheet Secondary Structure

    Get PDF
    We investigate the sensitivity of femtosecond Fourier transform two-dimensional infrared spectroscopy to protein secondary structure with a study of antiparallel β-sheets. The results show that 2D IR spectroscopy is more sensitive to structural differences between proteins than traditional infrared spectroscopy, providing an observable that allows comparison to quantitative models of protein vibrational spectroscopy. 2D IR correlation spectra of the amide I region of poly-L-lysine, concanavalin A, ribonuclease A, and lysozyme show cross-peaks between the IR-active transitions that are characteristic of amide I couplings for polypeptides in antiparallel hydrogen-bonding registry. For poly-L-lysine, the 2D IR spectrum contains the eight-peak structure expected for two dominant vibrations of an extended, ordered antiparallel β-sheet. In the proteins with antiparallel β-sheets, interference effects between the diagonal and cross-peaks arising from the sheets, combined with diagonally elongated resonances from additional amide transitions, lead to a characteristic “Z”-shaped pattern for the amide I region in the 2D IR spectrum. We discuss in detail how the number of strands in the sheet, the local configurational disorder in the sheet, the delocalization of the vibrational excitation, and the angle between transition dipole moments affect the position, splitting, amplitude, and line shape of the cross-peaks and diagonal peaks.

    Hydrogen bond rotations as a uniform structural tool for analyzing protein architecture

    Get PDF
    Proteins fold into three-dimensional structures, which determine their diverse functions. The conformation of the backbone of each structure is locally at each Cα effectively described by conformational angles resulting in Ramachandran plots. These, however, do not describe the conformations around hydrogen bonds, which can be non-local along the backbone and are of major importance for protein structure. Here, we introduce the spatial rotation between hydrogen bonded peptide planes as a new descriptor for protein structure locally around a hydrogen bond. Strikingly, this rotational descriptor sampled over high-quality structures from the protein data base (PDB) concentrates into 30 localized clusters, some of which correlate to the common secondary structures and others to more special motifs, yet generally providing a unifying systematic classification of local structure around protein hydrogen bonds. It further provides a uniform vocabulary for comparison of protein structure near hydrogen bonds even between bonds in different proteins without alignment

    Integrated web service for improving alignment quality based on segments comparison

    Get PDF
    BACKGROUND: Defining blocks forming the global protein structure on the basis of local structural regularity is a very fruitful idea, extensively used in description, and prediction of structure from only sequence information. Over many years the secondary structure elements were used as available building blocks with great success. Specially prepared sets of possible structural motifs can be used to describe similarity between very distant, non-homologous proteins. The reason for utilizing the structural information in the description of proteins is straightforward. Structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. RESULTS: Here we provide a new fragment library for Local Structure Segment (LSS) prediction called FRAGlib which is integrated with a previously described segment alignment algorithm SEA. A joined FRAGlib/SEA server provides easy access to both algorithms, allowing a one stop alignment service using a novel approach to protein sequence alignment based on a network matching approach. The FRAGlib used as secondary structure prediction achieves only 73% accuracy in Q3 measure, but when combined with the SEA alignment, it achieves a significant improvement in pairwise sequence alignment quality, as compared to previous SEA implementation and other public alignment algorithms. The FRAGlib algorithm takes ~2 min. to search over FRAGlib database for a typical query protein with 500 residues. The SEA service align two typical proteins within circa ~5 min. All supplementary materials (detailed results of all the benchmarks, the list of test proteins and the whole fragments library) are available for download on-line at . CONCLUSIONS: The joined FRAGlib/SEA server will be a valuable tool both for molecular biologists working on protein sequence analysis and for bioinformaticians developing computational methods of structure prediction and alignment of proteins

    ProteinDBS v2.0: a web server for global and local protein structure search

    Get PDF
    ProteinDBS v2.0 is a web server designed for efficient and accurate comparisons and searches of structurally similar proteins from a large-scale database. It provides two comparison methods, global-to-global and local-to-local, to facilitate the searches of protein structures or substructures. ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a high-running speed while preserving reasonably high precision of structural comparison. The experimental results show that our system is able to return results of global comparisons in seconds from a complete Protein Data Bank (PDB) database of 152 959 protein chains and that it takes much less time to complete local comparisons from a non-redundant database of 3276 proteins than other accurate comparison methods. ProteinDBS v2.0 supports query by PDB protein ID and by new structures uploaded by users. To our knowledge, this is the only search engine that can simultaneously support global and local comparisons. ProteinDBS v2.0 is a useful tool to investigate functional or evolutional relationships among proteins. Moreover, the common substructures identified by local comparison can be potentially used to assist the human curation process in discovering new domains or folds from the ever-growing protein structure databases. The system is hosted at http://ProteinDBS.rnet.missouri.edu

    Modeling and predicting all-α transmembrane proteins including helix–helix pairing

    Get PDF
    AbstractModeling and predicting the structure of proteins is one of the most important challenges of computational biology. Exact physical models are too complex to provide feasible prediction tools and other ab initio methods only use local and probabilistic information to fold a given sequence. We show in this paper that all-α transmembrane protein secondary and super-secondary structures can be modeled with a multi-tape S-attributed grammar. An efficient structure prediction algorithm using both local and global constraints is designed and evaluated. Comparison with existing methods shows that the prediction rates as well as the definition level are sensibly increased. Furthermore this approach can be generalized to more complex proteins
    corecore