Search CORE

118 research outputs found

PAUL: Protein structural alignment using integer linear programming and Lagrangian relaxation

Author: A Caprara
Francisco S Domingues
G Mayr
Gunnar W Klau
IN Shindyalov
Inken Wohlers
J Jung
K Mizuguchi
L Holm
Lars Petzold
O Bachar
T Kawabata
Y Ye
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Crossref

CWI's Institutional Repository

Springer - Publisher Connector

PubMed Central

An optimized TOPS+ comparison method for enhanced TOPS models

Author: A Brazma
A Harrison
A Harrison
CA Orengo
CA Orengo
CA Orengo
CJ van Rijsbergen
D Gilbert
D Gilbert
D Westhead
David Gilbert
G Valiente
Gabriel Valiente
GJ Barton
GM Torrance
HM Berman
HM Grindley
I Koch
I Michalopoulos
IN Shindyalov
J Handl
J Viksna
K Mizuguchi
L Holm
LP Chew
M Veeramalai
M Veeramalai
M Veeramalai
Mallika Veeramalai
N Krasnogor
RB Russell
S Goldsmith-Fischman
SB Needleman
SS Krishna
T Madej
T Madej
TF Smith
VI Levenshtein
WR Taylor
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

This article has been made available through the Brunel Open Access Publishing Fund.Background Although methods based on highly abstract descriptions of protein structures, such as VAST and TOPS, can perform very fast protein structure comparison, the results can lack a high degree of biological significance. Previously we have discussed the basic mechanisms of our novel method for structure comparison based on our TOPS+ model (Topological descriptions of Protein Structures Enhanced with Ligand Information). In this paper we show how these results can be significantly improved using parameter optimization, and we call the resulting optimised TOPS+ method as advanced TOPS+ comparison method i.e. advTOPS+. Results We have developed a TOPS+ string model as an improvement to the TOPS [1-3] graph model by considering loops as secondary structure elements (SSEs) in addition to helices and strands, representing ligands as first class objects, and describing interactions between SSEs, and SSEs and ligands, by incoming and outgoing arcs, annotating SSEs with the interaction direction and type. Benchmarking results of an all-against-all pairwise comparison using a large dataset of 2,620 non-redundant structures from the PDB40 dataset [4] demonstrate the biological significance, in terms of SCOP classification at the superfamily level, of our TOPS+ comparison method. Conclusions Our advanced TOPS+ comparison shows better performance on the PDB40 dataset [4] compared to our basic TOPS+ method, giving 90 percent accuracy for SCOP alpha+beta; a 6 percent increase in accuracy compared to the TOPS and basic TOPS+ methods. It also outperforms the TOPS, basic TOPS+ and SSAP comparison methods on the Chew-Kedem dataset [5], achieving 98 percent accuracy. Software Availability: The TOPS+ comparison server is available at http://balabio.dcs.gla.ac.uk/mallika/WebTOPS/.This article is available through the Brunel Open Access Publishing Fun

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive

Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score

Author: A Sali
AG Murzin
AM Lisewski
AR Ortiz
AS Yang
CA Orengo
CA Orengo
D Baker
D Kihara
F Teichert
G Vogt
G Vriend
HM Berman
IN Shindyalov
J Moult
J Skolnick
J Skolnick
J Zhu
Jeffrey Skolnick
L Holm
L Holm
M Levitt
M Novotny
ML Sierk
ML Sierk
MS Waterman
N Siew
NN Alexandrov
NN Alexandrov
P Koehl
R Kolodny
R Leplae
RB Russell
RH Lathrop
Shashi Bhushan Pandit
T Akutsu
T Shibuya
TJ Oldfield
V Alesker
WR Taylor
Y Ye
Y Zhang
Y Zhang
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2008
Field of study

©2008 Pandit and Skolnick; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article is available from: http://www.biomedcentral.com/1471-2105/9/531doi:10.1186/1471-2105-9-531Background: Protein tertiary structure comparisons are employed in various fields of contemporary structural biology. Most structure comparison methods involve generation of an initial seed alignment, which is extended and/or refined to provide the best structural superposition between a pair of protein structures as assessed by a structure comparison metric. One such metric, the TM-score, was recently introduced to provide a combined structure quality measure of the coordinate root mean square deviation between a pair of structures and coverage. Using the TM-score, the TM-align structure alignment algorithm was developed that was often found to have better accuracy and coverage than the most commonly used structural alignment programs; however, there were a number of situations when this was not true. Results: To further improve structure alignment quality, the Fr-TM-align algorithm has been developed where aligned fragment pairs are used to generate the initial seed alignments that are then refined using dynamic programming to maximize the TM-score. For the assessment of the structural alignment quality from Fr-TM-align in comparison to other programs such as CE and TMalign, we examined various alignment quality assessment scores such as PSI and TM-score. The assessment showed that the structural alignment quality from Fr-TM-align is better in comparison to both CE and TM-align. On average, the structural alignments generated using Fr-TM-align have a higher TM-score (~9%) and coverage (~7%) in comparison to those generated by TM-align. Fr- TM-align uses an exhaustive procedure to generate initial seed alignments. Hence, the algorithm is computationally more expensive than TM-align. Conclusion: Fr-TM-align, a new algorithm that employs fragment alignment and assembly provides better structural alignments in comparison to TM-align. The source code and executables of Fr- TM-align are freely downloadable at: http://cssb.biology.gatech.edu/skolnick/files/FrTMalign/

Scholarly Materials And Research @ Georgia Tech

Crossref

Directory of Open Access Journals

PubMed Central

Predicting residue contacts using pragmatic correlated mutations method: reducing the false positives

Author: A Valencia
AI Shulman
Emil G Alexov
F Pazos
G Wang
GM Suel
I Halperin
IN Shindyalov
JD Thompson Higgins D
M Punta
ME Hatley
MS Singer
P Fariselli
P Fariselli
Petras J Kundrotas
PL Martelli
R Schneider
S Lockless
U Gobel
W Li
WP Russ
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Predicting residues' contacts using primary amino acid sequence alone is an important task that can guide 3D structure modeling and can verify the quality of the predicted 3D structures. The correlated mutations (CM) method serves as the most promising approach and it has been used to predict amino acids pairs that are distant in the primary sequence but form contacts in the native 3D structure of homologous proteins. RESULTS: Here we report a new implementation of the CM method with an added set of selection rules (filters). The parameters of the algorithm were optimized against fifteen high resolution crystal structures with optimization criterion that maximized the confidentiality of the predictions. The optimization resulted in a true positive ratio (TPR) of 0.08 for the CM without filters and a TPR of 0.14 for the CM with filters. The protocol was further benchmarked against 65 high resolution structures that were not included in the optimization test. The benchmarking resulted in a TPR of 0.07 for the CM without filters and to a TPR of 0.09 for the CM with filters. CONCLUSION: Thus, the inclusion of selection rules resulted to an overall improvement of 30%. In addition, the pair-wise comparison of TPR for each protein without and with filters resulted in an average improvement of 1.7. The methodology was implemented into a web server that is freely available to the public. The purpose of this implementation is to provide the 3D structure predictors with a tool that can help with ranking alternative models by satisfying the largest number of predicted contacts, as well as it can provide a confidence score for contacts in cases where structure is known

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Clemson University: TigerPrints

FLORA: a novel method to predict protein function from structure in diverse superfamilies

Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

UCL Discovery

PubMed Central

SECRET domain of variola virus CrmB protein can be a member of poxviral type II chemokine-binding proteins family

Author: A Alcami
A Alejo
A Carfi
AS Lalani
AS Lalani
BT Seet
BT Seet
CS Bond
Denis V Antonets
IF Charo
IN Shindyalov
JM Boomker
K Bryson
L Zhang
MB Ruiz-Arguello
MW Bahar
PL Arnold
Sergei N Shchelkunov
SN Shchelkunov
Tatyana S Nepomnyashchikh
W Rocchia
WL DeLano
Y Zhang
Y Zhang
Y Zhang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Variola virus (VARV) the causative agent of smallpox, eradicated in 1980, have wide spectrum of immunomodulatory proteins to evade host immunity. Recently additional biological activity was discovered for VARV CrmB protein, known to bind and inhibit tumour necrosis factor (TNF) through its N-terminal domain homologous to cellular TNF receptors. Besides binding TNF, this protein was also shown to bind with high affinity several chemokines which recruit B- and T-lymphocytes and dendritic cells to sites of viral entry and replication. Ability to bind chemokines was shown to be associated with unique C-terminal domain of CrmB protein. This domain named SECRET (Smallpox virus-Encoded Chemokine Receptor) is unrelated to the host proteins and lacks significant homology with other known viral chemokine-binding proteins or any other known protein. Findings <it>De novo </it>modelling of VARV-CrmB SECRET domain spatial structure revealed its apparent structural homology with cowpox virus CC-chemokine binding protein (vCCI) and vaccinia virus A41 protein, despite low sequence identity between these three proteins. Potential ligand-binding surface of modelled VARV-CrmB SECRET domain was also predicted to bear prominent electronegative charge which is characteristic to known orthopoxviral chemokine-binding proteins. Conclusions Our results suggest that SECRET should be included into the family of poxviral type II chemokine-binding proteins and that it might have been evolved from the vCCI-like predecessor protein.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Building Science Gateways for Analysing Molecular Docking Results Using a Generic Framework and Methodology

Author: B Kramer
B Ludäscher
D Temelkovski
E Glaab
G Jones
G Terstyanszky
GM Morris
H Hasegawa
ID Kuntz
IN Shindyalov
J Krüger
JJ Irwin
K Wolstencroft
L Holm
M Jaghoori
P D'Ursi
P Kacsuk
P Kim
P Kunszt
S Forli
S Wang
SF Sousa
T Kiss
T Kiss
TA Wassenaar
WJ Allen
X Zhang
Z Farkas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Molecular docking and virtual screening experiments require large computational and data resources and high-level user interfaces in the form of science gateways. While science gateways supporting such experiments are relatively common, there is a clearly identified need to design and implement more complex environments for further analysis of docking results. This paper describes a generic framework and a related methodology that supports the efficient development of such environments. The framework is modular enabling the reuse of already existing components. The methodology, which proposes three techniques that the development team can use, is agile and encourages active participation of end-users. Based on the framework and methodology, two prototype implementations of science-gateway-based docking environments are presented and evaluated. The first system recommends a receptor-ligand pair for the next docking experiment, and the second filters docking results based on ligand properties

Crossref

WestminsterResearch

A framework for protein structure classification and identification of novel protein structures

Author: AC Martin
AC Murzin
AJ Enright
AP Singh
C Cortes
CA Orengo
D Chivian
D Frishman
G Getz
HK Saini
IN Shindyalov
J Gough
J Hou
JE Gewehr
Jignesh M Patel
JM Chandonia
L Holm
L Holm
L Lo Conte
M Madera
N Beckmann
O Çamoglu
O Çamoglu
P Røgen
R Day
S Cheek
S Van Dongen
T Madej
You Jung Kim
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Protein structure classification plays a central role in understanding the function of a protein molecule with respect to all known proteins in a structure database. With the rapid increase in the number of new protein structures, the need for automated and accurate methods for protein classification is increasingly important. RESULTS: In this paper we present a unified framework for protein structure classification and identification of novel protein structures. The framework consists of a set of components for comparing, classifying, and clustering protein structures. These components allow us to accurately classify proteins into known folds, to detect new protein folds, and to provide a way of clustering the new folds. In our evaluation with SCOP 1.69, our method correctly classifies 86.0%, 87.7%, and 90.5% of new domains at family, superfamily, and fold levels. Furthermore, for protein domains that belong to new domain families, our method is able to produce clusters that closely correspond to the new families in SCOP 1.69. As a result, our method can also be used to suggest new classification groups that contain novel folds. CONCLUSION: We have developed a method called proCC for automatically classifying and clustering domains. The method is effective in classifying new domains and suggesting new domain families, and it is also very efficient. A web site offering access to proCC is freely available a

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Combination of scoring schemes for protein docking

Author: B Huang
C Zhang
CM Deane
D Kozakov
Dietmar Schomburg
F Melo
G Moont
H Neuvirth
I Halperin
IN Shindyalov
J Mintseris
JE Dennis
KE Gottschalk
L Lo Conte
M Meyer
O Martin
O Zimmermann
P Aloy
P Caffrey
P Chakrabarti
P Heuser
Philipp Heuser
R Development Core Team
RB Schnabel Koontz J.
RM Jackson
S Jones
V Grimm
WS Valdar
Publication venue: BioMed Central
Publication date: 01/08/2007
Field of study

Abstract Background Docking algorithms are developed to predict in which orientation two proteins are likely to bind under natural conditions. The currently used methods usually consist of a sampling step followed by a scoring step. We developed a weighted geometric correlation based on optimised atom specific weighting factors and combined them with our previously published amino acid specific scoring and with a comprehensive SVM-based scoring function. Results The scoring with the atom specific weighting factors yields better results than the amino acid specific scoring. In combination with SVM-based scoring functions the percentage of complexes for which a near native structure can be predicted within the top 100 ranks increased from 14% with the geometric scoring to 54% with the combination of all scoring functions. Especially for the enzyme-inhibitor complexes the results of the ranking are excellent. For half of these complexes a near-native structure can be predicted within the first 10 proposed structures and for more than 86% of all enzyme-inhibitor complexes within the first 50 predicted structures. Conclusion We were able to develop a combination of different scoring schemes which considers a series of previously described and some new scoring criteria yielding a remarkable improvement of prediction quality.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

Author: A Andreeva
A Stark
A Stark
BW Matthews
CJ Sigrist
CT Porter
E Krissinel
ED Scheeff
G Ausiello
GJ Kleywegt
Gong-Hua Li
H Ago
HM Berman
I Boltes
IN Shindyalov
JA Barker
JA Gerlt
JC Lagarias
Jing-Fei Huang
JS Fetrow
JW Torrance
K Kinoshita
L Holm
P Chen
PF Gherardini
RA Laskowski
RD Finn
RV Spriggs
S Schmitt
SF Altschul
SF Altschul
T Fawcett
T Madej
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in Protein Data Bank (PDB), thus, the functional prediction of these proteins has become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but these methods need to be improved further, such as, enhancing the accuracy, sensitivity, and the computational speed. Here, an accurate algorithm, the CMASA (Contact MAtrix based local Structural Alignment algorithm), has been developed to predict unknown functions of proteins based on the local protein structural similarity. This algorithm has been evaluated by building a test set including 164 enzyme families, and also been compared to other methods. Results The evaluation of CMASA shows that the CMASA is highly accurate (0.96), sensitive (0.86), and fast enough to be used in the large-scale functional annotation. Comparing to both sequence-based and global structure-based methods, not only the CMASA can find remote homologous proteins, but also can find the active site convergence. Comparing to other local structure comparison-based methods, the CMASA can obtain the better performance than both FFF (a method using geometry to predict protein function) and SPASM (a local structure alignment method); and the CMASA is more sensitive than PINTS and is more accurate than JESS (both are local structure alignment methods). The CMASA was applied to annotate the enzyme catalytic sites of the non-redundant PDB, and at least 166 putative catalytic sites have been suggested, these sites can not be observed by the Catalytic Site Atlas (CSA). Conclusions The CMASA is an accurate algorithm for detecting local protein structural similarity, and it holds several advantages in predicting enzyme active sites. The CMASA can be used in large-scale enzyme active site annotation. The CMASA can be available by the mail-based server (<url>http://159.226.149.45/other1/CMASA/CMASA.htm</url>).</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central