Search CORE

103 research outputs found

PocketMatch: A new algorithm to compare binding sites in protein structures

Author: A Argyrou
A Stark
AG Murzin
AT Laurie
B Huang
CA Orengo
F Glaser
G Dodson
G Ramachandraiah
GJ Kleywegt
HM Berman
JA Barker
JH Choi
Kalidas Yeturu
L Holm
M Jambon
Nagasuma Chandra
ND Gold
ND Gold
R Russell
R Wang
RA Laskowski
RJ Morris
S Schmitt
T Prasad
Y Kalidas
Publication venue
Publication date: 01/01/2008
Field of study

Background: Recognizing similarities and deriving relationships among protein molecules is a fundamental
requirement in present-day biology. Similarities can be present at various levels which can be detected through comparison of protein sequences or their structural folds. In some cases similarities obscure at these levels could be present merely in the substructures at their binding sites. Inferring functional similarities between protein molecules by comparing their binding sites is still largely exploratory and not as yet a routine protocol. One of
the main reasons for this is the limitation in the choice of appropriate analytical tools that can compare binding sites with high sensitivity. To benefit from the enormous amount of structural data that is being rapidly accumulated, it is essential to have high throughput tools that enable large scale binding site comparison.

Results: Here we present a new algorithm PocketMatch for comparison of binding sites in a frame invariant
manner. Each binding site is represented by 90 lists of sorted distances capturing shape and chemical nature of the site. The sorted arrays are then aligned using an incremental alignment method and scored to obtain PMScores for pairs of sites. A comprehensive sensitivity analysis and an extensive validation of the algorithm have been carried out. Perturbation studies where the geometry of a given site was retained but the residue types were changed randomly, indicated that chance similarities were virtually non-existent. Our analysis also demonstrates that shape information alone is insufficient to discriminate between diverse binding sites, unless
combined with chemical nature of amino acids.

Conclusions: A new algorithm has been developed to compare binding sites in accurate, efficient and
high-throughput manner. Though the representation used is conceptually simplistic, we demonstrate that along
with the new alignment strategy used, it is sufficient to enable binding comparison with high sensitivity. Novel methodology has also been presented for validating the algorithm for accuracy and sensitivity with respect to geometry and chemical nature of the site. The method is also fast and takes about 1/250th second for one comparison on a single processor. A parallel version on BlueGene has also been implemented

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Nature Precedings

PocketMatch: A new algorithm to compare binding sites in protein structures

Author: Yeturu Kalidas
Nagasuma Chandra
Publication venue
Publication date: 01/01/2008
Field of study

Crossref

Nature Precedings

LoopIng: A template-based tool for predicting the structure of protein loops

Author: ABDEL MESSIH MARIO ALFY FAHMY
Lepore Rosalba
Tramontano Anna
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

MOTIVATION: Predicting the structure of protein loops is very challenging, mainly because they are not necessarily subject to strong evolutionary pressure. This implies that, unlike the rest of the protein, standard homology modeling techniques are not very effective in modeling their structure. However, loops are often involved in protein function, hence inferring their structure is important for predicting protein structure as well as function. RESULTS: We describe a method, LoopIng, based on the Random Forest automated learning technique, which, given a target loop, selects a structural template for it from a database of loop candidates. Compared to the most recently available methods, LoopIng is able to achieve similar accuracy for short loops (4-10 residues) and significant enhancements for long loops (11-20 residues). The quality of the predictions is robust to errors that unavoidably affect the stem regions when these are modeled. The method returns a confidence score for the predicted template loops and has the advantage of being very fast (on average: 1 min/loop)

PubMed Central

Archivio della ricerca- Università di Roma La Sapienza

Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.

Author: Du Yushen
Gong Danyang
Jiang Lin
Shu Sara
Sun Ren
Wu Nicholas C
Wu Ting-Ting
Zhang Tianhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2016
Field of study

Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available.ImportanceTo fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Recommended from our members

Using orthologous and paralogous proteins to identify specificity determining residues

Author: Gelfand Mikhail S
Mirny Leonid A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/07/2014
Field of study

Background: Concepts of orthology and paralogy are become increasingly important as whole-genome comparison allows their identification in complete genomes. Functional specificity of proteins is assumed to be conserved among orthologs and is different among paralogs. We used this assumption to identify residues which determine specificity of protein-DNA and protein-ligand recognition. Finding such residues is crucial for understanding mechanisms of molecular recognition and for rational protein and drug design. Results: Assuming conservation of specificity among orthologs and different specificity of paralogs, we identify residues which correlate with this grouping by specificity. The method is taking advantage of complete genomes to find multiple orthologs and paralogs. The central part of this method is a procedure to compute statistical significance of the predictions. The procedure is based on a simple statistical model of protein evolution. When applied to a large family of bacterial transcription factors, our method identified 12 residues that are presumed to determine the protein-DNA and protein-ligand recognition specificity. Structural analysis of the proteins and available experimental results strongly support our predictions. Our results suggest new experiments aimed at rational re-design of specificity in bacterial transcription factors by a minimal number of mutations. Conclusions: While sets of orthologous and paralogous proteins can be easily derived from complete genomic sequences, our method can identify putative specificity determinants in such proteins

Harvard University - DASH

Structural Repertoire of HIV-1-Neutralizing Antibodies Targeting the CD4 Supersite in 14 Donors

Author: Adams
Aliaksandr Druz
An-Suei Yang
Anqi Zheng
Anthony P. West
Antonio Lanzavecchia
Baoshan Zhang
Barton F. Haynes
Binley
Braden
Burton
Calarese
Cardoso
Chen
Cinque Soto
Corti
Daniel Lingwood
Davide Corti
Dennis R. Burton
Desmyter
Diskin
Diskin
Doria-Rose
Emsley
Georgiev
Hoxie
Hraber
Huang
Hung-Pin Peng
Ivelin S. Georgiev
James C. Mullikin
Jardine
Johannes F. Scheid
John R. Mascola
Joseph P. Casazza
Julien
Klein
Kong
Krisha McKee
Kwon
Kwong
Kwong
Lawrence Shapiro
Lei Chen
Leonidas Stamatatos
Li
Liao
Lillian Tran
Lynch
Lyumkis
M. Gordon Joyce
Marie Pancera
Mark Connors
Mark K. Louder
Matthew D. Gray
McClure
McCoy
McGuire
Michael J. Ernandes
Michel C. Nussenzweig
Montefiori
Mouquet
Myron S. Cohen
Nancy S. Longo
Nicole A. Doria-Rose
North
Ofek
Padlan
Pamela J. Bjorkman
Pancera
Pejchal
Peter D. Kwong
Petrey
Priyamvada Acharya
Quentin J. Sattentau
Rebecca M. Lynch
Robert T. Bailer
Robin A. Weiss
Rudicell
Rui Kong
Ryu
Sanjay Srivatsan
Scheid
Scheid
Sijy O’Dell
Stephanie Moquin
Stephen D. Schmidt
Tatsiana Kirys
Tatyana Gindin
Tiller
Timothy S. Luongo
Tongqing Zhou
Walker
Wang
Wayne C. Koff
West
West
Wu
Wu
Wu
Xueling Wu
Yongping Yang
Zhongjia Yang
Zhou
Zhou
Zhou
Zhu
Zhu
Publication venue
Publication date: 01/01/2015
Field of study

The site on the HIV-1 gp120 glycoprotein that binds the CD4 receptor is recognized by broadly reactive antibodies, several of which neutralize over 90% of HIV-1 strains. To understand how antibodies achieve such neutralization, we isolated CD4-binding-site (CD4bs) antibodies and analyzed 16 co-crystal structures –8 determined here– of CD4bs antibodies from 14 donors. The 16 antibodies segregated by recognition mode and developmental ontogeny into two types: CDR H3-dominated and VH-gene-restricted. Both could achieve greater than 80% neutralization breadth, and both could develop in the same donor. Although paratope chemistries differed, all 16 gp120-CD4bs antibody complexes showed geometric similarity, with antibody-neutralization breadth correlating with antibody-angle of approach relative to the most effective antibody of each type. The repertoire for effective recognition of the CD4 supersite thus comprises antibodies with distinct paratopes arrayed about two optimal geometric orientations, one achieved by CDR H3 ontogenies and the other achieved by VH-gene-restricted ontogenies

Elsevier - Publisher Connector

Crossref

PubMed Central

Carolina Digital Repository

SARS-CoV-2 and Its Variants: The Pandemic of Unvaccinated

Author: Angius Fabrizio
Manzin Aldo
Pala Giuseppe
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2021
Field of study

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Cagliari

Including Functional Annotations and Extending the Collection of Structural Classifications of Protein Loops (ArchDB)

Author: Aviles Francesc X.
Enrique Querol E
Espadaler Jordi
Fernandez-Fuentes Narcis
Hermoso Antoni
Oliva Baldomero
Sternberg Michael J.E.
Publication venue: Libertas Academica
Publication date: 24/11/2009
Field of study

Loops represent an important part of protein structures. The study of loop is critical for two main reasons: First, loops are often involved in protein function, stability and folding. Second, despite improvements in experimental and computational structure prediction methods, modeling the conformation of loops remains problematic. Here, we present a structural classification of loops, ArchDB, a mine of information with application in both mentioned fields: loop structure prediction and function prediction. ArchDB (http://sbi.imim.es/archdb) is a database of classified protein loop motifs. The current database provides four different classification sets tailored for different purposes. ArchDB-40, a loop classification derived from SCOP40, well suited for modeling common loop motifs. Since features relevant to loop structure or function can be more easily determined on well-populated clusters, we have developed ArchDB-95, a loop classification derived from SCOP95. This new classification set shows a ~40% increase in the number of subclasses, and a large 7-fold increase in the number of putative structure/function-related subclasses. We also present ArchDB-EC, a classification of loop motifs from enzymes, and ArchDB-KI, a manually annotated classification of loop motifs from kinases. Information about ligand contacts and PDB sites has been included in all classification sets. Improvements in our classification scheme are described, as well as several new database features, such as the ability to query by conserved annotations, sequence similarity, or uploading 3D coordinates of a protein. The lengths of classified loops range between 0 and 36 residues long. ArchDB offers an exhaustive sampling of loop structures. Functional information about loops and links with related biological databases are also provided. All this information and the possibility to browse/query the database through a web-server outline an useful tool with application in the comparative study of loops, the analysis of loops involved in protein function and to obtain templates for loop modeling

PubMed Central

Spiral - Imperial College Digital Repository

Protein surface representation and analysis by dimension reduction

Author: Qureshi Rehman
Sacan Ahmet
Yang Heng
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Protein structures are better conserved than protein sequences, and consequently more functional information is available in structures than in sequences. However, proteins generally interact with other proteins and molecules via their surface regions and a backbone-only analysis of protein structures may miss many of the functional and evolutionary features. Surface information can help better elucidate proteins' functions and their interactions with other proteins. Computational analysis and comparison of protein surfaces is an important challenge to overcome to enable efficient and accurate functional characterization of proteins. Methods In this study we present a new method for representation and comparison of protein surface features. Our method is based on mapping the 3-D protein surfaces onto 2-D maps using various dimension reduction methods. We have proposed area and neighbor based metrics in order to evaluate the accuracy of this surface representation. In order to capture functionally relevant information, we encode geometric and biochemical features of the protein, such as hydrophobicity, electrostatic potential, and curvature, into separate color channels in the 2-D map. The resulting images can then be compared using efficient 2-D image registration methods to identify surface regions and features shared by proteins. Results We demonstrate the utility of our method and characterize its performance using both synthetic and real data. Among the dimension reduction methods investigated, SNE, LandmarkIsomap, Isomap, and Sammon's mapping provide the best performance in preserving the area and neighborhood properties of the original 3-D surface. The enriched 2-D representation is shown to be useful in characterizing the functional site of chymotrypsin and able to detect structural similarities in heat shock proteins. A texture mapping using the 2-D representation is also proposed as an interesting application to structure visualization.</p

Crossref

Directory of Open Access Journals

PubMed Central