MODBASE, a database of annotated comparative protein structure models and associated resources.

Author: Barkan David T
Carter Hannah
Davis Fred P
Eramian David
Eswar Narayanan
Karchin Rachel
Kelly Libusha
Mankoo Parminder
Marti-Renom Marc A
Pieper Ursula
Sali Andrej
Webb Ben M
Publication venue: eScholarship, University of California
Publication date: 23/10/2008

MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by MODPIPE, an automated modeling pipeline that relies primarily on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller). MODBASE currently contains 5,152,695 reliable models for domains in 1,593,209 unique protein sequences; only models based on statistically significant alignments and/or models assessed to have the correct fold are included. MODBASE also allows users to calculate comparative models on demand, through an interface to the MODWEB modeling server (http://salilab.org/modweb). Other resources integrated with MODBASE include databases of multiple protein structure alignments (DBAli), structurally defined ligand binding sites (LIGBASE), predicted ligand binding sites (AnnoLyze), structurally defined binary domain interfaces (PIBASE) and annotated single nucleotide polymorphisms and somatic mutations found in human proteins (LS-SNP, LS-Mut). MODBASE models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/).

B2G-FAR, a species-centered GO annotation repository

Author: Al-Shahrour
Altschul
Ana Conesa
Arnold
Ashburner
Barrell
Camon
Conesa
Espinoza
Götz
Holt
Huerta-Cepas
Joaquín Dopazo
Kersey
Marc-André Jehl
Marti-Renom
Myhre
Patricia Sebastián-León
Patrick Tischler
Quevillon
Rattei
Riley
Roland Arnold
Samuel Martín-Rodríguez
Sjölander
Stefan Götz
The Gene Ontology Consortium
The Uniprot Consortium
Thomas Rattei
Wise
Publication venue: Oxford University Press
Publication date: 01/04/2011

Motivation: Functional genomics research has expanded enormously in the last decade thanks to the cost reduction in high-throughput technologies and the development of computational tools that generate, standardize and share information on gene and protein function such as the Gene Ontology (GO). Nevertheless, many biologists, especially working with non-model organisms, still suffer from non-existing or low-coverage functional annotation, or simply struggle retrieving, summarizing and querying these data

University of Birmingham Research Portal

Public Library of Science (PLOS)

FLORA: a novel method to predict protein function from structure in diverse superfamilies

Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues.

CiteSeerX

UCL Discovery

Public Library of Science (PLOS)

A Kernel for Open Source Drug Discovery in Tropical Diseases

Open source drug discovery, a promising alternative avenue to conventional patent-based drug development, has so far remained elusive with few exceptions. A major stumbling block has been the absence of a critical mass of preexisting work that volunteers can improve through a series of granular contributions. This paper introduces the results from a newly assembled computational pipeline for identifying protein targets for drug discovery in ten organisms that cause tropical diseases. We have also experimentally tested two promising targets for their binding to commercially available drugs, validating one and invalidating the other. The resulting kernel provides a base of drug targets and lead candidates around which an open source community can nucleate. We invite readers to donate their judgment and in silico and in vitro experiments to develop these targets to the point where drug optimization can begin.

Public Library of Science (PLOS)

The Overlap of Small Molecule and Protein Binding Sites within Families of Protein Structures

Author: A Leo-Macias
A Sali
A Shulman-Peleg
AA Bogan
AC Stuart
AG Murzin
AL Brass
AM Sanchez
Andrej Sali
AP Higueruelo
B de Chassey
B Ma
B Qian
BR Howard
CD Thanos
CL Drum
D Datta
D Dimitropoulos
D Wilson
DA Erlanson
DR Caffrey
E Sokolskaja
FP Davis
Fred P. Davis
GJ Kleywegt
GR Crabtree
H Zhu
J Kuriyan
JA Wells
JJ Ellis
JM Chandonia
KS Thorn
L Parthasarathi
LL Conte
M Wurtele
MA Marti-Renom
MD Dyer
MR Arkin
O Keskin
Philip E. Bourne
R Elber
R Sedrani
RA Laskowski
RP Bhattacharyya
S Eyrisch
S Jones
SJ Projan
SL Lebeis
SR Collins
T Berg
T Clackson
T Kortemme
TD Bunney
X Wang
Y Ofran
Publication venue: Public Library of Science
Publication date: 01/02/2010

Protein–protein interactions are challenging targets for modulation by small molecules. Here, we propose an approach that harnesses the increasing structural coverage of protein complexes to identify small molecules that may target protein interactions. Specifically, we identify ligand and protein binding sites that overlap upon alignment of homologous proteins. Of the 2,619 protein structure families observed to bind proteins, 1,028 also bind small molecules (250–1000 Da), and 197 exhibit a statistically significant (p<0.01) overlap between ligand and protein binding positions. These “bi-functional positions”, which bind both ligands and proteins, are particularly enriched in tyrosine and tryptophan residues, similar to “energetic hotspots” described previously, and are significantly less conserved than mono-functional and solvent exposed positions. Homology transfer identifies ligands whose binding sites overlap at least 20% of the protein interface for 35% of domain–domain and 45% of domain–peptide mediated interactions. The analysis recovered known small-molecule modulators of protein interactions as well as predicted new interaction targets based on the sequence similarity of ligand binding sites. We illustrate the predictive utility of the method by suggesting structural mechanisms for the effects of sanglifehrin A on HIV virion production, bepridil on the cellular entry of anthrax edema factor, and fusicoccin on vertebrate developmental pathways. The results, available at http://pibase.janelia.org, represent a comprehensive collection of structurally characterized modulators of protein interactions, and suggest that homologous structures are a useful resource for the rational design of interaction modulators.
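
The 20% overlap criterion in the abstract above can be illustrated with a minimal sketch. It assumes that each binding site is represented as a set of alignment positions within a family alignment; the function names and the example positions are hypothetical and are not taken from the paper or from PIBASE. The sketch does not reproduce the statistical significance test (p<0.01) mentioned in the abstract.

```python
def interface_overlap(ligand_positions, interface_positions):
    """Fraction of protein-interface alignment positions that also bind the ligand.

    Both arguments are iterables of alignment column indices (an assumed
    representation, chosen for illustration only).
    """
    interface = set(interface_positions)
    if not interface:
        return 0.0
    shared = interface & set(ligand_positions)
    return len(shared) / len(interface)


def overlaps_interface(ligand_positions, interface_positions, threshold=0.20):
    """True if the ligand site covers at least `threshold` of the interface."""
    return interface_overlap(ligand_positions, interface_positions) >= threshold


# Hypothetical alignment positions for one family.
interface = [10, 11, 12, 15, 18, 22, 23, 30, 31, 40]
ligand = [12, 15, 16, 50]

print(interface_overlap(ligand, interface))   # 2 of 10 interface positions shared
print(overlaps_interface(ligand, interface))
```

Normalizing by the interface size rather than by the ligand site size follows the abstract's wording ("overlap at least 20% of the protein interface"); other normalizations would answer a different question.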

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Assessing the structural conservation of protein pockets to study functional and allosteric sites: implications for drug discovery

Author: Daura Xavier
Panjkovich Alejandro
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: With the classical, active-site oriented drug-development approach reaching its limits, protein ligand-binding sites in general and allosteric sites in particular are increasingly attracting the interest of medicinal chemists in the search for new types of targets and strategies to drug development. Given that allostery represents one of the most common and powerful means to regulate protein function, the traditional drug discovery approach of targeting active sites can be extended by targeting allosteric or regulatory protein pockets that may allow the discovery of not only novel drug-like inhibitors, but activators as well. The wealth of available protein structural data can be exploited to further increase our understanding of allosterism, which in turn may have therapeutic applications. A first step in this direction is to identify and characterize putative effector sites that may be present in already available structural data. Results: We performed a large-scale study of protein cavities as potential allosteric and functional sites, by integrating publicly available information on protein sequences, structures and active sites for more than a thousand protein families. By identifying common pockets across different structures of the same protein family we developed a method to measure the pocket's structural conservation. The method was first parameterized using known active sites. We characterized the predicted pockets in terms of sequence and structural conservation, backbone flexibility and electrostatic potential. Although these different measures do not tend to correlate, their combination is useful in selecting functional and regulatory sites, as a detailed analysis of a handful of protein families shows. We finally estimated the numbers of potential allosteric or regulatory pockets that may be present in the data set, finding that pockets with putative functional and effector characteristics are widespread across protein families. 
Conclusions: Our results show that structurally conserved pockets are a common feature of protein families. The structural conservation of protein pockets, combined with other characteristics, can be exploited in drug discovery procedures, in particular for the selection of the most appropriate target protein and pocket for the design of drugs against entire protein families or subfamilies (e.g. for the development of broad-spectrum antimicrobials) or against a specific protein (e.g. in attempting to reduce side effects).

Springer - Publisher Connector

Diposit Digital de Documents de la UAB

ModBase, a database of annotated comparative protein structure models, and associated resources

Author: A. Sali
A. Schlessinger
B. M. Webb
Bairoch
Baker
Biswas
Boeckmann
C. C. Huang
D. Schneidman-Duhovny
D. T. Barkan
Davis
Dutta
E. C. Meng
E. F. Pettersen
Eswar
Fiser
Giglia
H. Braberg
Hediger
Hewett
Hillisch
Hura
K. Sjolander
Kaneko
Klein
Kondo
Kroetz
M. S. Madhusudhan
Marti-Renom
Melo
Meng
Mochalkin
Orti
P. Sampathkumar
Pardob
Petoukhov
Pettersen
Pieper
Putnam
R. S. Datta
S. K. Burley
Sali
Schlessinger
Schomburg
Schwede
Shen
Smith
Suerbaum
T. E. Ferrin
U. Pieper
Wallner
Wu
Z. Yang
Zhu
Zmasek
Publication venue: Oxford University Press
Publication date: 01/01/2011

ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence–structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains 10 355 444 reliable models for domains in 2 421 920 unique protein sequences. ModBase allows users to update comparative models on demand, and request modeling of additional sequences through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are available through the ModBase interface as well as the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the SALIGN server for multiple sequence and structure alignment (http://salilab.org/salign), the ModEval server for predicting the accuracy of protein structure models (http://salilab.org/modeval), the PCSS server for predicting which peptides bind to a given protein (http://salilab.org/pcss) and the FoXS server for calculating and fitting Small Angle X-ray Scattering profiles (http://salilab.org/foxs).

Knowledge-based annotation of small molecule binding sites in proteins

Author: Bryant Stephen H
Madej Thomas
Panchenko Anna R
Shoemaker Benjamin A
Thangudu Ratna R
Tyagi Manoj
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: The study of protein-small molecule interactions is vital for understanding protein function and for practical applications in drug discovery. To benefit from the rapidly increasing structural data, it is essential to improve the tools that enable large scale binding site prediction with greater emphasis on their biological validity. Results: We have developed a new method for the annotation of protein-small molecule binding sites, using inference by homology, which allows us to extend annotation onto protein sequences without experimental data available. To ensure biological relevance of binding sites, our method clusters similar binding sites found in homologous protein structures based on their sequence and structure conservation. Binding sites which appear evolutionarily conserved among non-redundant sets of homologous proteins are given higher priority. After binding sites are clustered, position specific score matrices (PSSMs) are constructed from the corresponding binding site alignments. Together with other measures, the PSSMs are subsequently used to rank binding sites to assess how well they match the query and to better gauge their biological relevance. The method also facilitates a succinct and informative representation of observed and inferred binding sites from homologs with known three-dimensional structures, thereby providing the means to analyze conservation and diversity of binding modes. Furthermore, the chemical properties of small molecules bound to the inferred binding sites can be used as a starting point in small molecule virtual screening. The method was validated by comparison to other binding site prediction methods and to a collection of manually curated binding site annotations. We show that our method achieves a sensitivity of 72% at predicting biologically relevant binding sites and can accurately discriminate those sites that bind biological small molecules from non-biological ones.
Conclusions: A new algorithm has been developed to predict binding sites with high accuracy in terms of their biological validity. It also provides a common platform for function prediction, knowledge-based docking and for small molecule virtual screening. The method can be applied even for a query sequence without structure. The method is available at http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi.
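
The PSSM step mentioned in this abstract can be sketched generically as follows. This is a textbook log-odds construction under stated assumptions (a uniform background distribution, a fixed pseudocount, gap characters ignored), not the scoring scheme used by the authors' method; the function names and the example binding-site strings are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(aligned_sites, pseudocount=1.0):
    """Build a log-odds PSSM from equal-length aligned binding-site strings.

    Assumes a uniform background frequency (1/20) and skips any character
    that is not a standard amino acid, such as the gap symbol '-'.
    """
    if not aligned_sites:
        raise ValueError("need at least one aligned binding site")
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all aligned binding sites must have the same length")

    background = 1.0 / len(AMINO_ACIDS)
    pssm = []
    for col in range(length):
        counts = Counter(s[col] for s in aligned_sites if s[col] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        column = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        pssm.append(column)
    return pssm


def score_site(pssm, query_site):
    """Sum the per-column log-odds scores; unknown characters contribute 0."""
    return sum(column.get(aa, 0.0) for column, aa in zip(pssm, query_site))


# Hypothetical cluster of three aligned binding-site residue strings.
cluster = ["HDS", "HDS", "HES"]
pssm = build_pssm(cluster)
print(score_site(pssm, "HDS"), score_site(pssm, "AAA"))
```

A query site that matches the conserved residues of the cluster scores higher than an unrelated one, which is the property a ranking step would rely on.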

Springer - Publisher Connector