150 research outputs found
The SWISSâMODEL Repository of annotated threeâdimensional protein structure homology models
The SWISSâMODEL Repository is a database of annotated threeâdimensional comparative protein structure models generated by the fully automated homologyâmodelling pipeline SWISSâMODEL. The Repository currently contains about 300 000 threeâdimensional models for sequences from the SwissâProt and TrEMBL databases. The content of the Repository is updated on a regular basis incorporating new sequences, taking advantage of new template structures becoming available and reflecting improvements in the underlying modelling algorithms. Each entry consists of one or more threeâdimensional protein models, the superposed template structures, the alignments on which the models are based, a summary of the modelling process and a force field based quality assessment. The SWISSâMODEL Repository can be queried via an interactive website at http://swissmodel.expasy. org/repository/. Annotation and crossâlinking of the models with other databases, e.g. SwissâProt on the ExPASy server, allow for seamless navigation between protein sequence and structure information. The aim of the SWISSâMODEL Repository is to provide access to an upâtoâdate collection of annotated threeâdimensional protein models generated by automated homology modelling, bridging the gap between sequence and structure database
The SWISS-MODEL Repository: new features and functionalities
The SWISS-MODEL Repository is a database of annotated 3D protein structure models generated by the SWISS-MODEL homology-modelling pipeline. As of September 2005, the repository contained 675â000 models for 604â000 different protein sequences of the UniProt database. Regular updates ensure that the content of the repository reflects the current state of sequence and structure databases, integrating new or modified target sequences, and making use of new template structures. Each Repository entry consists of one or more 3D models accompanied by detailed information about the target protein and the model building process: functional annotation, a detailed template selection log, target-template alignment, summary of the model building and model quality assessment. The SWISS-MODEL Repository is freely accessible at http://swissmodel.expasy.org/repositor
Critical assessment of methods of protein structure prediction: Progress and new directions in round XI
Modeling of protein structure from amino acid sequence now plays a major role in structural biology. Here we report new
developments and progress from the CASP11 community experiment, assessing the state of the art in structure modeling.
Notable points include the following: (1) New methods for predicting three dimensional contacts resulted in a few spectacular
template free models in this CASP, whereas models based on sequence homology to proteins with experimental structure
continue to be the most accurate. (2) Refinement of initial protein models, primarily using molecular dynamics related
approaches, has now advanced to the point where the best methods can consistently (though slightly) improve nearly all
models. (3) The use of relatively sparse NMR constraints dramatically improves the accuracy of models, and another type of
sparse data, chemical crosslinking, introduced in this CASP, also shows promise for producing better models. (4) A new
emphasis on modeling protein complexes, in collaboration with CAPRI, has produced interesting results, but also shows the
need for more focus on this area. (5) Methods for estimating the accuracy of models have advanced to the point where they
are of considerable practical use. (6) A first assessment demonstrates that models can sometimes successfully address biological
questions that motivate experimental structure determination. (7) There is continuing progress in accuracy of modeling
regions of structure not directly available by comparative modeling, while there is marginal or no progress in some other
areas
Assessing the local structural quality of transmembrane protein models using statistical potentials (QMEANBrane)
Motivation: Membrane proteins are an important class of biological macromolecules involved in many cellular key processes including signalling and transport. They account for one third of genes in the human genome and >50% of current drug targets. Despite their importance, experimental structural data are sparse, resulting in high expectations for computational modelling tools to help fill this gap. However, as many empirical methods have been trained on experimental structural data, which is biased towards soluble globular proteins, their accuracy for transmembrane proteins is often limited. Results: We developed a local model quality estimation method for membrane proteins (âQMEANBrane') by combining statistical potentials trained on membrane protein structures with a per-residue weighting scheme. The increasing number of available experimental membrane protein structures allowed us to train membrane-specific statistical potentials that approach statistical saturation. We show that reliable local quality estimation of membrane protein models is possible, thereby extending local quality estimation to these biologically relevant molecules. Availability and implementation: Source code and datasets are available on request. Contact: [email protected] Supplementary Information: Supplementary data are available at Bioinformatics onlin
QMEAN server for protein model quality estimation
Model quality estimation is an essential component of protein structure prediction, since ultimately the accuracy of a model determines its usefulness for specific applications. Usually, in the course of protein structure prediction a set of alternative models is produced, from which subsequently the most accurate model has to be selected. The QMEAN server provides access to two scoring functions successfully tested at the eighth round of the community-wide blind test experiment CASP. The user can choose between the composite scoring function QMEAN, which derives a quality estimate on the basis of the geometrical analysis of single models, and the clustering-based scoring function QMEANclust which calculates a global and local quality estimate based on a weighted all-against-all comparison of the models from the ensemble provided by the user. The web server performs a ranking of the input models and highlights potentially problematic regions for each model. The QMEAN server is available at http://swissmodel.expasy.org/qmea
Toward the estimation of the absolute quality of individual protein structure models
Motivation: Quality assessment of protein structures is an important part of experimental structure validation and plays a crucial role in protein structure prediction, where the predicted models may contain substantial errors. Most current scoring functions are primarily designed to rank alternative models of the same sequence supporting model selection, whereas the prediction of the absolute quality of an individual protein model has received little attention in the field. However, reliable absolute quality estimates are crucial to assess the suitability of a model for specific biomedical applications. Results: In this work, we present a new absolute measure for the quality of protein models, which provides an estimate of the âdegree of nativeness' of the structural features observed in a model and describes the likelihood that a given model is of comparable quality to experimental structures. Model quality estimates based on the QMEAN scoring function were normalized with respect to the number of interactions. The resulting scoring function is independent of the size of the protein and may therefore be used to assess both monomers and entire oligomeric assemblies. Model quality scores for individual models are then expressed as âZ-scores' in comparison to scores obtained for high-resolution crystal structures. We demonstrate the ability of the newly introduced QMEAN Z-score to detect experimentally solved protein structures containing significant errors, as well as to evaluate theoretical protein models. In a comprehensive QMEAN Z-score analysis of all experimental structures in the PDB, membrane proteins accumulate on one side of the score spectrum and thermostable proteins on the other. Proteins from the thermophilic organism Thermatoga maritima received significantly higher QMEAN Z-scores in a pairwise comparison with their homologous mesophilic counterparts, underlining the significance of the QMEAN Z-score as an estimate of protein stability. Availability: The Z-score calculation has been integrated in the QMEAN server available at: http://swissmodel.expasy.org/qmean. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin
Toward the estimation of the absolute quality of individual protein structure models
Motivation: Quality assessment of protein structures is an important part of experimental structure validation and plays a crucial role in protein structure prediction, where the predicted models may contain substantial errors. Most current scoring functions are primarily designed to rank alternative models of the same sequence supporting model selection, whereas the prediction of the absolute quality of an individual protein model has received little attention in the field. However, reliable absolute quality estimates are crucial to assess the suitability of a model for specific biomedical applications
Outer membrane β-barrel structure prediction through the lens of AlphaFold2
Most proteins found in the outer membrane of Gram-negative bacteria share a common domain: the transmembrane β-barrel. These outer membrane β-barrels (OMBBs) occur in multiple sizes, and different families with a wide range of functions evolved independently by amplification from a pool of homologous ancestral ββ-hairpins. This is part of the reason why predicting their three-dimensional (3D) structure, especially by homology modeling, is a major challenge. Recently, DeepMind's AlphaFold v2 (AF2) became the first structure prediction method to reach close-to-experimental atomic accuracy in CASP even for difficult targets. However, membrane proteins, especially OMBBs, were not abundant during its training, raising the question of how accurate the predictions are for these families. In this study, we assessed the performance of AF2 in the prediction of OMBBs of various topologies using an in-house-developed tool for the analysis of OMBB 3D structures, barrOs . In agreement with previous studies on other membrane protein classes, our results indicate that AF2 predicts OMBB structures at high accuracy independently of the use of templates, even for novel topologies absent from the training set. These results provide confidence on the models generated by AF2 and open the door to the structural elucidation of novel OMBB topologies identified in high-throughput OMBB annotation studies
QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information
ABSTRACT: BACKGROUND: The selection of the most accurate protein model from a set of alternatives is a crucial step in protein structure prediction both in template-based and ab initio approaches. Scoring functions have been developed which can either return a quality estimate for a single model or derive a score from the information contained in the ensemble of models for a given sequence. Local structural features occurring more frequently in the ensemble have a greater probability of being correct. Within the context of the CASP experiment, these so called consensus methods have been shown to perform considerably better in selecting good candidate models, but tend to fail if the best models are far from the dominant structural cluster. In this paper we show that model selection can be improved if both approaches are combined by pre-filtering the models used during the calculation of the structural consensus. RESULTS: Our recently published QMEAN composite scoring function has been improved by including an all-atom interaction potential term. The preliminary model ranking based on the new QMEAN score is used to select a subset of reliable models against which the structural consensus score is calculated. This scoring function called QMEANclust achieves a correlation coefficient of predicted quality score and GDT_TS of 0.9 averaged over the 98 CASP7 targets and perform significantly better in selecting good models from the ensemble of server models than any other groups participating in the quality estimation category of CASP7. Both scoring functions are also benchmarked on the MOULDER test set consisting of 20 target proteins each with 300 alternatives models generated by MODELLER. QMEAN outperforms all other tested scoring functions operating on individual models, while the consensus method QMEANclust only works properly on decoy sets containing a certain fraction of near-native conformations. We also present a local version of QMEAN for the per-residue estimation of model quality (QMEANlocal) and compare it to a new local consensus-based approach. CONCLUSION: Improved model selection is obtained by using a composite scoring function operating on single models in order to enrich higher quality models which are subsequently used to calculate the structural consensus. The performance of consensus-based methods such as QMEANclust highly depends on the composition and quality of the model ensemble to be analysed. Therefore, performance estimates for consensus methods based on large meta-datasets (e.g. CASP) might overrate their applicability in more realistic modelling situations with smaller sets of models based on individual methods
- âŚ