18 research outputs found

    Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment

    Get PDF
    We present the results for CAPRI Round 50, the fourth joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of twelve targets, including six dimers, three trimers, and three higher-order oligomers. Four of these were easy targets, for which good structural templates were available either for the full assembly, or for the main interfaces (of the higher-order oligomers). Eight were difficult targets for which only distantly related templates were found for the individual subunits. Twenty-five CAPRI groups including eight automatic servers submitted ~1250 models per target. Twenty groups including six servers participated in the CAPRI scoring challenge submitted ~190 models per target. The accuracy of the predicted models was evaluated using the classical CAPRI criteria. The prediction performance was measured by a weighted scoring scheme that takes into account the number of models of acceptable quality or higher submitted by each group as part of their five top-ranking models. Compared to the previous CASP-CAPRI challenge, top performing groups submitted such models for a larger fraction (70–75%) of the targets in this Round, but fewer of these models were of high accuracy. Scorer groups achieved stronger performance with more groups submitting correct models for 70–80% of the targets or achieving high accuracy predictions. Servers performed less well in general, except for the MDOCKPP and LZERD servers, who performed on par with human groups. In addition to these results, major advances in methodology are discussed, providing an informative overview of where the prediction of protein assemblies currently stands.Cancer Research UK, Grant/Award Number: FC001003; Changzhou Science and Technology Bureau, Grant/Award Number: CE20200503; Department of Energy and Climate Change, Grant/Award Numbers: DE-AR001213, DE-SC0020400, DE-SC0021303; H2020 European Institute of Innovation and Technology, Grant/Award Numbers: 675728, 777536, 823830; Institut national de recherche en informatique et en automatique (INRIA), Grant/Award Number: Cordi-S; Lietuvos Mokslo Taryba, Grant/Award Numbers: S-MIP-17-60, S-MIP-21-35; Medical Research Council, Grant/Award Number: FC001003; Japan Society for the Promotion of Science KAKENHI, Grant/Award Number: JP19J00950; Ministerio de Ciencia e Innovación, Grant/Award Number: PID2019-110167RB-I00; Narodowe Centrum Nauki, Grant/Award Numbers: UMO-2017/25/B/ST4/01026, UMO-2017/26/M/ST4/00044, UMO-2017/27/B/ST4/00926; National Institute of General Medical Sciences, Grant/Award Numbers: R21GM127952, R35GM118078, RM1135136, T32GM132024; National Institutes of Health, Grant/Award Numbers: R01GM074255, R01GM078221, R01GM093123, R01GM109980, R01GM133840, R01GN123055, R01HL142301, R35GM124952, R35GM136409; National Natural Science Foundation of China, Grant/Award Number: 81603152; National Science Foundation, Grant/Award Numbers: AF1645512, CCF1943008, CMMI1825941, DBI1759277, DBI1759934, DBI1917263, DBI20036350, IIS1763246, MCB1925643; NWO, Grant/Award Number: TOP-PUNT 718.015.001; Wellcome Trust, Grant/Award Number: FC00100

    Methods for the analysis and assessment of the three-dimensional structures of proteins and nucleic acids: development and applications

    No full text
    This dissertation describes three novel effective methods for the analysis and evaluation of biomolecular structures. The presented methods construct and utilize the Voronoi tessellation of atomic balls and the tessellation-derived interatomic contact areas. The first method, Voronota, is a method for computing the vertices of the Voronoi diagram of balls. It is capable of processing macromolecular structures efficiently by exploiting common patterns of atomic spatial arrangements. The second method, CAD-score (Contact Area Difference Score), is a highly effective method for the comparison of different conformations of macromolecules, for example, native and modeled structures. It is universally applicable for the comparison of structures of all the major types of macromolecules (proteins, nucleic acids and their complexes). The third method, VoroMQA (Voronoi diagram­-based Model Quality Assessment), is a method for the evaluation of predicted protein structures when the native structure is unknown. It efficiently combines the idea of knowledge-based statistical potential with the concept of interatomic contact areas derived from the Voronoi tessellation of atomic balls, it consistently outperforms other statistical potential-based methods. The main conclusion of the presented studies is that Voronoi tessellation-derived contact areas effectively capture important structural features of biological macromolecules

    Baltymų ir nukleorūgščių erdvinių struktūrų analizės ir vertinimo metodai: kūrimas ir taikymas

    No full text
    This dissertation describes three novel effective methods for the analysis and evaluation of biomolecular structures. The presented methods construct and utilize the Voronoi tessellation of atomic balls and the tessellation-derived interatomic contact areas. The first method, Voronota, is a method for computing the vertices of the Voronoi diagram of balls. It is capable of processing macromolecular structures efficiently by exploiting common patterns of atomic spatial arrangements. The second method, CAD-score (Contact Area Difference Score), is a highly effective method for the comparison of different conformations of macromolecules, for example, native and modeled structures. It is universally applicable for the comparison of structures of all the major types of macromolecules (proteins, nucleic acids and their complexes). The third method, VoroMQA (Voronoi diagram­-based Model Quality Assessment), is a method for the evaluation of predicted protein structures when the native structure is unknown. It efficiently combines the idea of knowledge-based statistical potential with the concept of interatomic contact areas derived from the Voronoi tessellation of atomic balls, it consistently outperforms other statistical potential-based methods. The main conclusion of the presented studies is that Voronoi tessellation-derived contact areas effectively capture important structural features of biological macromolecules

    VoroMQA web server for assessing three-dimensional structures of proteins and protein complexes

    No full text
    The VoroMQA (Voronoi tessellation-based Model Quality Assessment) web server is dedicated to the estimation of protein structure quality, a common step in selecting realistic and most accurate computational models and in validating experimental structures. As an input, the VoroMQA web server accepts one or more protein structures in PDB format. Input structures may be either monomeric proteins or multimeric protein complexes. For every input structure, the server provides both global and local (per-residue) scores. Visualization of the local scores along the protein chain is enhanced by providing secondary structure assignment and information on solvent accessibility. A unique feature of the VoroMQA server is the ability to directly assess protein-protein interaction interfaces. If this type of assessment is requested, the web server provides interface quality scores, interface energy estimates, and local scores for residues involved in inter-chain interfaces. VoroMQA, the underlying method of the web server, was extensively tested in recent community-wide CASP and CAPRI experiments. During these experiments VoroMQA showed outstanding performance both in model selection and in estimation of accuracy of local structural regions. The VoroMQA web server is available at http://bioinformatics.ibt.lt/wtsam/voromqa

    The use of interatomic contact areas to quantify discrepancies between RNA 3D models and reference structures

    No full text
    Growing interest in computational prediction of ribonucleic acid (RNA) three-dimensional structure has highlighted the need for reliable and meaningful methods to compare models and experimental structures. We present a structure superposition-free method to quantify both the local and global accuracy of RNA structural models with respect to the reference structure. The method, initially developed for proteins and here extended to RNA, closely reflects physical interactions, has a simple definition, a fixed range of values and no arbitrary parameters. It is based on the correspondence of respective contact areas between nucleotides or their components (base or backbone). The better is the agreement between respective contact areas in a model and the reference structure, the more accurate the model is considered to be. Since RNA bases account for the largest contact areas, we further distinguish stacking and non-stacking contacts. We have extensively tested the contact area-based evaluation method and found it effective in both revealing local discrepancies and ranking models by their overall quality. Compared to other reference-based RNA model evaluation methods, the new method shows a stronger emphasis on stereochemical quality of models. In addition, it takes into account model completeness, enabling a meaningful evaluation of full models and those missing some residues

    Modeling of protein complexes in CASP14 with emphasis on the interaction interface prediction

    No full text
    The goal of CASP experiments is to monitor the progress in the protein structure prediction field. During the 14th CASP edition we aimed to test our capabilities of predicting structures of protein complexes. Our protocol for modeling protein assemblies included both template-based modeling and free docking. Structural templates were identified using sensitive sequence-based searches. If sequence-based searches failed, we performed structure-based template searches using selected CASP server models. In the absence of reliable templates we applied free docking starting from monomers generated by CASP servers. We evaluated and ranked models of protein complexes using an improved version of our protein structure quality assessment method, VoroMQA, taking into account both interaction interface and global structure scores. If reliable templates could be identified, generally accurate models of protein assemblies were generated with the exception of an antibody-antigen interaction. The success of free docking mainly depended on the accuracy of initial subunit models and on the scoring of docking solutions. To put our overall results in perspective, we analyzed our performance in the context of other CASP groups. Although the subunits in our assembly models often were not of the top quality, these models had, overall, the best-predicted intersubunit interfaces according to several accuracy measures. We attribute our relative success primarily to the emphasis on the interaction interface when modeling and scoring
    corecore