Search CORE

28 research outputs found

Recommended from our members

Archiving and disseminating integrative structure models.

Author: Berman Helen M
Sali Andrej
Vallat Brinda
Webb Benjamin
Westbrook John
Publication venue: eScholarship, University of California
Publication date: 01/07/2019
Field of study

Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format

eScholarship - University of California

IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods

IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB

Edinburgh Research Explorer

eScholarship - University of California

Hochschulschriftenserver - Universität Frankfurt am Main

Federating structural models and data:Outcomes from a workshop on archiving integrative structures

Structures of biomolecular systems are increasingly computed by integrative modeling. In this approach, a structural model is constructed by combining information from multiple sources, including varied experimental methods and prior models. In 2019, a Workshop was held as a Biophysical Society Satellite Meeting to assess progress and discuss further requirements for archiving integrative structures. The primary goal of the Workshop was to build consensus for addressing the challenges involved in creating common data standards, building methods for federated data exchange, and developing mechanisms for validating integrative structures. The summary of the Workshop and the recommendations that emerged are presented here

edoc

Edinburgh Research Explorer

eScholarship - University of California

MPG.PuRe

New system for archiving integrative structures

Author: Vallat Brinda,
Publication venue
Publication date: 08/06/2022
Field of study

Ezid

Archiving and disseminating integrative structure models.

Author: Vallat Brinda,
Publication venue
Publication date: 21/05/2020
Field of study

Ezid

Development of a Prototype System for Archiving Integrative/Hybrid Structure Models of Biological Macromolecules.

Author: Vallat Brinda,
Publication venue
Publication date: 26/08/2020
Field of study

Ezid

Development of a Prototype System for Archiving Integrative/Hybrid Structure Models of Biological Macromolecules.

Author: Vallat Brinda,
Publication venue
Publication date: 19/09/2023
Field of study

Ezid

Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.

Author: Andras Fiser
Brinda Vallat
Carlos Madrid-Aliste
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/08/2015
Field of study

Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger in size compared to the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ an exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state of the art prediction methods, especially when sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling

Directory of Open Access Journals

PubMed Central

GDT_TS values of top scoring models obtained with SmotifTF method using dynamic Smotif library generated at different e-value cutoffs.

Author: Andras Fiser (2327872)
Brinda Vallat (781248)
Carlos Madrid-Aliste (781249)
Publication venue
Publication date
Field of study

GDT_TS values of top scoring models obtained with SmotifTF method using dynamic Smotif library generated at different e-value cutoffs.</p

The Francis Crick Institute

Performance of SmotifTF on the benchmarking test set in comparison to other methods

Author: Andras Fiser (2327872)
Brinda Vallat (781248)
Carlos Madrid-Aliste (781249)
Publication venue
Publication date
Field of study

1 = Number of residues in the query protein2 = Major secondary structure class according to DSSP [<a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1004419#pcbi.1004419.ref057" target="_blank">57</a>]3 = e-value of the best hit in the dynamic database4 = GDT_TS score of the best scoring model when compared to the native structure.Performance of SmotifTF on the benchmarking test set in comparison to other methods</p

The Francis Crick Institute