Search CORE

2 research outputs found

Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data

Author: Appasamy Sri Devan
Armstrong David
Berrisford John
Ellaway Joseph, I J
Gaborova Romana
Grudinin Sergei
Gupta Deepti
Harrus Deborah
Leines Grisell Díaz
Nair Sreenath
Pidruchna Ivanna
Varadi Mihaly
Velankar Sameer
Publication venue: Nature Publishing Group
Publication date: 15/05/2023
Field of study

Abstract Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names and provided value-added annotations to over 90% of unique assemblies in the PDB. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of these cellular components. Furthermore, the data standardisation improves the PDB’s FAIR attributes, fostering more effective basic and translational research and education across scientific disciplines

Hal - Université Grenoble Alpes

Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data

Author: David Armstrong
Deborah Harrus
Deepti Gupta
Grisell Díaz Leines
Ivanna Pidruchna
John Berrisford
Joseph I. J. Ellaway
Mandar Deshpande
Mihaly Varadi
Romana Gaborova
Sameer Velankar
Sergei Grudinin
Sreenath Nair
Sri Devan Appasamy
Stephen Anyango
Publication venue: Nature Portfolio
Publication date: 15/05/2023
Field of study

Abstract Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB’s FAIR attributes, fostering more effective basic and translational research and scientific education

Hal - Université Grenoble Alpes

Directory of Open Access Journals

INRIA a CCSD electronic archive server