Search CORE

7 research outputs found

Implementation of 3D spatial indexing and compression in a large-scale molecular dynamics simulation database for rapid atomic contact detection

Author: Beck David AC
Daggett Valerie
Simms Andrew M
Toofanny Rudesh D
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Molecular dynamics (MD) simulations offer the ability to observe the dynamics and interactions of both whole macromolecules and individual atoms as a function of time. Taken in context with experimental data, atomic interactions from simulation provide insight into the mechanics of protein folding, dynamics, and function. The calculation of atomic interactions or contacts from an MD trajectory is computationally demanding and the work required grows exponentially with the size of the simulation system. We describe the implementation of a spatial indexing algorithm in our multi-terabyte MD simulation database that significantly reduces the run-time required for discovery of contacts. The approach is applied to the Dynameomics project data. Spatial indexing, also known as spatial hashing, is a method that divides the simulation space into regular sized bins and attributes an index to each bin. Since, the calculation of contacts is widely employed in the simulation field, we also use this as the basis for testing compression of data tables. We investigate the effects of compression of the trajectory coordinate tables with different options of data and index compression within MS SQL SERVER 2008. Results Our implementation of spatial indexing speeds up the calculation of contacts over a 1 nanosecond (ns) simulation window by between 14% and 90% (i.e., 1.2 and 10.3 times faster). For a 'full' simulation trajectory (51 ns) spatial indexing reduces the calculation run-time between 31 and 81% (between 1.4 and 5.3 times faster). Compression resulted in reduced table sizes but resulted in no significant difference in the total execution time for neighbour discovery. The greatest compression (~36%) was achieved using page level compression on both the data and indexes. Conclusions The spatial indexing scheme significantly decreases the time taken to calculate atomic contacts and could be applied to other multidimensional neighbor discovery problems. The speed up enables on-the-fly calculation and visualization of contacts and rapid cross simulation analysis for knowledge discovery. Using page compression for the atomic coordinate tables and indexes saves ~36% of disk space without any significant decrease in calculation time and should be considered for other non-transactional databases in MS SQL SERVER 2008.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Evolutionary Drivers of Protein Shape

Author: Marples Callum R.
Shannon Gareth
Toofanny Rudesh D.
Williams Philip M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/08/2019
Field of study

Diffusional motion within the crowded environment of the cell is known to be crucial to cellular function as it drives the interactions of proteins. However, the relationships between protein diffusion, shape and interaction, and the evolutionary selection mechanisms that arise as a consequence, have not been investigated. Here, we study the dynamics of triaxial ellipsoids of equivalent steric volume to proteins at different aspect ratios and volume fractions using a combination of Brownian molecular dynamics and geometric packing. In general, proteins are found to have a shape, approximately Golden in aspect ratio, that give rise to the highest critical volume fraction resisting gelation, corresponding to the fastest long-time self-diffusion in the cell. The ellipsoidal shape also directs random collisions between proteins away from sites that would promote aggregation and loss of function to more rapidly evolving nonsticky regions on the surface, and further provides a greater tolerance to mutation

Repository@Nottingham

Molecular Dynamics Simulations of Consensus Tetratricopeptide Repeat Proteins

Author: Aitziber L. Cortajarena
Lynne Regan
Rudesh D. Toofanny
Valerie Daggett
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

GB1 Is Not a Two-State Folder: Identification and Characterization of an On-Pathway Intermediate

Author: BRUNORI Maurizio
GIANNI STEFANO
GIRI RAJANISH
MORRONE ANGELA
Rudesh d Toofanny
TRAVAGLINI ALLOCATELLI Carlo
Valerie Daggett
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

The folding pathway of the small alpha/beta protein GB1 has been extensively studied during the past two decades using both theoretical and experimental approaches. These studies provided a consensus view that the protein folds in a two-state manner. Here, we reassessed the folding of GB1, both by experiments and simulations, and detected the presence of an on-pathway intermediate. This intermediate has eluded earlier experimental characterization and is distinct from the collapsed state previously identified using ultrarapid mixing. Failure to identify the presence of an intermediate affects some of the conclusions that have been drawn for GB1, a popular model for protein folding studies

Elsevier - Publisher Connector

Crossref

PubMed Central

HAL Descartes

Archivio della ricerca- Università di Roma La Sapienza

Hal-Diderot

DIVE: a data intensive visualization engine

Author: Ashburner
Bernstein
Boeckler
Bromley
Dennis Bromley
Joerger
Rakhit
Robert Su
Rudesh D. Toofanny
Rysavy
Schmidlin
Schroeder
Shannon
Simms
Steven J. Rysavy
Tom Schmidlin
Valerie Daggett
Van der Kamp
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

Dynameomics:a comprehensive database of protein dynamics

Author: Anderson Peter C
Beck David A C
Benson Noah C
Bromley Dennis
Daggett Valerie
Jonsson Amanda L
Merkley Eric D
Rysavy Steven
Schaeffer R Dustin
Scouras Alexander D
Simms Andrew M
Toofanny Rudesh D
van der Kamp Marc W
Publication venue: 'Elsevier BV'
Publication date: 14/03/2010
Field of study

The dynamic behavior of proteins is important for an understanding of their function and folding. We have performed molecular dynamics simulations of the native state and unfolding pathways of over 1000 proteins, representing the majority of folds in globular proteins. These data are stored and organized using an innovative database approach, which can be mined to obtain both general and specific information about the dynamics and folding/unfolding of proteins, relevant subsets thereof, and individual proteins. Here we describe the project in general terms and the type of information contained in the database. Then we provide examples of mining the database for information relevant to protein folding, structure building, the effect of single-nucleotide polymorphisms, and drug design. The native state simulation data and corresponding analyses for the 100 most populated metafolds, together with related resources, are publicly accessible through www.dynameomics.org

PubMed Central

Explore Bristol Research

A Comprehensive Multidimensional-Embedded, One-Dimensional Reaction Coordinate for Protein Unfolding/Folding

Author: Amanda L. Jonsson
Beck
Beck
Boczko
Capaldi
Daggett
Daggett
Day
Day
Day
Fersht
Fulton
García
Gianni
Jonsson
Kazmirski
Kehl
Krantz
Lee
Li
Li
Li
López-Hernández
Martínez
McCully
Northey
Otzen
Petrovich
Rudesh D. Toofanny
Scheraga
Scott
Sheinerman
Sheinerman
Simms
Sosnick
Valerie Daggett
Went
White
Yee
Publication venue: The Biophysical Society
Publication date: 02/06/2010
Field of study

The goal of the Dynameomics project is to perform, store, and analyze molecular dynamics simulations of representative proteins, of all known globular folds, in their native state and along their unfolding pathways. To analyze unfolding simulations, the location of the protein along the unfolding reaction coordinate (RXN) must be determined. Properties such as the fraction of native contacts and radius of gyration are often used; however, there is an issue regarding degeneracy with these properties, as native and nonnative species can overlap. Here, we used 15 physical properties of the protein to construct a multidimensional-embedded, one-dimensional RXN coordinate that faithfully captures the complex nature of unfolding. The unfolding RXN coordinates for 188 proteins (1534 simulations and 22.9 μs in explicit water) were calculated. Native, transition, intermediate, and denatured states were readily identified with the use of this RXN coordinate. A global native ensemble based on the native-state properties of the 188 proteins was created. This ensemble was shown to be effective for calculating RXN coordinates for folds outside the initial 188 targets. These RXN coordinates enable, high-throughput assignment of conformational states, which represents an important step in comparing protein properties across fold space as well as characterizing the unfolding of individual proteins

Elsevier - Publisher Connector

Crossref

PubMed Central