4,834 research outputs found
DISCRETIZED GEOMETRIC APPROACHES TO THE ANALYSIS OF PROTEIN STRUCTURES
Proteins play crucial roles in a variety of biological processes. While we know that their amino acid sequence determines their structure, which in turn determines their function, we do not know why particular sequences fold into particular structures. My work focuses on discretized geometric descriptions of protein structureāconceptualizing native structure space as composed of mostly discrete, geometrically defined fragmentsāto better understand the patterns underlying why particular sequence elements correspond to particular structure elements. This discretized geometric approach is applied to multiple levels of protein structure, from conceptualizing contacts between residues as interactions between discrete structural elements to treating protein structures as an assembly of discrete fragments. My earlier work focused on better understanding inter-residue contacts and estimating their energies statistically. By scoring structures with energies derived from a stricter notion of contact, I show that native protein structures can be identified out of a set of decoy structures more often than when using energies derived from traditional definitions of contact and how this has implications for the evaluation of predictions that rely on structurally defined contacts for validation. Demonstrating how useful simple geometric descriptors of structure can be, I then show that these energies identify native structures on par with well-validated, detailed, atomistic energy functions. Moving to a higher level of structure, in my later work I demonstrate that discretized, geometrically defined structural fragments make good objects for the interactive assembly of protein backbones and present a software application which lets users do so. Finally, I use these fragments to generate structure-conditioned statistical energies, generalizing the classic idea of contact energies by incorporating specific structural context, enabling these energies to reflect the interaction geometries they come from. These structure-conditioned energies contain more information about native sequence preferences, correlate more highly with experimentally determined energies, and show that pairwise sequence preferences are tightly coupled to their structural context. Considered jointly, these projects highlight the degree to which protein structures and the interactions they comprise can be understood as geometric elements coming together in finely tuned ways
Identification of direct residue contacts in protein-protein interaction by message passing
Understanding the molecular determinants of specificity in protein-protein
interaction is an outstanding challenge of postgenome biology. The availability
of large protein databases generated from sequences of hundreds of bacterial
genomes enables various statistical approaches to this problem. In this context
covariance-based methods have been used to identify correlation between amino
acid positions in interacting proteins. However, these methods have an
important shortcoming, in that they cannot distinguish between directly and
indirectly correlated residues. We developed a method that combines covariance
analysis with global inference analysis, adopted from use in statistical
physics. Applied to a set of >2,500 representatives of the bacterial
two-component signal transduction system, the combination of covariance with
global inference successfully and robustly identified residue pairs that are
proximal in space without resorting to ad hoc tuning parameters, both for
heterointeractions between sensor kinase (SK) and response regulator (RR)
proteins and for homointeractions between RR proteins. The spectacular success
of this approach illustrates the effectiveness of the global inference approach
in identifying direct interaction based on sequence information alone. We
expect this method to be applicable soon to interaction surfaces between
proteins present in only 1 copy per genome as the number of sequenced genomes
continues to expand. Use of this method could significantly increase the
potential targets for therapeutic intervention, shed light on the mechanism of
protein-protein interaction, and establish the foundation for the accurate
prediction of interacting protein partners.Comment: Supplementary information available on
http://www.pnas.org/content/106/1/67.abstrac
Molecular Binding Energies from Partition Density Functional Theory
Approximate molecular calculations via standard Kohn-Sham Density Functional
Theory are exactly reproduced by performing self-consistent calculations on
isolated fragments via Partition Density Functional Theory [Phys. Rev. A 82,
024501 (2010)]. We illustrate this with the binding curves of small diatomic
molecules. We find that partition energies are in all cases qualitatively
similar and numerically close to actual binding energies. We discuss
qualitative features of the associated partition potentials
A new xantphos-type ligand and its gold(I) complexes: Synthesis, structure, luminescence
A novel xantphos analog diphosphine ligand, 9,9-dimethyl-4,5-bis(diphenylphosphinomethyl)-9H-xanthene (X(CP)2), with methylene groups inserted between the xanthene skeleton and the two diphenylphosphine units, has been synthesized. A two-coordinate and a three-coordinate gold(I) complex of the ligand, [Au2Cl2(X(CP)2)] and [AuCl(X(CP)2)], have been prepared and studied by X-ray diffraction, NMR and optical spectroscopy. In the solid state, [AuCl(X(CP)2)] adopts a highly ordered structure with a planar xanthene skeleton that faces another plane composed of two phenyl rings and the AuCl moiety. The structure of [Au2Cl2(X(CP)2)] is much less regular, the two PāAuāCl vectors point to the opposite sides of the folded xanthene backbone. The exchange-broadened resonances in the NMR spectra of [AuCl(X(CP)2))] indicate that this complex exists as a mixture of various chemical species and/or conformers in solution. In contrast, the NMR spectra of [Au2Cl2(X(CP)2)] exclude any medium-range exchange processes. Aurophilic interactions are absent in both X(CP)2 complexes. X(CP)2, as well as its two gold complexes, is phosphorescent in the solid state; the complexes emit at higher wavelengths and with longer lifetimes than the free ligand
Phosphate binding sites identification in protein structures
Nearly half of known protein structures interact with phosphate-containing ligands, such as nucleotides and other cofactors. Many methods have been developed for the identification of metal ions-binding sites and some for bigger ligands such as carbohydrates, but none is yet available for the prediction of phosphate-binding sites. Here we describe Pfinder, a method that predicts binding sites for phosphate groups, both in the form of ions or as parts of other non-peptide ligands, in proteins of known structure. Pfinder uses the Query3D local structural comparison algorithm to scan a protein structure for the presence of a number of structural motifs identified for their ability to bind the phosphate chemical group. Pfinder has been tested on a data set of 52 proteins for which both the apo and holo forms were available. We obtained at least one correct prediction in 63% of the holo structures and in 62% of the apo. The ability of Pfinder to recognize a phosphate-binding site in unbound protein structures makes it an ideal tool for functional annotation and for complementing docking and drug design methods. The Pfinder program is available at http://pdbfun.uniroma2.it/pfinder
SecStAnT: Secondary Structure Analysis Tool for data selection, statistics and models building
Abstract
Motivation:āAtomistic or coarse grained (CG) potentials derived from statistical distributions of internal variables have recently become popular due to the need of simplified interactions for reaching larger scales in simulations or more efficient conformational space sampling. However, the process of parameterization of accurate and predictive statistics-based force fields requires a huge amount of work and is prone to the introduction of bias and errors.
Results:āThis article introduces SecStAnT, a software for the creation and analysis of protein structural datasets with user-defined primary/secondary structure composition, with a particular focus on the CG representation. In addition, the possibility of managing different resolutions and the primary/secondary structure selectivity allow addressing the mapping-backmapping of atomistic to CG representation and study the secondary to primary structure relations. Sample datasets and distributions are reported, including interpretation of structural features.
Availability and implementation:āSecStAnT is available free of charge at secstant.sourceforge.net/. Source code is freely available on request, implemented in Java and supported on Linux, MS Windows and OSX.
Contact:ā[email protected]
Supplementary information: Supplementary data are available at Bioinformatics online
Partition density functional theory
Partition density functional theory (PDFT) is a method for dividing a molecular electronic structure calculation into fragment calculations. The molecular density and energy corresponding to Kohn Sham density-functional theory (KS-DFT) may be exactly recovered from these fragments. Each fragment acts as an isolated system except for the influence of a global one-body \u27partition\u27 potential which deforms the fragment densities. In this work, the developments of PDFT are put into the context of other fragment-based density functional methods. We developed three numerical implementations of PDFT: One within the NWChem computational chemistry package using basis sets, and the other two developed from scratch using real-space grids. It is shown that all three of these programs can exactly reproduce a KS-DFT calculation via fragment calculations. The first of our in-house codes handles non-interacting electrons in arbitrary one-dimensional potentials with any number of fragments. This code is used to explore how the exact partition potential changes for different partitionings of the same system and also to study features which determine which systems yield non-integer PDFT occupations and which systems are locked into integer PDFT occupations. The second in-house code, CADMium, performs real-space calculations of diatomic molecules. Features of the exact partition potential are studied for a variety of cases and an analytical formula determining singularities in the partition potential is derived. We introduce an approximation for the non-additive kinetic energy and show how this quantity can be computed exactly. Finally a PDFT functional is developed to address the issues of static correlation and delocalization errors in approximations within DFT. The functional is applied to the dissociation of H2+ and H2
- ā¦