135 research outputs found
Prediction of the thermodynamics of protein unfolding: the helix-coil transition of poly(L-alanine).
Experimentally based contact energies decode interactions responsible for protein–DNA affinity and the role of molecular waters at the binding interface
A major obstacle towards understanding the molecular basis of transcriptional regulation is the lack of a recognition code for protein–DNA interactions. Using high-quality crystal structures and binding data on the promiscuous family of C2H2 zinc fingers (ZF), we decode 10 fundamental specific interactions responsible for protein–DNA recognition. The interactions include five hydrogen bond types, three atomic desolvation penalties, a favorable non-polar energy, and a novel water accessibility factor. We apply this code to three large datasets containing a total of 89 C2H2 transcription factor (TF) mutants on the three ZFs of EGR. Guided by molecular dynamics simulations of individual ZFs, we map the interactions into homology models that embody all feasible intra- and intermolecular bonds, selecting for each sequence the structure with the lowest free energy. These interactions reproduce the change in affinity of 35 mutants of finger I (R2 = 0.998), 23 mutants of finger II (R2 = 0.96) and 31 finger III human domains (R2 = 0.94). Our findings reveal recognition rules that depend on DNA sequence/structure, molecular water at the interface and induced fit of the C2H2 TFs. Collectively, our method provides the first robust framework to decode the molecular basis of TFs binding to DNA
Learning about protein folding via potential functions
Over the last few years we have developed an empirical potential function that solves the protein structure recognition problem : given the sequence for an n -residue globular protein and a collection of plausible protein conformations, including the native conformation for that sequence, identify the correct, native conformation. Having determined this potential on the basis of only some 6500 native/nonnative pairs of structures for 58 proteins, we find it recognizes the native conformation for essentially all compact, soluble, globular proteins having known native conformations in comparisons with 10 4 to 10 6 reasonable alternative conformations apiece. In this sense, the potential encodes nearly all the essential features of globular protein conformational preference. In addition it “knows” about many additional factors in protein folding, such as the stabilization of multimeric proteins, quaternary structure, the role of disulfide bridges and ligands, pro proteins vs. processed proteins, and minimal strand lengths in globular proteins. Comparisons are made with other sorts of protein folding problems, and applications in protein conformational determination and prediction are discussed. © 1994 Wiley-Liss, Inc.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/38520/1/340200206_ftp.pd
Nature of protein family signatures: Insights from singular value analysis of position-specific scoring matrices
Position-specific scoring matrices (PSSMs) are useful for detecting weak
homology in protein sequence analysis, and they are thought to contain some
essential signatures of the protein families. In order to elucidate what kind
of ingredients constitute such family-specific signatures, we apply singular
value decomposition to a set of PSSMs and examine the properties of dominant
right and left singular vectors. The first right singular vectors were
correlated with various amino acid indices including relative mutability, amino
acid composition in protein interior, hydropathy, or turn propensity, depending
on proteins. A significant correlation between the first left singular vector
and a measure of site conservation was observed. It is shown that the
contribution of the first singular component to the PSSMs act to disfavor
potentially but falsely functionally important residues at conserved sites. The
second right singular vectors were highly correlated with hydrophobicity
scales, and the corresponding left singular vectors with contact numbers of
protein structures. It is suggested that sequence alignment with a PSSM is
essentially equivalent to threading supplemented with functional information.
The presented method may be used to separate functionally important sites from
structurally important ones, and thus it may be a useful tool for predicting
protein functions.Comment: 22 pages, 7 figures, 4 table
SNOSite: Exploiting Maximal Dependence Decomposition to Identify Cysteine S-Nitrosylation with Substrate Site Specificity
S-nitrosylation, the covalent attachment of a nitric oxide to (NO) the sulfur atom of cysteine, is a selective and reversible protein post-translational modification (PTM) that regulates protein activity, localization, and stability. Despite its implication in the regulation of protein functions and cell signaling, the substrate specificity of cysteine S-nitrosylation remains unknown. Based on a total of 586 experimentally identified S-nitrosylation sites from SNAP/L-cysteine-stimulated mouse endothelial cells, this work presents an informatics investigation on S-nitrosylation sites including structural factors such as the flanking amino acids composition, the accessible surface area (ASA) and physicochemical properties, i.e. positive charge and side chain interaction parameter. Due to the difficulty to obtain the conserved motifs by conventional motif analysis, maximal dependence decomposition (MDD) has been applied to obtain statistically significant conserved motifs. Support vector machine (SVM) is applied to generate predictive model for each MDD-clustered motif. According to five-fold cross-validation, the MDD-clustered SVMs could achieve an accuracy of 0.902, and provides a promising performance in an independent test set. The effectiveness of the model was demonstrated on the correct identification of previously reported S-nitrosylation sites of Bos taurus dimethylarginine dimethylaminohydrolase 1 (DDAH1) and human hemoglobin subunit beta (HBB). Finally, the MDD-clustered model was adopted to construct an effective web-based tool, named SNOSite (http://csb.cse.yzu.edu.tw/SNOSite/), for identifying S-nitrosylation sites on the uncharacterized protein sequences
- …