Search CORE

72,005 research outputs found

Protein–DNA binding specificity predictions with structural models

Author: Baker David
Havranek James J.
Morozov Alexandre V.
Siggia Eric D.
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

Protein–DNA interactions play a central role in transcriptional regulation and other biological processes. Investigating the mechanism of binding affinity and specificity in protein–DNA complexes is thus an important goal. Here we develop a simple physical energy function, which uses electrostatics, solvation, hydrogen bonds and atom-packing terms to model direct readout and sequence-specific DNA conformational energy to model indirect readout of DNA sequence by the bound protein. The predictive capability of the model is tested against another model based only on the knowledge of the consensus sequence and the number of contacts between amino acids and DNA bases. Both models are used to carry out predictions of protein–DNA binding affinities which are then compared with experimental measurements. The nearly additive nature of protein–DNA interaction energies in our model allows us to construct position-specific weight matrices by computing base pair probabilities independently for each position in the binding site. Our approach is less data intensive than knowledge-based models of protein–DNA interactions, and is not limited to any specific family of transcription factors. However, native structures of protein–DNA complexes or their close homologs are required as input to the model. Use of homology modeling can significantly increase the extent of our approach, making it a useful tool for studying regulatory pathways in many organisms and cell types

CiteSeerX

Crossref

PubMed Central

Inherent limitations of probabilistic models for protein-DNA binding specificity

Author: Ruan Shuxiang
Stormo Gary D
Publication venue: Digital Commons@Becker
Publication date: 01/01/2017
Field of study

The specificities of transcription factors are most commonly represented with probabilistic models. These models provide a probability for each base occurring at each position within the binding site and the positions are assumed to contribute independently. The model is simple and intuitive and is the basis for many motif discovery algorithms. However, the model also has inherent limitations that prevent it from accurately representing true binding probabilities, especially for the highest affinity sites under conditions of high protein concentration. The limitations are not due to the assumption of independence between positions but rather are caused by the non-linear relationship between binding affinity and binding probability and the fact that independent normalization at each position skews the site probabilities. Generally probabilistic models are reasonably good approximations, but new high-throughput methods allow for biophysical models with increased accuracy that should be used whenever possible

Directory of Open Access Journals

Digital Commons@Becker

FigShare

From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

Author: A Sarai
AV Morozov
BW Matthews
BW Matthews
CG Kalodimos
CH Yan
CO Pabo
E Fraenkel
E Katchalski-Katzir
FK Winkler
H Tjong
I Bonnet
IB Kuznetsov
Ilya Vakser
J Gorman
J Skolnick
J Skolnick
JE Donald
Jeffrey Skolnick
JJ Havranek
JS Lamoureux
JS Lamoureux
M Billeter
M Gao
M van Dijk
MJ Sippl
Mu Gao
N Bhardwaj
NC Horton
NM Luscombe
NP Stanford
O Givaty
P Aloy
P Rotkiewicz
PH von Hippel
R Mendez
R Samudrala
RMA Knegtel
S Ahmad
S Jones
SE Halford
SJ Hubbard
TA Robertson
TW Siggers
W Humphrey
WJ Lane
XJ Lu
Y Zhang
Y Zhang
ZJ Liu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009
Field of study

©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein

Scholarly Materials And Research @ Georgia Tech

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Recommended from our members

A combined computational-experimental approach to define the structural origin of antibody recognition of sialyl-Tn, a tumor-associated carbohydrate antigen.

Author: Amon Ron
Chen Xi
Fleishman Sarel J
Glushka John N
Grant Oliver C
Leviatan Ben-Arye Shani
Makeneni Spandana
Marshanski Tal
Nivedha Anita K
Norn Christoffer
Padler-Karavani Vered
Woods Robert J
Yu Hai
Publication venue: eScholarship, University of California
Publication date: 01/07/2018
Field of study

Anti-carbohydrate monoclonal antibodies (mAbs) hold great promise as cancer therapeutics and diagnostics. However, their specificity can be mixed, and detailed characterization is problematic, because antibody-glycan complexes are challenging to crystallize. Here, we developed a generalizable approach employing high-throughput techniques for characterizing the structure and specificity of such mAbs, and applied it to the mAb TKH2 developed against the tumor-associated carbohydrate antigen sialyl-Tn (STn). The mAb specificity was defined by apparent KD values determined by quantitative glycan microarray screening. Key residues in the antibody combining site were identified by site-directed mutagenesis, and the glycan-antigen contact surface was defined using saturation transfer difference NMR (STD-NMR). These features were then employed as metrics for selecting the optimal 3D-model of the antibody-glycan complex, out of thousands plausible options generated by automated docking and molecular dynamics simulation. STn-specificity was further validated by computationally screening of the selected antibody 3D-model against the human sialyl-Tn-glycome. This computational-experimental approach would allow rational design of potent antibodies targeting carbohydrates

eScholarship - University of California

Predicting Transcription Factor Specificity with All-Atom Models

Author: Arvidson
Benos
Blattner
Djordjevic
Donald
Endres
Endres
Foat
Glasfeld
He
Humphrey
Ingraham
Kalodimos
Kazakov
Kinney
Kullback
Lafontaine
Leonid A. Mirny
Liu
Lu
MacKerell
Maerkl
Man
Mehran Kardar
Meng
Mironov
Morozov
Onufriev
Paillard
Paillard
Peter Virnau
Pérez
Sahand J. Rahi
Schumacher
von Hippel
Wang
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/08/2008
Field of study

The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibility of using structure-based energy calculations that require no knowledge of bound sites but rather start with the structure of a protein-DNA complex. We study the PurR E. coli TF, and explore to which extent atomistic models of protein-DNA complexes can be used to distinguish between cognate and non-cognate DNA sites. Particular emphasis is placed on systematic evaluation of this approach by comparing its performance with bioinformatic methods, by testing it against random decoys and sites of homologous TFs. We also examine a set of experimental mutations in both DNA and the protein. Using our explicit estimates of energy, we show that the specificity for PurR is dominated by direct protein-DNA interactions, and weakly influenced by bending of DNA.Comment: 26 pages, 3 figure

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Harvard University - DASH

PubMed Central

RosettaBackrub--a web server for flexible backbone protein structure modeling and design.

Author: Friedland Gregory F
Humphris Elisabeth L
Kortemme Tanja
Lauck Florian
Smith Colin A
Publication venue: eScholarship, University of California
Publication date: 12/05/2010
Field of study

The RosettaBackrub server (http://kortemmelab.ucsf.edu/backrub) implements the Backrub method, derived from observations of alternative conformations in high-resolution protein crystal structures, for flexible backbone protein modeling. Backrub modeling is applied to three related applications using the Rosetta program for structure prediction and design: (I) modeling of structures of point mutations, (II) generating protein conformational ensembles and designing sequences consistent with these conformations and (III) predicting tolerated sequences at protein-protein interfaces. The three protocols have been validated on experimental data. Starting from a user-provided single input protein structure in PDB format, the server generates near-native conformational ensembles. The predicted conformations and sequences can be used for different applications, such as to guide mutagenesis experiments, for ensemble-docking approaches or to generate sequence libraries for protein design

PubMed Central

eScholarship - University of California

Functional interplay between NTP leaving group and base pair recognition during RNA polymerase II nucleotide incorporation revealed by methylene substitution.

Author: Chong Jenny
Huang Xuhui
Hwang Candy S
Kool Eric T
McKenna Charles E
Shin Ji Hyun
Ulrich Sébastien
Wang Dong
Wang Wei
Xu Liang
Zhang Lu
Publication venue: eScholarship, University of California
Publication date: 07/04/2016
Field of study

RNA polymerase II (pol II) utilizes a complex interaction network to select and incorporate correct nucleoside triphosphate (NTP) substrates with high efficiency and fidelity. Our previous 'synthetic nucleic acid substitution' strategy has been successfully applied in dissecting the function of nucleic acid moieties in pol II transcription. However, how the triphosphate moiety of substrate influences the rate of P-O bond cleavage and formation during nucleotide incorporation is still unclear. Here, by employing β,γ-bridging atom-'substituted' NTPs, we elucidate how the methylene substitution in the pyrophosphate leaving group affects cognate and non-cognate nucleotide incorporation. Intriguingly, the effect of the β,γ-methylene substitution on the non-cognate UTP/dT scaffold (∼3-fold decrease in kpol) is significantly different from that of the cognate ATP/dT scaffold (∼130-fold decrease in kpol). Removal of the wobble hydrogen bonds in U:dT recovers a strong response to methylene substitution of UTP. Our kinetic and modeling studies are consistent with a unique altered transition state for bond formation and cleavage for UTP/dT incorporation compared with ATP/dT incorporation. Collectively, our data reveals the functional interplay between NTP triphosphate moiety and base pair hydrogen bonding recognition during nucleotide incorporation

PubMed Central

eScholarship - University of California

Characterization of Aptamer-Protein Complexes by X-ray Crystallography and Alternative Approaches

Author: Baugh
Bauke W. Dijkstra
Bing
Bock
Cao
Chayen
Convery
Doudna
Ellington
Friedmann
Garber
Hauke Smidt
Hermann
Hianik
Hoggan
Hollis
Horn
Huang
Huang
Hwang
Jiang
Johan Hekelaar
John van der Oost
Kaur
Ke
Kelly
Kikin
Krauss
Kwan
Laing
Lebruska
Lee
Long
Lupold
Macaya
Mark Levisson
Mascini
McPherson
Mehta
Miyakawa
Moorthy
Murai
Nix
Nomura
Orlova
Padmanabhan
Padmanabhan
Paige
Parisien
Poniková
Reinemann
Reinstein
Renault
Rivas
Rowsell
Ruigrok
Sekiya
Shum
Skrzypczak-Jankun
Snyder
Someya
Stoltenburg
Sugiyama
Sussman
Tereshko
Tuerk
Vincent J. B. Ruigrok
Wang
Wilson
Win
Wochner
Yan
Yee
Zuker
Publication venue
Publication date: 01/01/2012
Field of study

Aptamers are oligonucleotide ligands, either RNA or ssDNA, selected for high-affinity binding to molecular targets, such as small organic molecules, proteins or whole microorganisms. While reports of new aptamers are numerous, characterization of their specific interaction is often restricted to the affinity of binding (KD). Over the years, crystal structures of aptamer-protein complexes have only scarcely become available. Here we describe some relevant technical issues about the process of crystallizing aptamer-protein complexes and highlight some biochemical details on the molecular basis of selected aptamer-protein interactions. In addition, alternative experimental and computational approaches are discussed to study aptamer-protein interactions.

Multidisciplinary Digital Publishing Institute

University of Groningen

Directory of Open Access Journals

Wageningen University & Research Publications

CiteSeerX

Crossref

Proceedings - University of Groningen

ARTS repository - University of Groningen

PubMed Central

University of Groningen Digital Archive

Dissertations of the University of Groningen