172 research outputs found

Determining Physical Constraints in Transcriptional Initiation Complexes Using DNA Sequence Analysis

Author: Chiang Derek Y.
Eisen Michael B.
Moses Alan M.
Shultzaberger Ryan K.
Publication venue: Public Library of Science
Publication date: 01/01/2007

Eukaryotic gene expression is often under the control of cooperatively acting transcription factors whose binding is limited by structural constraints. By determining these structural constraints, we can understand the “rules” that define functional cooperativity. Conversely, by understanding the rules of binding, we can infer structural characteristics. We have developed an information-theory-based method for approximating the physical limitations of cooperative interactions by comparing sequence analysis to microarray expression data. When we applied it to the coordinated binding of the sulfur amino acid regulatory protein Met4 by Cbf1 and Met31, we were able to create a combinatorial model that can correctly identify Met4-regulated genes. Interestingly, we found that the major determinant of Met4 regulation was the sum of the strength of the Cbf1 and Met31 binding sites and that the energetic costs associated with spacing appeared to be minimal.

CiteSeerX

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Flexible Promoter Architecture Requirements for Coactivator Recruitment

Author: Chiang Derek Y-h
Eisen Michael B
Gasch Audrey P
Nix David A
Shultzaberger Ryan K
Publication venue: Springer Science and Business Media LLC
Publication date: 18/04/2011

Background: The spatial organization of transcription factor binding sites in regulatory DNA, and the composition of intersite sequences, influence the assembly of the multiprotein complexes that regulate RNA polymerase recruitment and thereby affect transcription. We have developed a genetic approach to investigate how reporter gene transcription is affected by varying the spacing between transcription factor binding sites. We characterized the components of promoter architecture that govern the yeast transcription factors Cbf1 and Met31/32, which bind independently, but collaboratively recruit the coactivator Met4. Results: A Cbf1 binding site was required upstream of a Met31/32 binding site for full reporter gene expression. Distance constraints on coactivator recruitment were more flexible than those for cooperatively binding transcription factors. Distances from 18 to 50 bp between binding sites support efficient recruitment of Met4, with only slight modulation by helical phasing. Intriguingly, we found that certain sequences located between the binding sites abolished gene expression. Conclusion: These results yield insight into the influence of both binding site architecture and local DNA flexibility on gene expression, and can be used to refine computational predictions of gene expression from promoter sequences. In addition, our approach can be applied to survey promoter architecture requirements for arbitrary combinations of transcription factor binding sites.

Harvard University - DASH

Protein-coding gene promoters in Methanocaldococcus (Methanococcus) jannaschii

Author: Badger
Bailey
Baliga
Bartlett
Bell
Bell
Bensing
Best
Brenneis
Brown
Bult
Burden
Celesnik
Cui
Danner
Darcy
Enhu Li
Erill
Gary J. Olsen
Giometti
Giometti
Grunberg
Hain
Hannenhalli
Hanzelka
Hausner
Hausner
Hertz
Hofacker
Howland
Huerta
Iacono
Jian Zhang
Jones
Klein
Kosa
Lagrange
Langer
Li
Littlefield
Morris
Okuda
Ouhammouch
Ouhammouch
Palmer
Reiter
Reiter
Renfrow
Rhodius
Richard
Robert
Roulet
Schattner
Schneider
Schneider
Shimada
Shine
Shultzaberger
Shultzaberger
Shultzaberger
Shultzaberger
Slupska
Soppa
Soppa
Staden
Stormo
Stormo
Tabansky
Thomm
Tolstrup
Tsai
Tuerk
Vierke
Vilela
Wen
Wheeler
Wich
Wiley
Workman
Zhu
Zillig
Publication venue: Oxford University Press
Publication date: 01/01/2009

Although Methanocaldococcus (Methanococcus) jannaschii was the first archaeon to have its genome sequenced, little is known about the promoters of its protein-coding genes. To expand our knowledge, we have experimentally identified 131 promoters for 107 protein-coding genes in this genome by mapping their transcription start sites. Compared to previously identified promoters, more than half of which are from genes for stable RNAs, the protein-coding gene promoters are qualitatively similar in overall sequence pattern, but statistically different at several positions due to greater variation among their sequences. Relative binding affinity for general transcription factors was measured for 12 of these promoters by competition electrophoretic mobility shift assays. These promoters bind the factors less tightly than do most tRNA gene promoters. When a position weight matrix (PWM) was constructed from the protein gene promoters, factor binding affinities correlated with corresponding promoter PWM scores. We show that the PWM based on our data more accurately predicts promoters in the genome and transcription start sites than could be done with the previously available data. We also introduce a PWM logo, which visually displays the implications of observing a given base at a position in a sequence.
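
To make the PWM idea in this abstract concrete, the following is a minimal sketch of how a log-odds position weight matrix can be built from aligned sites and used to scan a sequence. It is not the construction used in the paper: the pseudocount, the uniform 0.25 background, the function names and the toy sites are all assumptions chosen for illustration.

```python
import math

BASES = "ACGT"

def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM (one dict per position) from aligned sites.

    The pseudocount and the uniform background are illustrative assumptions.
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    pwm = []
    for pos in range(length):
        # Start every base at the pseudocount so unseen bases get a finite weight.
        counts = {b: pseudocount for b in BASES}
        for s in sites:
            counts[s[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2((counts[b] / total) / background) for b in BASES})
    return pwm

def score(pwm, seq):
    """Sum the position weights for a sequence as long as the PWM."""
    return sum(col[base] for col, base in zip(pwm, seq))

def best_hit(pwm, seq):
    """Slide the PWM along seq and return (best score, start index)."""
    width = len(pwm)
    return max((score(pwm, seq[i:i + width]), i)
               for i in range(len(seq) - width + 1))

# Hypothetical toy alignment, not data from the paper.
toy_sites = ["TATAAA", "TATATA", "TTTAAA", "TATAAA"]
toy_pwm = build_pwm(toy_sites)
hit_score, hit_start = best_hit(toy_pwm, "GGGCTATAAAGGC")
```

In this toy case the best-scoring window starts at index 4, where the sequence matches the consensus of the alignment. A real scan would also need a score threshold and handling of both DNA strands, which this sketch leaves out.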

CiteSeerX

Crossref

PubMed Central

Discovery of Fur binding site clusters in Escherichia coli by information theory models

Author: Chen Zehua
Doan Bernard
Lewis Karen A.
Lyakhov Ilya G.
Schneider Thomas D.
Shultzaberger Ryan K.
Storz Gisela
Zheng Ming
Publication venue: Oxford University Press
Publication date: 01/01/2007

Fur is a DNA binding protein that represses bacterial iron uptake systems. Eleven footprinted Escherichia coli Fur binding sites were used to create an initial information theory model of Fur binding, which was then refined by adding 13 experimentally confirmed sites. When the refined model was scanned across all available footprinted sequences, sequence walkers, which are visual depictions of predicted binding sites, frequently appeared in clusters that fit the footprints (∼83% coverage). This indicated that the model can accurately predict Fur binding. Within the clusters, individual walkers were separated from their neighbors by exactly 3 or 6 bases, consistent with models in which Fur dimers bind on different faces of the DNA helix. When the E. coli genome was scanned, we found 363 unique clusters, which include all known Fur-repressed genes that are involved in iron metabolism. In contrast, only a few of the known Fur-activated genes have predicted Fur binding sites at their promoters. These observations suggest that Fur is either a direct repressor or an indirect activator. The Pseudomonas aeruginosa and Bacillus subtilis Fur models are highly similar to the E. coli Fur model, suggesting that the Fur–DNA recognition mechanism may be conserved for even distantly related bacteria.

CiteSeerX

PubMed Central

Probing the Informational and Regulatory Plasticity of a Transcription Factor DNA–Binding Domain

Transcription factors have two functional constraints on their evolution: (1) their binding sites must have enough information to be distinguishable from all other sequences in the genome, and (2) they must bind these sites with an affinity that appropriately modulates the rate of transcription. Since both are determined by the biophysical properties of the DNA–binding domain, selection on one will ultimately affect the other. We were interested in understanding how plastic the informational and regulatory properties of a transcription factor are and how transcription factors evolve to balance these constraints. To study this, we developed an in vivo selection system in Escherichia coli to identify variants of the helix-turn-helix transcription factor MarA that bind different sets of binding sites with varying degrees of degeneracy. Unlike previous in vitro methods used to identify novel DNA binders and to probe the plasticity of the binding domain, our selections were done within the context of the initiation complex, selecting for both specific binding within the genome and for a physiologically significant strength of interaction to maintain function of the factor. Using MITOMI, quantitative PCR, and a binding site fitness assay, we characterized the binding, function, and fitness of some of these variants. We observed that a large range of binding preferences, information contents, and activities could be accessed with a few mutations, suggesting that transcriptional regulatory networks are highly adaptable and expandable.

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Correlation between binding rate constants and individual information of E. coli Fis binding sites

Author: Fisher Robert J.
Lyakhov Ilya G.
Roberts Lindsey R.
Schneider Thomas D.
Shultzaberger Ryan K.
Sidorov Igor A.
Stephen Andrew G.
Publication venue: Oxford University Press
Publication date: 01/01/2007

Individual protein binding sites on DNA can be measured in bits of information. This information is related to the free energy of binding by the second law of thermodynamics, but binding kinetics appear to be inaccessible from sequence information since the relative contributions of the on- and off-rates to the binding constant, and hence the free energy, are unknown. However, the on-rate could be independent of the sequence since a protein is likely to bind once it is near a site. To test this, we used surface plasmon resonance and electromobility shift assays to determine the kinetics for binding of the Fis protein to a range of naturally occurring binding sites. We observed that the logarithm of the off-rate is indeed proportional to the individual information of the binding sites, as predicted. However, the on-rate is also related to the information, but to a lesser degree. We suggest that the on-rate is mostly determined by DNA bending, which in turn is determined by the sequence information. Finally, we observed a break in the binding curve around zero bits of information. The break is expected from information theory because it represents the coding demarcation between specific and nonspecific binding.
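
The "individual information" of a site, measured in bits, can be illustrated with a short sketch. It assumes equiprobable background bases (so each position can carry at most 2 bits) and omits the small-sample correction used in practice. The function names and the toy alignment are hypothetical and are not taken from the paper.

```python
import math

BASES = "ACGT"

def base_frequencies(sites):
    """Per-position base frequencies of an alignment of equal-length sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    n = len(sites)
    return [{b: sum(s[pos] == b for s in sites) / n for b in BASES}
            for pos in range(length)]

def individual_information(freqs, seq):
    """Bits carried by one sequence: sum over positions of 2 + log2 f(base).

    Assumes a uniform background and no small-sample correction. A base
    never seen at a position gives minus infinity in this simplified form.
    """
    total = 0.0
    for col, base in zip(freqs, seq):
        f = col[base]
        if f == 0.0:
            return float("-inf")
        total += 2.0 + math.log2(f)
    return total

def r_sequence(freqs):
    """Average information of the alignment: sum over positions of 2 - H."""
    return sum(2.0 + sum(f * math.log2(f) for f in col.values() if f > 0)
               for col in freqs)

# Hypothetical toy alignment, not Fis binding sites from the paper.
toy_sites = ["ACGT", "ACGT", "ACGA", "ACCT"]
toy_freqs = base_frequencies(toy_sites)
mean_ri = sum(individual_information(toy_freqs, s) for s in toy_sites) / len(toy_sites)
```

Under these assumptions the mean individual information of the aligned sites equals the total information of the alignment, so `mean_ri` matches `r_sequence(toy_freqs)`. Sequences that score at or below zero bits would fall on the nonspecific side of the demarcation that the abstract describes.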

CiteSeerX

PubMed Central

Recognition of prokaryotic promoters based on a novel variable-window Z-curve method

Author: Alberts
Askary
Bansal
Barrios
Barrios
Benson
Bland
Burden
Burnham
Estrem
Evans
Gama-Castro
Gao
Gao
Geladi
Gordon
Gruber
Gruber
Guo
Helmann
Helmann
Hook-Barnard
Hook-Barnard
Höskuldsson
Kai Song
Kvalheim
Kvalheim
Lin
Lindgren
Mann
McCracken
Paget
Paget
Perez-Rueda
Perez-Rueda
Rani
Rosipal
Rosipal
Rännar
Samal
Shultzaberger
Shultzaberger
Sierro
Sierro
Tsukahara
van Hijum
Wold
Wold
Wosten
Yang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Publication venue: Oxford University Press

Transcription is the first step in gene expression, and it is the step at which most of the regulation of expression occurs. Although sequenced prokaryotic genomes provide a wealth of information, transcriptional regulatory networks are still poorly understood using the available genomic information, largely because accurate prediction of promoters is difficult. To improve promoter recognition performance, a novel variable-window Z-curve method is developed to extract general features of prokaryotic promoters. The features are used for further classification by the partial least squares technique. To verify the prediction performance, the proposed method is applied to predict promoter fragments of two representative prokaryotic model organisms (Escherichia coli and Bacillus subtilis). Owing to the feature extraction and selection power of the proposed method, the promoter prediction accuracies are improved markedly over most existing approaches: for E. coli, the accuracies are 96.05% (σ70 promoters, coding negative samples), 90.44% (σ70 promoters, non-coding negative samples), 92.13% (known sigma-factor promoters, coding negative samples) and 92.50% (known sigma-factor promoters, non-coding negative samples), respectively; for B. subtilis, the accuracies are 95.83% (known sigma-factor promoters, coding negative samples) and 99.09% (known sigma-factor promoters, non-coding negative samples). Additionally, being a linear technique, the computational simplicity of the proposed method makes it easy to run in a matter of minutes on ordinary personal computers or even laptops. More importantly, there is no need to optimize parameters, so it is very practical for predicting other species' promoters without any prior knowledge or prior information of the statistical properties of the samples.
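
For readers unfamiliar with the Z-curve representation that this method builds on, here is a minimal sketch of the standard cumulative Z-curve transform, plus a simple per-window normalisation. The windowing helper is an assumption made for illustration only. The abstract does not specify the paper's variable-window scheme or its partial least squares step, and neither is reproduced here.

```python
def z_curve(seq):
    """Standard cumulative Z-curve: one (x, y, z) point per base.

    x: purines (A, G) minus pyrimidines (C, T)
    y: amino bases (A, C) minus keto bases (G, T)
    z: weak H-bond bases (A, T) minus strong H-bond bases (C, G)
    """
    counts = {"A": 0, "C": 0, "G": 0, "T": 0}
    points = []
    for base in seq.upper():
        if base not in counts:
            raise ValueError(f"unexpected base: {base!r}")
        counts[base] += 1
        a, c, g, t = counts["A"], counts["C"], counts["G"], counts["T"]
        points.append(((a + g) - (c + t), (a + c) - (g + t), (a + t) - (c + g)))
    return points

def window_components(seq, start, width):
    """Hypothetical helper: Z-curve components of one window, scaled to [-1, 1]."""
    window = seq[start:start + width]
    if not window:
        raise ValueError("window is empty")
    x, y, z = z_curve(window)[-1]
    n = len(window)
    return (x / n, y / n, z / n)

# Toy inputs chosen for illustration.
toy_points = z_curve("ACGT")
toy_features = [window_components("AAAACCCC", start, 4) for start in (0, 4)]
```

Feature vectors of this kind, computed over several windows of a candidate promoter fragment, are the sort of input a linear classifier could consume. How the paper chooses window sizes and selects features is not stated in the abstract.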

Crossref

PubMed Central

Regulatory network structure determines patterns of intermolecular epistasis

Author: Anderson
Barton
Bataillon
Bershtein
Breen
Camps
Charlesworth
de Visser
Eyre-Walker
Fontana
Fontana
Halligan
Jacquier
Johnson
Kinney
Lagator
Lee
Lutz
Lynch
Markiewicz
Metzger
Nagai
Orr
Otto
Pakula
Podgornaia
Poelwijk
Ptashne
Salgado
Sarkisyan
Shultzaberger
Soskine
Wagner
Wang
Weinreich
White
Whitlock
Yun
Publication venue: eLife Sciences Publications, Ltd
Publication date: 01/01/2017

Most phenotypes are determined by molecular systems composed of specifically interacting molecules. However, unlike for individual components, little is known about the distributions of mutational effects of molecular systems as a whole. We ask how the distribution of mutational effects of a transcriptional regulatory system differs from the distributions of its components, by first independently, and then simultaneously, mutating a transcription factor and the associated promoter it represses. We find that the system distribution exhibits increased phenotypic variation compared to individual component distributions, an effect arising from intermolecular epistasis between the transcription factor and its DNA-binding site. In large part, this epistasis can be qualitatively attributed to the structure of the transcriptional regulatory system and could therefore be a common feature in prokaryotes. Counter-intuitively, intermolecular epistasis can alleviate the constraints of individual components, thereby increasing phenotypic variation that selection could act on and facilitating adaptive evolution.

University of Liverpool Repository

Crossref

IST Austria: PubRep (Institute of Science and Technology)

The University of Manchester - Institutional Repository

Anatomy of Escherichia coli σ(70) promoters

Author: Althaus
Aoyama
Aoyama
Ball
Barne
Benoff
Blattner
Bolshoy
Borowiec
Brodolin
Browning
Burgess
Busby
Campbell
Chen
Chen
Collado-Vides
Darst
deHaseth
deHaseth
Dombroski
Dubendorff
Eichenberger
Fenton
Galas
Goodrich
Gralla
Gross
Gross
GuhaThakurta
Hagerman
Halford
Harley
Hawley
Helmann
Hengen
Hengen
Hershberg
Hershberg
Hertz
Heyduk
Horwitz
Huerta
Karen A. Lewis
Kawano
Keilty
Kumar
Lee
Leroy
Lim
Lim
Lukashin
Lyakhov
Mandecki
Masse
Mathew
McClure
Miller
Mirny
Moyle
Mulligan
Murakami
O'Neill
Oliphant
Papp
Penotti
Peter
Pierce
Postle
Raibaud
Ringrose
Roberts
Robison
Rogan
Rogan
Roy
Rudd
Rudd
Ryan K. Shultzaberger
Salgado
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Schneider
Sclavi
Seeman
Semsey
Shannon
Shultzaberger
Shultzaberger
Siebenlist
Stefano
Stephens
Thomas D. Schneider
Travers
Travers
Tsolis
Ussery
Vassylyev
Wassarman
Weller
Wiese
Yin
Young
Young
Young
Zehua Chen
Zhi
Zhi
Publication venue: Oxford University Press
Publication date: 01/01/2006

Information theory was used to build a promoter model that accounts for the −10, the −35 and the uncertainty of the gap between them on a common scale. Helical face assignment indicated that base −7, rather than −11, of the −10 may be flipping to initiate transcription. We found that the sequence conservation of σ(70) binding sites is 6.5 ± 0.1 bits. Some promoters lack a −35 region, but have a 6.7 ± 0.2 bit extended −10, almost the same information as the bipartite promoter. These results and similarities between the contacts in the extended −10 binding and the −35 suggest that the flexible bipartite σ factor evolved from a simpler polymerase. Binding predicted by the bipartite model is enriched around 35 bases upstream of the translational start. This distance is the smallest 5′ mRNA leader necessary for ribosome binding, suggesting that selective pressure minimizes transcript length. The promoter model was combined with models of the transcription factors Fur and Lrp to locate new promoters, to quantify promoter strengths, and to predict activation and repression. Finally, the DNA-bending proteins Fis, H-NS and IHF frequently have sites within one DNA persistence length from the −35, so bending allows distal activators to reach the polymerase.

CiteSeerX

Crossref

PubMed Central

MODEST: a web-based design tool for oligonucleotide-mediated genome engineering and recombineering

Author: Annika I.N. Wallin
Baba
Bonde
Bubunenko
Cock
Dicarlo
Ellis
Gao
Gold
Harris H. Wang
Katashkina
Kono
Kosuri
Li
Lorenz
Mads T. Bonde
Mads V. Anderson
Michael S. Klausen
Morten O.A. Sommer
Mosberg
Neylon
Nyerges
Salis
SantaLucia
Sawitzke
Schurr
Sharan
Shine
Shultzaberger
Sullivan
Swingle
Van Pijkeren
Wang
Wang
Wang
Yamamoto
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2014

Recombineering and multiplex automated genome engineering (MAGE) offer the possibility to rapidly modify multiple genomic or plasmid sites at high efficiencies. This enables efficient creation of genetic variants including both single mutants with specifically targeted modifications as well as combinatorial cell libraries. Manual design of oligonucleotides for these approaches can be tedious, time-consuming, and may not be practical for larger projects targeting many genomic sites. At present, the change from a desired phenotype (e.g. altered expression of a specific protein) to a designed MAGE oligo, which confers the corresponding genetic change, is performed manually. To address these challenges, we have developed the MAGE Oligo Design Tool (MODEST). This web-based tool allows designing of MAGE oligos for (i) tuning translation rates by modifying the ribosomal binding site, (ii) generating translational gene knockouts and (iii) introducing other coding or non-coding mutations, including amino acid substitutions, insertions, deletions and point mutations. The tool automatically designs oligos based on desired genotypic or phenotypic changes defined by the user, which can be used for high efficiency recombineering and MAGE. MODEST is available for free and is open to all users at http://modest.biosustain.dtu.dk

Crossref

PubMed Central

Online Research Database In Technology