Search CORE

28 research outputs found

FOOTER: a web tool for finding mammalian DNA regulatory regions using phylogenetic footprinting

Author: Benos Panayiotis V.
Corcoran David L.
Feingold Eleanor
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

FOOTER is a newly developed algorithm that analyzes homologous mammalian promoter sequences in order to identify transcriptional DNA regulatory ‘signals’. FOOTER uses prior knowledge about the binding site preferences of the transcription factors (TFs) in the form of position-specific scoring matrices (PSSMs). The PSSM models are generated from known mammalian binding sites from the TRANSFAC database. In a test set of 72 confirmed binding sites (most of them not present in TRANSFAC) of 19 TFs, it exhibited 83% sensitivity and 72% specificity. FOOTER is accessible over the web at

CiteSeerX

Crossref

PubMed Central

STAMP: a web tool for exploring DNA-binding motif similarities

Author: Benos Panayiotis V.
Mahony Shaun
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

STAMP is a newly developed web server that is designed to support the study of DNA-binding motifs. STAMP may be used to query motifs against databases of known motifs; the software aligns input motifs against the chosen database (or alternatively against a user-provided dataset), and lists of the highest-scoring matches are returned. Such similarity-search functionality is expected to facilitate the identification of transcription factors that potentially interact with newly discovered motifs. STAMP also automatically builds multiple alignments, familial binding profiles and similarity trees when more than one motif is inputted. These functions are expected to enable evolutionary studies on sets of related motifs and fixed-order regulatory modules, as well as illustrating similarities and redundancies within the input motif collection. STAMP is a highly flexible alignment platform, allowing users to ‘mix-and-match’ between various implemented comparison metrics, alignment methods (local or global, gapped or ungapped), multiple alignment strategies and tree-building methods. Motifs may be inputted as frequency matrices (in many of the commonly used formats), consensus sequences, or alignments of known binding sites. STAMP also directly accepts the output files from 12 supported motif-finders, enabling quick interpretation of motif-discovery analyses. STAMP is available at http://www.benoslab.pitt.edu/stam

CiteSeerX

Crossref

PubMed Central

Simplified Method to Predict Mutual Interactions of Human Transcription Factors Based on Their Primary Structure

Author: A Ben Hur
A Ceol
A Ramani
A Remenyi
A van Dijk
A Varshavsky
B Aranda
B Breitkreutz
B Lemon
Boris Jankovic
C Camacho
C Chen
D Caffrey
D GuhaThakurta
E Wingender
F Browne
GJ McLachlan
H Almuallim
I Donaldson
I Guyon
J Bock
J Capra
J Espadaler
J Hoskins
J Shen
J Wang
JJ Chung
JM Vaquerizas
Joaquín Dopazo
L Matthews
L Yu
M Guharoy
M Guharoy
M Kato
M McDowall
N Banerjee
P Aloy
P Aloy
R Hoffmann
R Jansen
S Hannenhalli
S Kawashima
S Lee
S Lo
S Orchard
S Pitre
S Teichmann
Sebastian Schmeier
T Dandekar
T Lee
T Ravasi
U Ogmen
V Matys
VB Bajić
Vladimir B. Bajic
W Kim
W Valdar
X Chen
X Li
X Wu
X Yu
X Yu
Y Guo
Z Hu
Z Zhu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Background: Physical interactions between transcription factors (TFs) are necessary for forming regulatory protein complexes and thus play a crucial role in gene regulation. Currently, knowledge about the mechanisms of these TF interactions is incomplete and the number of known TF interactions is limited. Computational prediction of such interactions can help identify potential new TF interactions as well as contribute to better understanding the complex machinery involved in gene regulation. Methodology: We propose here such a method for the prediction of TF interactions. The method uses only the primary sequence information of the interacting TFs, resulting in a much greater simplicity of the prediction algorithm. Through an advanced feature selection process, we determined a subset of 97 model features that constitute the optimized model in the subset we considered. The model, based on quadratic discriminant analysis, achieves a prediction accuracy of 85.39 % on a blind set of interactions. This result is achieved despite the selection for the negative data set of only those TF from the same type of proteins, i.e. TFs that function in the same cellular compartment (nucleus) and in the same type of molecular process (transcription initiation). Such selection poses significant challenges for developing models with high specificity, but at the same time better reflects real-world problems. Conclusions: The performance of our predictor compares well to those of much more complex approaches for predicting TF and general protein-protein interactions, particularly when taking the reduced complexity of model utilisation into account

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

Statistical detection of cooperative transcription factors with similarity adjustment

Author: Aerts
Arnone
Berman
Berman
Boeva
Brown
Chargaff
Clyde
Crooks
Crowley
De Bleser
Frith
Frith
Frith
Frith
GuhaThakurta
GuhaThakurta
H. Klein
Hannenhalli
Harbison
Klingenhoff
Krivan
Lifanov
M. Vingron
Matys
Matys
Pape
Pilpel
Sosinsky
Stormo
U. J. Pape
Wagner
Wasserman
Yoshida
Yuh
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Motivation: Statistical assessment of cis-regulatory modules (CRMs) is a crucial task in computational biology. Usually, one concludes from exceptional co-occurrences of DNA motifs that the corresponding transcription factors (TFs) are cooperative. However, similar DNA motifs tend to co-occur in random sequences due to high probability of overlapping occurrences. Therefore, it is important to consider similarity of DNA motifs in the statistical assessment

Crossref

PubMed Central

MPG.PuRe

IMAGE: A New Tool for the Prediction of Transcription Factor Binding Sites

Author: Casilli R.
Marongiu A.
Melchionna S.
Palazzari P.
Paparcone R.
Rosato V.
Publication venue: Libertas Academica
Publication date: 01/01/2008
Field of study

IMAGE is an application tool, based on the vector quantization method, aiding the discovery of nucleotidic sequences corresponding to Transcription Factor binding sites. Starting from the knowledge of regulation regions of a number of co-expressed genes, the software is able to predict the occurrence of specific motifs of different lengths (starting from 6 base pairs) with a defined number of punctual mutations

Crossref

Directory of Open Access Journals

PubMed Central

Open Access Repository

Correlation between binding rate constants and individual information of E. coli Fis binding sites

Author: Fisher Robert J.
Lyakhov Ilya G.
Roberts Lindsey R.
Schneider Thomas D.
Shultzaberger Ryan K.
Sidorov Igor A.
Stephen Andrew G.
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

Individual protein binding sites on DNA can be measured in bits of information. This information is related to the free energy of binding by the second law of thermodynamics, but binding kinetics appear to be inaccessible from sequence information since the relative contributions of the on- and off-rates to the binding constant, and hence the free energy, are unknown. However, the on-rate could be independent of the sequence since a protein is likely to bind once it is near a site. To test this, we used surface plasmon resonance and electromobility shift assays to determine the kinetics for binding of the Fis protein to a range of naturally occurring binding sites. We observed that the logarithm of the off-rate is indeed proportional to the individual information of the binding sites, as predicted. However, the on-rate is also related to the information, but to a lesser degree. We suggest that the on-rate is mostly determined by DNA bending, which in turn is determined by the sequence information. Finally, we observed a break in the binding curve around zero bits of information. The break is expected from information theory because it represents the coding demarcation between specific and nonspecific binding

CiteSeerX

PubMed Central

Practical Strategies for Discovering Regulatory DNA Sequence Motifs

Author: Fraenkel Ernest
MacIsaac Kenzie D
Publication venue: Public Library of Science
Publication date: 01/04/2006
Field of study

Crossref

Directory of Open Access Journals

PubMed Central

Inferring transcription factor complexes from ChIP-seq data

Author: Bailey Timothy L.
Frith Martin C.
Johnson James
Whitington Tom
Publication venue: Oxford University Press
Publication date: 01/08/2011
Field of study

Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) allows researchers to determine the genome-wide binding locations of individual transcription factors (TFs) at high resolution. This information can be interrogated to study various aspects of TF behaviour, including the mechanisms that control TF binding. Physical interaction between TFs comprises one important aspect of TF binding in eukaryotes, mediating tissue-specific gene expression. We have developed an algorithm, spaced motif analysis (SpaMo), which is able to infer physical interactions between the given TF and TFs bound at neighbouring sites at the DNA interface. The algorithm predicts TF interactions in half of the ChIP-seq data sets we test, with the majority of these predictions supported by direct evidence from the literature or evidence of homodimerization. High resolution motif spacing information obtained by this method can facilitate an improved understanding of individual TF complex structures. SpaMo can assist researchers in extracting maximum information relating to binding mechanisms from their TF ChIP-seq data. SpaMo is available for download and interactive use as part of the MEME Suite (http://meme.nbcr.net)

PubMed Central

University of Queensland eSpace

Determining Physical Constraints in Transcriptional Initiation Complexes Using DNA Sequence Analysis

Author: Chiang Derek Y.
Eisen Michael B.
Moses Alan M.
Shultzaberger Ryan K.
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

Eukaryotic gene expression is often under the control of cooperatively acting transcription factors whose binding is limited by structural constraints. By determining these structural constraints, we can understand the “rules” that define functional cooperativity. Conversely, by understanding the rules of binding, we can infer structural characteristics. We have developed an information theory based method for approximating the physical limitations of cooperative interactions by comparing sequence analysis to microarray expression data. When applied to the coordinated binding of the sulfur amino acid regulatory protein Met4 by Cbf1 and Met31, we were able to create a combinatorial model that can correctly identify Met4 regulated genes. Interestingly, we found that the major determinant of Met4 regulation was the sum of the strength of the Cbf1 and Met31 binding sites and that the energetic costs associated with spacing appeared to be minimal

CiteSeerX

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

UNT Digital Library

An Overview of the Importance of Conformational Flexibility in Gene Regulation by the Transcription Factors

Author: Khan Shagufta H.
Kumar Raj
Publication venue: Hindawi Publishing Corporation
Publication date
Field of study

A number of proteins with intrinsically disordered (ID) regions/domains are reported to be found disproportionately higher in transcription factors. Available evidences suggest that presence of ID region/domain within a transcription factor plays an important role in its biological functions. These ID sequences provide large flexible surfaces that can allow them to make more efficient physical and functional interactions with their target partners. Since transcription factors regulate expression of target genes by interacting with specific coregulatory proteins, these ID regions/domains can be used as a platform for such large macromolecular interactions, and may represent a mechanism for regulation of cellular processes. The precise structural basis for the function of these ID regions/domains of the transcription factors remains to be determined. In the recent years there has been growing evidence suggesting that an induced fit-like process leads to imposition of folded functional structure in these ID domains on which large multiprotein complexes are built. These multiprotein complexes may eventually dictate the final outcome of the gene regulation by the transcription factors

Crossref

PubMed Central