252 research outputs found

Hierarchy and Feedback in the Evolution of the E. coli Transcription Network

Author: Atkinson
B. Bassetti
Babu
Becskei
Charlebois
Conant
Dekel
Gough
H. Isambert
Harbison
Lagomarsino
Lee
M. Cosentino Lagomarsino
Ma
Ma
Madan Babu
Mazurie
Milo
P. Jona
P l
Rosenfeld
Salgado
Schneider
Shen-Orr
Teichmann
Thieffry
Thomas
Wall
Warren
Publication venue: Proceedings of the National Academy of Sciences
Publication date: 01/01/2007

The E. coli transcription network has an essentially feedforward structure, with, however, abundant feedback at the level of self-regulations. Here, we investigate how these properties emerged during evolution. An assessment of the role of gene duplication based on protein domain architecture shows that (i) transcriptional autoregulators have mostly arisen through duplication, while (ii) the expected feedback loops stemming from their initial cross-regulation are strongly selected against. This requires a divergent coevolution of the transcription factor DNA-binding sites and their respective DNA cis-regulatory regions. Moreover, we find that the network tends to grow by expansion of the existing hierarchical layers of computation, rather than by addition of new layers. We also argue that rewiring of regulatory links due to mutation/selection of novel transcription factor/DNA binding interactions appears not to significantly affect the network global hierarchy, and that horizontally transferred genes are mainly added at the bottom, as new target nodes. These findings highlight the important evolutionary roles of both duplication and selective deletion of crosstalks between autoregulators in the emergence of the hierarchical transcription network of E. coli. Comment: to appear in PNAS

arXiv.org e-Print Archive

Crossref

AIR Universita degli studi di Milano

PubMed Central

Validating module network learning algorithms using simulated data

Author: A Battle
A Butte
AA Petti
AJ Butte
Anagha Joshi
AP Gasch
CE Shannon
CT Harbison
D Pe'er
E Segal
Eric Bonnet
HW Ma
J Kasturi
J Sinkkonen
K Basso
K Lemmens
KA Heller
Kathleen Marchal
Koenraad Van Leemput
LH Hartwell
M Ashburner
MA Beer
Martin Kuiper
MJL de Hoon
N Friedman
NM Luscombe
Piet van Remortel
S Maere
Steven Maere
T Ideker
T Van den Bulcke
Tim Van den Bulcke
Tom Michoel
X Xu
Y Garten
Yvan Saeys
Yves Van de Peer
Z Bar-Joseph
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

In recent years, several authors have used probabilistic graphical models to learn expression modules and their regulatory programs from gene expression data. Here, we demonstrate the use of the synthetic data generator SynTReN for the purpose of testing and comparing module network learning algorithms. We introduce a software package for learning module networks, called LeMoNe, which incorporates a novel strategy for learning regulatory programs. Novelties include the use of a bottom-up Bayesian hierarchical clustering to construct the regulatory programs, and the use of a conditional entropy measure to assign regulators to the regulation program nodes. Using SynTReN data, we test the performance of LeMoNe in a completely controlled situation and assess the effect of the methodological changes we made with respect to an existing software package, namely Genomica. Additionally, we assess the effect of various parameters, such as the size of the data set and the amount of noise, on the inference performance. Overall, application of Genomica and LeMoNe to simulated data sets gave comparable results. However, LeMoNe offers some advantages, one of them being that the learning process is considerably faster for larger data sets. Additionally, we show that the location of the regulators in the LeMoNe regulation programs and their conditional entropy may be used to prioritize regulators for functional validation, and that the combination of the bottom-up clustering strategy with the conditional entropy-based assignment of regulators improves the handling of missing or hidden regulators. Comment: 13 pages, 6 figures + 2 pages, 2 figures supplementary information
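
To illustrate the conditional entropy idea mentioned in the abstract above, the sketch below scores candidate regulators by how well their thresholded expression explains a two-way split of experimental conditions. It is not LeMoNe's implementation: the function names, the fixed threshold, and the toy data are assumptions made for this example only.

```python
from collections import Counter
from math import log2


def conditional_entropy(labels, states):
    """Return H(labels | states) in bits for two equal-length discrete sequences."""
    n = len(labels)
    joint = Counter(zip(states, labels))
    marginal = Counter(states)
    h = 0.0
    for (state, _label), count in joint.items():
        # p(state, label) * log2 p(label | state)
        h -= (count / n) * log2(count / marginal[state])
    return h


def best_regulator(split, expression, threshold=0.0):
    """Pick the regulator whose on/off state leaves the least uncertainty about `split`.

    split      -- one label per condition (e.g. "L" or "R" side of a tree node)
    expression -- hypothetical dict: regulator name -> expression value per condition
    threshold  -- assumed cut-off that turns expression into an on/off state
    """
    scores = {}
    for name, values in expression.items():
        states = [value > threshold for value in values]
        scores[name] = conditional_entropy(split, states)
    return min(scores, key=scores.get), scores


# Toy data: regA tracks the split perfectly, regB is uninformative.
split = ["L", "L", "R", "R"]
expression = {"regA": [-1.0, -2.0, 1.0, 2.0], "regB": [1.0, -1.0, 1.0, -1.0]}
winner, scores = best_regulator(split, expression)
```

A lower conditional entropy means the regulator's state says more about which side of the split a condition falls on, so the regulator with the minimum score is chosen.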

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

Ghent University Academic Bibliography

PubMed Central

Edinburgh Research Explorer

Archivsystem Ask23

HAL-CEA

Discovery of protein–DNA interactions by penalized multivariate regression

Author: Beer
Ben-Yehuda
Bonner
Buck
Gupta
Harbison
Hinnebusch
Hughes
L. Zamdborg
Lindquist
Liu
Macisaac
Matthews
Menacho-Marquez
P. Ma
Pavesi
Roven
Troyanskaya
Wade
Warner
Wei
Whitlock
Wu
Publication venue: Oxford University Press

Discovering which regulatory proteins, especially transcription factors (TFs), are active under certain experimental conditions and identifying the corresponding binding motifs is essential for understanding the regulatory circuits that control cellular programs. The experimental methods used for this purpose are laborious. Computational methods have been proven extremely effective in identifying TF-binding motifs (TFBMs). In this article, we propose a novel computational method called MotifExpress for discovering active TFBMs. Unlike existing methods, which either use only DNA sequence information or integrate sequence information with a single-sample measurement of gene expression, MotifExpress integrates DNA sequence information with gene expression measured in multiple samples. By selecting TFBMs that are significantly associated with gene expression, we can identify active TFBMs under specific experimental conditions and thus provide clues for the construction of regulatory networks. Compared with existing methods, MotifExpress substantially reduces the number of spurious results. Statistically, MotifExpress uses a penalized multivariate regression approach with a composite absolute penalty, which is highly stable and can effectively find the globally optimal set of active motifs. We demonstrate the excellent performance of MotifExpress by applying it to synthetic data and real examples of Saccharomyces cerevisiae. MotifExpress is available at http://www.stat.illinois.edu/~pingma/MotifExpress.htm

Crossref

PubMed Central

Information content based model for the topological properties of the gene regulatory network of Escherichia coli

Author: Albert
Alberts
Almirantis
Avery
Ayşe Erzan
Babu
Balcan
Balcan
Balcan
Banzhaf
Barabasi
Barabasi
Benos
Berg
Bergmann
Berkin Malkoç
Bilu
Bollobás
Browning
Buldyrev
Colizza
Colizza
Dawkins
Dawkins
Dobrin
Dodd
Dorogovtsev
Duygu Balcan
Erdös
Erdös
Gama-Castro
Gerland
Gershenzon
Guelzim
Harbison
Jeong
Kashtan
Kauffman
Kim
Koralov
Kugiumtzis
Li
Lynch
Ma
Matsumoto
Milo
Milo
Mungan
Münch
Okuda
O’Flanagan
Pachkov
Reil
Rudd
Salgado
Salgado
Samal
Sengun
Sengupta
Shannon
Shearwin
Sneppen
Spirin
Stormo
Teixeira
van Nimwegen
van Noort
Vazquez
Wagner
Warren
Watson
Wernicke
Zhou
Publication venue: Elsevier BV
Publication date: 29/12/2009

Gene regulatory networks (GRN) are being studied with increasingly precise quantitative tools and can provide a testing ground for ideas regarding the emergence and evolution of complex biological networks. We analyze the global statistical properties of the transcriptional regulatory network of the prokaryote Escherichia coli, identifying each operon with a node of the network. We propose a null model for this network using the content-based approach applied earlier to the eukaryote Saccharomyces cerevisiae (Balcan et al., 2007). Random sequences that represent promoter regions and binding sequences are associated with the nodes. The length distributions of these sequences are extracted from the relevant databases. The network is constructed by testing for the occurrence of binding sequences within the promoter regions. The ensemble of emergent networks yields an exponentially decaying in-degree distribution and a putative power law dependence for the out-degree distribution with a flat tail, in agreement with the data. The clustering coefficient, degree-degree correlation, rich club coefficient and k-core visualization all agree qualitatively with the empirical network to an extent not yet achieved by any other computational model, to our knowledge. The significant statistical differences can point the way to further research into non-adaptive and adaptive processes in the evolution of the E. coli GRN. Comment: 58 pages, 3 tables, 22 figures. In press, Journal of Theoretical Biology (2009)
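
The construction rule described in this abstract (an edge exists when one node's binding sequence occurs inside another node's promoter sequence) can be sketched in a few lines. This is a minimal toy version under stated assumptions: the four-letter alphabet, the fixed promoter length, the uniform range of binding-sequence lengths, and the choice to give every node a binding sequence are placeholders, whereas the paper draws its length distributions from databases.

```python
import random


def random_sequence(length, rng, alphabet="ACGT"):
    """Draw a uniformly random sequence of the given length."""
    return "".join(rng.choice(alphabet) for _ in range(length))


def build_network(promoters, binding_seqs):
    """Return directed edges (i, j) where node i's binding sequence occurs in node j's promoter."""
    edges = set()
    for i, site in binding_seqs.items():
        for j, promoter in promoters.items():
            if site in promoter:
                edges.add((i, j))
    return edges


def degree_counts(edges, nodes):
    """Count in-degree and out-degree of every node."""
    in_deg = {n: 0 for n in nodes}
    out_deg = {n: 0 for n in nodes}
    for i, j in edges:
        out_deg[i] += 1
        in_deg[j] += 1
    return in_deg, out_deg


# One member of the random ensemble, with assumed (hypothetical) lengths.
rng = random.Random(0)
nodes = list(range(100))
promoters = {n: random_sequence(100, rng) for n in nodes}
binding_seqs = {n: random_sequence(rng.randint(4, 8), rng) for n in nodes}
edges = build_network(promoters, binding_seqs)
in_deg, out_deg = degree_counts(edges, nodes)
```

Short binding sequences match many promoters and long ones match few, which is why the out-degree of a node depends strongly on the length of its binding sequence in a model of this kind.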

arXiv.org e-Print Archive

Crossref

Algorithm for prediction of tumour suppressor p53 affinity for binding sites in DNA

Author: Alan R. Fersht
Ang
Benos
Berg
Bulyk
Cawley
Dmitry B. Veprintsev
El-Deiry
Hainaut
Hallikas
Harbison
Joerger
Kaneshiro
Kitayner
Lim
Linnell
Liu
Liu
Liu
Ma
Man
Menendez
Nikolova
Qian
Schneider
Stormo
Stormo
Tanaka
Tomso
Udalova
Veprintsev
Vogelstein
Vousden
Wei
Weinberg
Weinberg
Wu
Publication venue: Oxford University Press
Publication date: 01/01/2008

The tumour suppressor p53 is a transcription factor that binds DNA in the vicinity of the genes it controls. The affinity of p53 for specific binding sites relative to other DNA sequences is an inherent driving force for specificity, all other things being equal. We measured the binding affinities of systematically mutated consensus p53 DNA-binding sequences using automated fluorescence anisotropy titrations. Based on measurements of the effects of every possible single base-pair substitution of a consensus sequence, we defined the DNA sequence with the highest affinity for full-length p53 and quantified the effects of deviation from it on the strength of protein–DNA interaction. The contributions of individual nucleotides were to a first approximation independent and additive, but in some cases we observed significant deviations from additivity. Based on affinity data, we constructed a binding predictor that mirrored the existing p53 consensus sequence definition. We used it to search for high-affinity binding sites in the genome and to predict the effects of single-nucleotide polymorphisms in these sites. Although there was some correlation between the Kd and biological function, the spread of the Kds by itself was not sufficient to explain the activation of different pathways by changes in p53 concentration alone.
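
The statement that nucleotide contributions are approximately independent and additive suggests a simple position-wise scoring scheme. The sketch below shows that idea with a three-position toy table; the penalty values, the table width, and the function names are invented for illustration and are not the measured p53 data or the authors' predictor.

```python
def additive_penalty(site, penalties):
    """Sum the per-position penalties of `site`; penalties[i][base] is the cost at position i."""
    if len(site) != len(penalties):
        raise ValueError("site length must match the penalty table")
    return sum(penalties[i][base] for i, base in enumerate(site))


def best_site(penalties):
    """Return the sequence that takes the lowest-penalty base at every position."""
    return "".join(min(column, key=column.get) for column in penalties)


def scan(sequence, penalties):
    """Score every window of `sequence` and return (start, penalty) pairs."""
    width = len(penalties)
    return [
        (start, additive_penalty(sequence[start:start + width], penalties))
        for start in range(len(sequence) - width + 1)
    ]


# Hypothetical penalty table (arbitrary units); 0.0 marks the preferred base.
PENALTIES = [
    {"A": 0.0, "C": 1.0, "G": 0.5, "T": 2.0},
    {"A": 1.0, "C": 0.0, "G": 1.5, "T": 0.5},
    {"A": 2.0, "C": 1.0, "G": 0.0, "T": 1.0},
]

hits = scan("TACGA", PENALTIES)
strongest = min(hits, key=lambda hit: hit[1])
```

Under strict additivity the effect of a double substitution is the sum of the two single-substitution penalties; the deviations from additivity reported in the abstract are exactly what such a table cannot capture.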

CiteSeerX

Crossref

PubMed Central

The impact of the UK 'Act FAST' stroke awareness campaign: Content analysis of patients, witness and primary care clinicians' perceptions

Author: C Hodgson
Falko F Sniehotta
Gary A Ford
GJY Peters
Helen Rodgers
J Harbison
J Lecouturier
JE Bray
JE Mackintosh
Joan E Mackintosh
K Krippendorff
K Witte
L Mellon
MA Wakefield
Madeleine J Murtagh
Martin P Eccles
Martin White
National Audit Office
RAC Ruiter
Richard G Thomson
SM Davis
SM Noar
SP Jones
Stephan U Dombrowski
SU Dombrowski
TG Robinson
Vera Araujo-Soares
Y Teuschl
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2013

Background: The English mass media campaign ‘Act FAST’ aimed to raise stroke awareness and the need to call emergency services at the onset of suspected stroke. We examined the perceived impact and views of the campaign in target populations to identify potential ways to optimise mass-media interventions for stroke. Methods: Analysis of semi-structured interviews conducted as part of two qualitative studies, which examined factors influencing patient/witness response to acute stroke symptoms (n = 19 stroke patients, n = 26 stroke witnesses) and perceptions about raising stroke awareness in primary care (n = 30 clinicians). Both studies included questions about the ‘Act FAST’ campaign. Interviews were content analysed to determine campaign awareness, perceived impact on decisions and response to stroke, and views of the campaign. Results: Most participants were aware of the Act FAST campaign. Some patients and witnesses reported that the campaign impacted upon their stroke recognition and response, but the majority reported no impact. Clinicians often perceived campaign success in raising stroke awareness, but few thought it would change response behaviours. Some patients and witnesses, and most primary care clinicians expressed positive views towards the campaign. Some more critical participant comments included perceptions of dramatic, irrelevant, and potentially confusing content, such as a prominent ‘fire in the brain’ analogy. Conclusions: Act FAST has had some perceived impact on stroke recognition and response in some stroke patients and witnesses, but the majority reported no campaign impact. Primary care clinicians were positive about the campaign, and believed it had impacted on stroke awareness and recognition but doubted impact on response behaviour. Potential avenues for optimising and complementing mass media campaigns such as ‘Act FAST’ were identified.

Crossref

Stirling Online Research Repository (RIOXX)

Springer - Publisher Connector

PubMed Central

Stirling Online Research Repository

Explore Bristol Research

Leicester Research Archive

Coordination logic of the sensing machinery in the transcriptional regulatory network of Escherichia coli

Author: Agustino Martínez-Antonio
Alon
Anantharaman
Aravind
Balaji
Balazsi
Balazsi
Bar-Joseph
Browning
Collado-Vides
Dobrin
Ghosh
Gutierrez-Rios
Harbison
Heladia Salgado
Hermsen
Ihmels
Janga
Julio Collado-Vides
Lagomarsino
Luijsterburg
Luscombe
Ma
Ma
Madan Babu
Mangan
Mangan
Mangan
Martinez-Antonio
Martinez-Antonio
McAdams
Menchaca-Mendez
Milo
Perez-Rueda
Resendis-Antonio
Salgado
Sarath Chandra Janga
Segal
Shen-Orr
Thattai
Travers
Wall
Wang
Wolf
Yu
Zaslaver
Publication venue: Oxford University Press
Publication date: 01/10/2007

The active and inactive state of transcription factors in growing cells is usually directed by allosteric physicochemical signals or metabolites, which are in turn either produced in the cell or obtained from the environment by the activity of the products of effector genes. To understand the regulatory dynamics and to improve our knowledge about how transcription factors (TFs) respond to endogenous and exogenous signals in the bacterial model, Escherichia coli, we previously proposed to classify TFs into external, internal and hybrid sensing classes depending on the source of their allosteric or equivalent metabolite. Here we analyze how a cell uses its topological structures in the context of sensing machinery and show that, while feed-forward loops (FFLs) tightly integrate internal and external sensing TFs connecting TFs from different layers of the hierarchical transcriptional regulatory network (TRN), bifan motifs frequently connect TFs belonging to the same sensing class and could act as a bridge between TFs originating from the same level in the hierarchy. We observe that modules identified in the regulatory network of E. coli are heterogeneous in sensing context with a clear combination of internal and external sensing categories depending on the physiological role played by the module. We also note that the propensity of two-component response regulators at promoters increases as the number of TFs regulating a target operon increases. Finally we show that evolutionary families of TFs do not show a tendency to preserve their sensing abilities. Our results provide a detailed panorama of the topological structures of the E. coli TRN and of the way the TFs that compose them sense their surroundings and coordinate responses.

Crossref

IUPUIScholarWorks

PubMed Central

Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

Author: A Ambesi-Impiombato
A Blais
A Eto
A Subramanian
AE Kel
AG Clark
AL Lam
AM McGuire
Anat Reiner
Assif Yitzhaky
B Ren
C Kimura-Yoshida
C Plessy
C Yang
CT Harbison
D Pfeifer
D Wang
DB Allison
E Emberly
E Segal
Eytan Domany
FP Roth
GC Pipes
GC Yuan
GQ Yao
GZ Hertz
H Li
H Lodish
J Zheng
JD Hughes
JL DeRisi
JQ Ling
K Frech
K Quandt
KD MacIsaac
L Amir-Zilberstein
L Elnitski
L Marino-Ramirez
L McCue
M Ashburner
M Kellis
M Milyavsky
MA Nobrega
Mark Koudritsky
MC Frith
ML Howard
ML Whitfield
N Rajewsky
Or Zuk
P Carninci
P Cliften
PM Haverty
PR Buckland
R Elkon
R Liu
R Sharan
Ran Brosh
S Aerts
S Rashi-Elkeles
S Tavazoie
SJ Cooper
SJ Ho Sui
Sui Huang
U Gerland
Varda Rotter
WW Wasserman
X Xie
Y Barash
Y Benjamini
Y Tabach
Yossi Buganim
Yuval Tabach
Z Wang
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2007

We introduce a novel method to screen the promoters of a set of genes with shared biological function, against a precompiled library of motifs, and find those motifs which are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold, independently for every location window (measured with respect to the TSS), taking into account the location dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high throughput analysis, searching the promoters (from 200 bp downstream to 1000 bp upstream of the TSS), of more than 8,000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location specific search method with conservation between human and mouse, we are able to reliably identify functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast. Comment: 31 pages, including Supplementary Information and figures
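
For readers unfamiliar with motif over-representation, a common generic way to test it is the hypergeometric tail probability. The sketch below shows that standard test only; it is not the authors' method, which additionally optimizes score thresholds per location window, and the counts in the example call are invented.

```python
from math import comb


def hypergeom_tail(total_genes, genes_with_motif, set_size, hits_in_set):
    """Return P(X >= hits_in_set) for a hypergeometric draw.

    total_genes      -- number of genes in the background
    genes_with_motif -- background genes whose promoter contains the motif
    set_size         -- number of genes in the functional gene set
    hits_in_set      -- gene-set members whose promoter contains the motif
    """
    upper = min(genes_with_motif, set_size)
    favourable = sum(
        comb(genes_with_motif, x) * comb(total_genes - genes_with_motif, set_size - x)
        for x in range(hits_in_set, upper + 1)
    )
    return favourable / comb(total_genes, set_size)


# Hypothetical counts: 8000 background genes, 400 carry the motif,
# and 12 of the 50 genes in the set carry it.
p_value = hypergeom_tail(8000, 400, 50, 12)
```

A small tail probability indicates that the motif occurs in the gene set more often than expected by chance; when many motifs and gene sets are screened, the resulting p-values need a multiple-testing correction.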

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Dissecting complex transcriptional responses using pathway-level scores based on prior information

Author: A Subramanian
Andre Boorsma
BC Foat
CT Harbison
DH Nguyen
E Segal
EM Conlon
F Gao
GD Stormo
Harmen J Bussemaker
HJ Bussemaker
J van Helden
JC Liao
Lucas D Ward
M Ashburner
M Middendorf
MA Beer
MB Eisen
N Friedman
P Khatri
PT Spellman
R Lascaris
S Grossmann
S Tavazoie
SY Kim
TR Hughes
VK Mootha
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: The genomewide pattern of changes in mRNA expression measured using DNA microarrays is typically a complex superposition of the response of multiple regulatory pathways to changes in the environment of the cells. The use of prior information, either about the function of the protein encoded by each gene, or about the physical interactions between regulatory factors and the sequences controlling its expression, has emerged as a powerful approach for dissecting complex transcriptional responses. Results: We review two different approaches for combining the noisy expression levels of multiple individual genes into robust pathway-level differential expression scores. The first is based on a comparison between the distribution of expression levels of genes within a predefined gene set and those of all other genes in the genome. The second starts from an estimate of the strength of genomewide regulatory network connectivities based on sequence information or direct measurements of protein-DNA interactions, and uses regression analysis to estimate the activity of gene regulatory pathways. The statistical methods used are explained in detail. Conclusion: By avoiding the thresholding of individual genes, pathway-level analysis of differential expression based on prior information can be considerably more sensitive to subtle changes in gene expression than gene-level analysis. The methods are technically straightforward and yield results that are easily interpretable, both biologically and statistically.
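
The first approach in this abstract, comparing expression levels inside a gene set with those of all other genes, can be illustrated with a two-sample t-statistic. This is a sketch under assumptions: the unequal-variance (Welch) form is one possible choice rather than necessarily the statistic used in the review, and the gene names and log-ratios are invented.

```python
from math import sqrt
from statistics import mean, variance


def gene_set_t_score(expression, gene_set):
    """Welch t-statistic comparing genes in `gene_set` with all remaining genes.

    expression -- hypothetical dict: gene name -> log expression ratio
    gene_set   -- collection of gene names forming the pathway or category
    """
    inside = [value for gene, value in expression.items() if gene in gene_set]
    outside = [value for gene, value in expression.items() if gene not in gene_set]
    if len(inside) < 2 or len(outside) < 2:
        raise ValueError("need at least two genes inside and outside the set")
    # Standard error of the difference in means; this is zero, and the
    # division below fails, if both groups have no spread at all.
    standard_error = sqrt(variance(inside) / len(inside) + variance(outside) / len(outside))
    return (mean(inside) - mean(outside)) / standard_error


# Toy log-ratios: the two set members are shifted upwards relative to the rest.
expression = {"g1": 2.0, "g2": 3.0, "g3": 0.0, "g4": 1.0, "g5": -1.0}
score = gene_set_t_score(expression, {"g1", "g2"})
```

Because every gene contributes to the score, no per-gene significance threshold is needed, which is the source of the added sensitivity described in the conclusion.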

Crossref

Springer - Publisher Connector

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

Mapping Genetically Compensatory Pathways from Synthetic Lethal Interactions in Yeast

Author: A Tong
Aaron M. Tarone
AP Davierwala
BC Osmond
C Boone
C Harbison
C Stark
C Wiggins
EA Winzeler
G Giaever
G Karp
G Lesage
I Ulitsky
M Ashburner
M Kanehisa
M Schuldiner
P Shannon
P Uetz
P Ye
R Kelley
R Milo
R Nash
S Munro
Suzannah Rutherford
T Ito
T Yoko-o
Wenyuan Li
Xiaotu Ma
Publication venue: Public Library of Science
Publication date: 01/01/2008

Background: Synthetic lethal genetic interaction analysis has been successfully applied to predicting the functions of genes and their pathway identities. In the context of synthetic lethal interaction data alone, the global similarity of synthetic lethal interaction patterns between two genes is used to predict gene function. With physical interaction data, such as protein-protein interactions, the enrichment of physical interactions within subsets of genes and the enrichment of synthetic lethal interactions between those subsets of genes are used as an indication of compensatory pathways. Result: In this paper, we propose a method of mapping genetically compensatory pathways from synthetic lethal interactions. Our method is designed to discover pairs of gene-sets in which synthetic lethal interactions are depleted among the genes in an individual set and where such gene-set pairs are connected by many synthetic lethal interactions. By its nature, our method could select compensatory pathway pairs that buffer the deleterious effect of the failure of either one, without the need of physical interaction data. By focusing on compensatory pathway pairs where genes in each individual pathway have a highly homogenous cellular function, we show that many cellular functions have genetically compensatory properties. Conclusion: We conclude that synthetic lethal interaction data are a powerful source to map genetically compensatory pathways, especially in systems lacking physical interaction information, and that the cellular function network contain
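
The pattern this abstract looks for (few synthetic lethal interactions inside each gene set, many between the two sets) can be made concrete by counting interactions per category. The sketch below only performs that count for two given, disjoint sets; the gene names are invented, and the search over candidate set pairs and any significance assessment used by the authors are not reproduced.

```python
def interaction_counts(interactions, set_a, set_b):
    """Count synthetic lethal pairs within set_a, within set_b, and between them.

    interactions -- iterable of unordered gene pairs (g1, g2)
    set_a, set_b -- disjoint collections of gene names (assumed, not checked)
    """
    within_a = within_b = between = 0
    for g1, g2 in interactions:
        if g1 in set_a and g2 in set_a:
            within_a += 1
        elif g1 in set_b and g2 in set_b:
            within_b += 1
        elif (g1 in set_a and g2 in set_b) or (g1 in set_b and g2 in set_a):
            between += 1
    return within_a, within_b, between


# Toy data: most interactions bridge the two sets, as expected for a
# compensatory pathway pair; the last pair touches neither set fully.
interactions = [("a1", "b1"), ("a1", "b2"), ("a2", "b1"), ("a1", "a2"), ("b1", "c1")]
counts = interaction_counts(interactions, {"a1", "a2"}, {"b1", "b2"})
```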

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central