
3,632 research outputs found

Defining the Plasticity of Transcription Factor Binding Sites by Deconstructing DNA Consensus Sequences: The PhoP-Binding Sites among Gamma/Enterobacteria

Author: A Aguirre
A Hochschild
A Kato
A Manson McGuire
A Martinez-Antonio
AG Blanco
AH Ko
AL Halpern
AM Moses
AM Moses
AP Gasch
B Anand
B Everitt
C Mouslim
C Mouslim
D Greene
D Knuth
D Shin
DF Browning
E Alm
E Bauer
E Benitez-Bellon
EA Groisman
EA Groisman
EA Groisman
Eduardo A. Groisman
F Depardieu
F Herrera
GD Stormo
GJ Klir
GK Smyth
GZ Hertz
H Li
H O'Geen
H Ochman
H Salgado
H Salgado
Henry Huang
HR Berenji
I Holmes
I Zwir
I Zwir
Igor Zwir
J Gertz
JA Hering
JC Bezdek
JC Perez
JC Perez
JD Hughes
JT Wade
K Deb
K Hollands
L McCue
L Ni
M Sugeno
M Thomas-Chollier
M Tompa
MB Eisen
MD Snavely
N Rajewsky
O Cordon
Oscar Harari
P Hong
P Monsieurs
QX Liu
R Janky
R Kohavi
R Krishnapuram
R Nadon
S Lejona
S Mahony
S Minagawa
S Roy
S Tavazoie
SL Pond
Sun-Yang Park
T-P Hong
TL Bailey
TL Bailey
TM Mitchell
Wyeth W. Wasserman
Y Barash
Y Benjamini
Y Setty
Publication venue: Public Library of Science
Publication date: 01/01/2010

Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. The detection of the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg2+ homeostasis in several bacterial species. PhoP is particularly interesting; it is highly conserved in different gamma/enterobacteria, regulating not only ancestral genes but also governing the expression of dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (termed submotifs) using a machine learning method inspired by the “Divide & Conquer” strategy. Partitioning a motif into sub-patterns produced computational advantages for classification, resulting in the discovery of new members of a regulon and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analysis. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not due to the binding sites, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target genes and/or the promoter architectures resulting from the interaction of those binding sites with the RNA polymerase.

Public Library of Science (PLOS)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

Repositorio Institucional Universidad de Granada

Digital Commons@Becker

Fusion of Domain Knowledge for Dynamic Learning in Transcriptional Networks

Author: Harari Óscar
Romero Zaliz Rocío
Rubio Escudero Cristina
Zwir Igor
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2006

A critical challenge of the postgenomic era is to understand how genes are differentially regulated even when they belong to a given network. Because the fundamental mechanism controlling gene expression operates at the level of transcription initiation, computational techniques have been developed that identify cis-regulatory features and map such features into differential expression patterns. The fact that such co-regulated genes may be differentially regulated suggests that subtle differences in the shared cis-acting regulatory elements are likely significant. Thus, we carry out an exhaustive description of cis-acting regulatory features including the orientation, location and number of binding sites for a regulatory protein, the presence of binding site submotifs, the class and number of RNA polymerase sites, as well as gene expression data, which is treated as one feature among many. These features, derived from different domain sources, are analyzed concurrently, and dynamic relations are recognized to generate profiles, which are groups of promoters sharing common features. We apply this method to probe the regulatory networks governed by the PhoP/PhoQ two-component system in the enteric bacteria Escherichia coli and Salmonella enterica. Our analysis uncovered novel members of the PhoP regulon as and the resulting profiles group genes that share underlying biological that characterize the system kinetics. The predictions were experimentally validated to establish that the PhoP protein uses multiple mechanisms to control gene transcription and is a central element in a highly connected network. Ministerio de Ciencia y Tecnología BIO2004-0270-

idUS. Depósito de Investigación Universidad de Sevilla

Identifying promoter features of co-regulated genes with similar network motifs

Author: del Val Coral
Groisman Eduardo A
Harari Oscar
Huang Henry
Romero-Zaliz Rocío
Shin Dongwoo
Zwir Igor
Publication venue: BioMed Central
Publication date: 01/01/2009

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2008, Philadelphia, PA, USA. 3–5 November 2008. Background: A large amount of computational and experimental work has been devoted to uncovering network motifs in gene regulatory networks. The leading hypothesis is that evolutionary processes independently selected recurrent architectural relationships among regulators and target genes (motifs) to produce characteristic expression patterns of their members. However, even with the same architecture, the genes may still be differentially expressed. Therefore, to define fully the expression of a group of genes, the strength of the connections in a network motif must be specified, and the cis-promoter features that participate in the regulation must be determined. Results: We have developed a model-based approach to analyze proteobacterial genomes for promoter features that is specifically designed to account for the variability in sequence, location and topology intrinsic to differential gene expression. We provide methods for annotating regulatory regions by detecting their subjacent cis-features. This includes identifying binding sites for a transcriptional regulator, distinguishing between activation and repression sites, direct and reverse orientation, and among sequences that weakly reflect a particular pattern; binding sites for the RNA polymerase, characterizing different classes, and locations relative to the transcription factor binding sites; the presence of riboswitches in the 5'UTR, and for other transcription factors. We applied our approach to characterize network motifs controlled by the PhoP/PhoQ regulatory system of Escherichia coli and Salmonella enterica serovar Typhimurium. We identified key features that enable the PhoP protein to control its target genes, and distinct features may produce different expression patterns even within the same network motif. Conclusion: Global transcriptional regulators control multiple promoters by a variety of network motifs. This is clearly the case for the regulatory protein PhoP. In this work, we studied this regulatory protein and demonstrated that understanding gene expression requires identifying not only a set of connexions or network motif, but also the cis-acting elements participating in each of these connexions. This research was supported in part by the Spanish Ministry of Science and Technology under project TIN2006-12879 and by Consejería de Innovación, Investigación y Ciencia de la Junta de Andalucía under project TIC02788.

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

PubMed Central

Repositorio Institucional Universidad de Granada

Digital Commons@Becker

NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data

Author: A Kundaje
A Mammana
A Nellore
A Polishko
A Polishko
A Valouev
A Weiner
AF Bardet
AL Hughes
AN Schep
AR Quinlan
B Langmead
B Wen
BE Bernstein
BS Sexton
C Angelini
C Jiang
C Zang
CY McLean
D Park
DA Beshnova
DA Orlando
DJ Gaffney
DS Johnson
E Eden
EM Berkowitz
EY Chen
F Krueger
F Ramirez
F Zambelli
FJ Sedlazeck
G Längst
G Moyle-Heyrman
GA Orsi
H Ishii
H Ji
H Li
H Thorvaldsdottir
H Younesy
HA Cole
HS Rhee
I Dubchak
I Livyatan
J Becker
J Feng
J Rozowsky
JA West
JD Buenrostro
K Brogaard
K Chen
K Fu
K Liang
Karsten Rippe
KE Holde van
L Teytelman
L Teytelman
L Wang
LD Stein
LN Voong
MJ Guertin
N Kaplan
N Krietenstein
NR Zabet
NU Nair
O Bell
O Flores
O Nikolayeva
P Humburg
PF Kuan
PJ Park
PV Kharchenko
R Jothi
R Schöpflin
RK Auerbach
S Heinz
S Kubik
S Ramachandran
S Woo
T Bailey
TK Kelly
V Gesu Di
VB Teif
VB Teif
VB Teif
VB Teif
Vladimir B. Teif
W Chen
W Huang da
W Ma
WB Langdon
X Zhang
Y Fu
Y Zhang
Y Zhang
Y Zhang
Yevhen Vainshtein
YL Jung
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

Background: Biomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them. Results: Here we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). Conclusions: The typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features such as binding of the transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations and problems we discuss how inter-replicate variability of MNase-seq experiments can be addressed.
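The replicate averaging and condition comparison mentioned in this abstract can be illustrated with a minimal Python sketch. This is not NucTools code (NucTools is described as a suite of Perl scripts); the function names `average_occupancy` and `occupancy_difference` are hypothetical, and occupancy profiles are assumed to be equal-length lists of per-position values.

```python
from statistics import mean


def average_occupancy(replicates):
    """Return the position-wise mean of several occupancy profiles.

    `replicates` is a list of equal-length lists, one per replicate
    (an assumed input layout, not the NucTools file format).
    """
    if not replicates:
        raise ValueError("need at least one replicate")
    length = len(replicates[0])
    if any(len(profile) != length for profile in replicates):
        raise ValueError("replicates must cover the same positions")
    # zip(*replicates) groups the values observed at each position.
    return [mean(values) for values in zip(*replicates)]


def occupancy_difference(condition_a, condition_b):
    """Return the averaged profile of condition B minus that of condition A."""
    avg_a = average_occupancy(condition_a)
    avg_b = average_occupancy(condition_b)
    if len(avg_a) != len(avg_b):
        raise ValueError("conditions must cover the same positions")
    return [b - a for a, b in zip(avg_a, avg_b)]
```

Keeping the profile as a list of continuous values, rather than thresholding it into present/absent calls, mirrors the abstract's point that the occupancy landscape is essentially continuous.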

University of Essex Research Repository

Crossref

Springer - Publisher Connector

Fraunhofer-ePrints

PubMed Central

Loss of function of myosin chaperones triggers Hsf1-mediated transcriptional response in skeletal muscle cells

Author: Christelle Etard
Marco Ferg
Olivier Armant
Urmas Roostalu
Uwe Strähle
Victor Gourain
Publication venue: Springer Nature
Publication date: 01/01/2015

Quality of sequences obtained with CASAVA 1.8.1 (Illumina) workflow. PF: reads passing Illumina chastity filter. (XLSX 46 kb)

Springer - Publisher Connector

PubMed Central

The University of Manchester - Institutional Repository

FigShare

Robust Detection of Hierarchical Communities from Escherichia coli Gene Expression Data

Author: A Beyer
AL Barabási
BH Good
BW Kernighan
CO Daub
D Duewer
D Marbach
DFT Veiga
E Bonnet
E Ravasz
E Segal
EH Davidson
F Luo
G Balázsi
G Getz
G Palla
G Palla
H Zare
HW Ma
J Chen
J Duch
J Hubble
J Lemke
J Reichardt
JJ Faith
JJ Faith
JN Weinstein
K Baggerly
Kevin E. Bassler
KY Yeung
M Blatt
M Riley
MB Eisen
MEJ Newman
MEJ Newman
MF Traxler
MM Barker
N Friedman
N Friedman
O Alter
PD Karp
Q Lu
R Guimerà
RA Irizarry
S Fortunato
S Fortunato
S Gama-Castro
S Raychaudhuri
S Tavazoie
Santiago Treviño
Satoru Miyano
SB Seidman
SB Seidman
SP Borgatii
SP Borgatii
TF Cooper
Tim F. Cooper
TS Gardner
U Brandes
UN Raghavan
X Wen
Y Benjamini
Y Sun
Yudong Sun
Z Shi
Publication venue: Public Library of Science (PLoS)
Publication date: 11/01/2012

Determining the functional structure of biological networks is a central goal of systems biology. One approach is to analyze gene expression data to infer a network of gene interactions on the basis of their correlated responses to environmental and genetic perturbations. The inferred network can then be analyzed to identify functional communities. However, commonly used algorithms can yield unreliable results due to experimental noise, algorithmic stochasticity, and the influence of arbitrarily chosen parameter values. Furthermore, the results obtained typically provide only a simplistic view of the network partitioned into disjoint communities and provide no information on the relationship between communities. Here, we present methods to robustly detect coregulated and functionally enriched gene communities and demonstrate their application and validity for Escherichia coli gene expression data. Applying a recently developed community detection algorithm to the network of interactions identified with the context likelihood of relatedness (CLR) method, we show that a hierarchy of network communities can be identified. These communities significantly enrich for gene ontology (GO) terms, consistent with them representing biologically meaningful groups. Further, analysis of the most significantly enriched communities identified several candidate new regulatory interactions. The robustness of our methods is demonstrated by showing that a core set of functional communities is reliably found when artificial noise, modeling experimental noise, is added to the data. We find that noise mainly acts conservatively, increasing the relatedness required for a network link to be reliably assigned and decreasing the size of the core communities, rather than causing association of genes into new communities. Comment: Due to appear in PLoS Computational Biology. Supplementary Figure S1 was not uploaded but is available by contacting the author. 27 pages, 5 figures, 15 supplementary files
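For readers unfamiliar with the CLR method named in this abstract, the following Python sketch shows the commonly described CLR scoring step: each pairwise mutual-information value is converted to a z-score against the background of each of the two genes, negative z-scores are clipped to zero, and the two are combined as a Euclidean norm. This is an illustration under stated assumptions, not the authors' pipeline: the function name `clr_scores` is hypothetical, the mutual-information matrix is assumed to be precomputed and symmetric, and the choice to exclude the diagonal and use the population standard deviation is an assumption of this sketch.

```python
import math
from statistics import mean, pstdev


def clr_scores(mi):
    """Compute CLR-style relatedness scores from a symmetric MI matrix.

    `mi[i][j]` is the mutual information between genes i and j.
    Background statistics for gene i are taken over row i without
    the diagonal entry (an assumption of this sketch).
    """
    n = len(mi)
    mu, sigma = [], []
    for i in range(n):
        background = [mi[i][j] for j in range(n) if j != i]
        mu.append(mean(background))
        sigma.append(pstdev(background))

    def z(i, j):
        # z-score of MI(i, j) against gene i's background, clipped at zero.
        if sigma[i] == 0:
            return 0.0
        return max(0.0, (mi[i][j] - mu[i]) / sigma[i])

    scores = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            score = math.sqrt(z(i, j) ** 2 + z(j, i) ** 2)
            scores[i][j] = scores[j][i] = score
    return scores
```

A network link would then be assigned to pairs whose score exceeds a chosen threshold; the abstract's observation that noise raises the relatedness required for a reliable link corresponds to raising that threshold.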

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Global analysis of patterns of gene expression during Drosophila embryogenesis

Author: Beaton Amy
Berman Benjamin P
Celniker Susan E
Hartenstein Volker
Kwan Elaine
Rubin Gerald M
Tomancak Pavel
Weiszmann Richard
Publication venue: BioMed Central
Publication date: 01/01/2007

Embryonic expression patterns for 6,003 (44%) of the 13,659 protein-coding genes identified in the Drosophila melanogaster genome were documented, of which 40% show tissue-restricted expression.

Crossref

PubMed Central

MPG.PuRe

Recommended from our members

scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles.

Author: Jin Suoqin
Nie Qing
Zhang Lihua
Publication venue: eScholarship, University of California
Publication date: 01/02/2020

Simultaneous measurements of transcriptomic and epigenomic profiles in the same individual cells provide an unprecedented opportunity to understand cell fates. However, effective approaches for the integrative analysis of such data are lacking. Here, we present a single-cell aggregation and integration (scAI) method to deconvolute cellular heterogeneity from parallel transcriptomic and epigenomic profiles. Through iterative learning, scAI aggregates sparse epigenomic signals in similar cells learned in an unsupervised manner, allowing coherent fusion with transcriptomic measurements. Simulation studies and applications to three real datasets demonstrate its capability of dissecting cellular heterogeneity within both transcriptomic and epigenomic layers and understanding transcriptional regulatory mechanisms.

eScholarship - University of California