49 research outputs found

Dissecting complex transcriptional responses using pathway-level scores based on prior information

Author: A Subramanian
Andre Boorsma
BC Foat
BC Foat
CT Harbison
DH Nguyen
E Segal
EM Conlon
F Gao
GD Stormo
Harmen J Bussemaker
HJ Bussemaker
J van Helden
JC Liao
Lucas D Ward
M Ashburner
M Middendorf
MA Beer
MB Eisen
N Friedman
P Khatri
PT Spellman
R Lascaris
S Grossmann
S Tavazoie
SY Kim
TR Hughes
VK Mootha
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: The genomewide pattern of changes in mRNA expression measured using DNA microarrays is typically a complex superposition of the response of multiple regulatory pathways to changes in the environment of the cells. The use of prior information, either about the function of the protein encoded by each gene, or about the physical interactions between regulatory factors and the sequences controlling its expression, has emerged as a powerful approach for dissecting complex transcriptional responses. Results: We review two different approaches for combining the noisy expression levels of multiple individual genes into robust pathway-level differential expression scores. The first is based on a comparison between the distribution of expression levels of genes within a predefined gene set and those of all other genes in the genome. The second starts from an estimate of the strength of genomewide regulatory network connectivities based on sequence information or direct measurements of protein-DNA interactions, and uses regression analysis to estimate the activity of gene regulatory pathways. The statistical methods used are explained in detail. Conclusion: By avoiding the thresholding of individual genes, pathway-level analysis of differential expression based on prior information can be considerably more sensitive to subtle changes in gene expression than gene-level analysis. The methods are technically straightforward and yield results that are easily interpretable, both biologically and statistically.
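
As a rough illustration of the first approach described in this abstract (comparing a gene set against all other genes), the sketch below computes a two-sample score for one gene set. The choice of a Welch t-statistic, the function name `gene_set_t_score`, and the toy expression values are assumptions made for illustration; the abstract does not specify the statistic the authors use.

```python
from math import sqrt
from statistics import mean, variance


def gene_set_t_score(expr, gene_set):
    """Welch t-statistic comparing genes inside a set with all other genes.

    expr: dict mapping gene name -> log expression ratio (hypothetical data).
    gene_set: set of gene names that belong to the predefined set.
    """
    inside = [v for g, v in expr.items() if g in gene_set]
    outside = [v for g, v in expr.items() if g not in gene_set]
    if len(inside) < 2 or len(outside) < 2:
        raise ValueError("need at least two genes inside and outside the set")
    # Standard error of the difference in means, with unequal variances.
    se = sqrt(variance(inside) / len(inside) + variance(outside) / len(outside))
    return (mean(inside) - mean(outside)) / se


# Toy example: two genes in the set, three outside it.
expr = {"g1": 2.0, "g2": 3.0, "g3": 0.0, "g4": 1.0, "g5": -1.0}
score = gene_set_t_score(expr, {"g1", "g2"})
```

Because every gene contributes to the score, no per-gene significance threshold is needed, which is the property the abstract's conclusion emphasizes.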

Crossref

Springer - Publisher Connector

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

Detecting microRNA binding and siRNA off-target effects from expression data.

Author: A Birmingham
A Rodriguez
A Subramanian
A Tanay
AJ Giraldez
AJ Giraldez
AL Jackson
Anton J Enright
BC Foat
BP Lewis
Cei Abreu-Goodger
E Eden
EM Anderson
KK Farh
M Tompa
MF Berger
P Sood
Stijn van Dongen
Publication venue: Nat Methods
Publication date: 02/11/2008

Sylamer is a method for detecting microRNA target and small interfering RNA off-target signals in 3' untranslated regions from a ranked gene list, sorted from upregulated to downregulated, after a microRNA perturbation or RNA interference experiment. The output is a landscape plot that tracks occurrence biases using hypergeometric P-values for all words across the gene ranking. We demonstrated the utility, speed and accuracy of this approach on several datasets.
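
The idea of tracking a hypergeometric P-value along a ranked list can be sketched as follows. This is a simplified illustration and not Sylamer's implementation: it scores presence or absence of a single word per sequence, and the function names and the toy sequences are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) when n items are drawn from N, of which K are marked."""
    total = comb(N, n)
    top = min(K, n)
    # math.comb returns 0 when the lower argument exceeds the upper one.
    return sum(comb(K, i) * comb(N - K, n - i) for i in range(k, top + 1)) / total


def enrichment_landscape(ranked_utrs, word):
    """For each leading cutoff n, test enrichment of `word` in the top n sequences.

    ranked_utrs: list of 3' UTR sequences, ordered by the gene ranking.
    Returns a list of (n, p_value) pairs, one per cutoff.
    """
    has_word = [word in utr for utr in ranked_utrs]
    N = len(has_word)
    K = sum(has_word)
    landscape = []
    for n in range(1, N + 1):
        k = sum(has_word[:n])  # sequences containing the word in the top n
        landscape.append((n, hypergeom_upper_tail(k, N, K, n)))
    return landscape


# Toy ranking: the word occurs only in the two top-ranked sequences.
utrs = ["AAGCACTT", "GCACTTAA", "TTTT", "CCCC"]
curve = enrichment_landscape(utrs, "GCACTT")
```

Plotting the negative logarithm of each P-value against the cutoff gives one curve of the kind of landscape the abstract describes; a full analysis would repeat this for every word of a given length.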

Crossref

PubMed Central

Apollo (Cambridge)

Investigation of EMIC wave scattering as the cause for the BARREL 17 January 2013 relativistic electron precipitation event: A quantitative comparison of simulation with observations

Author: Albert
Baker
Baker
Blake
Bortnik
Carson
Chappell
Chen
Chen
Chen
Chen
Comess
David M. Smith
Denton
Engebretson
Farrugia
Foat
Funsten
Gannon
Goldstein
Green
Harlan E. Spence
Hartley
Horne
Horwitz
Hu
Imhof
Jerry Goldstein
Joseph F. Fennell
Juan V. Rodriguez
Kletzing
Leslie A. Woodger
Li
Lorentzen
Loto'aniu
Lyons
Mark J. Engebretson
Mary K. Hudson
Mauk
Millan
Millan
Miyoshi
Onsager
Reiner Friedel
Robyn M. Millan
Rodger
Rodriguez
Sandanger
Spasojević
Spence
Summers
Summers
Takahashi
Thorne
Thorne
Thorne
Tsyganenko
Usanova
Vampola
West
Wygant
Yahnin
Young
Yue Chen
Zan Li
Publication venue: University of New Hampshire Scholars' Repository
Publication date: 28/12/2014

Electromagnetic ion cyclotron (EMIC) waves were observed at multiple observatory locations for several hours on 17 January 2013. During the wave activity period, a duskside relativistic electron precipitation (REP) event was observed by one of the Balloon Array for Radiation belt Relativistic Electron Losses (BARREL) balloons and was magnetically mapped close to Geostationary Operational Environmental Satellite (GOES) 13. We simulate the relativistic electron pitch angle diffusion caused by gyroresonant interactions with EMIC waves using wave and particle data measured by multiple instruments on board GOES 13 and the Van Allen Probes. We show that the count rate, the energy distribution, and the time variation of the simulated precipitation all agree very well with the balloon observations, suggesting that EMIC wave scattering was likely the cause for the precipitation event. The event reported here is the first balloon REP event with closely conjugate EMIC wave observations, and our study employs the most detailed quantitative analysis on the link of EMIC waves with observed REP to date. Key Points: Quantitative analysis of the first balloon REP with closely conjugate EMIC waves. Our simulation suggests EMIC waves to be a viable cause for the REP event. The adopted model is proved to be applicable to simulating the REP event.

Crossref

UNH Scholars' Repository

RNAcontext: A New Method for Learning the Sequence and Structure Binding Preferences of RNA-Binding Proteins

Author: AP Gerber
BC Foat
BC Foat
BC Foat
C Shin
CT Workman
D Ray
DE Tsai
Debashish Ray
DJ Hogan
E Segal
EM Conlon
Esther T. Chan
FB Gao
FC Oberstrass
G Badis
HG Roider
Hilal Kazan
I Perez
IL Silanes
J Hackermuller
J Ule
JA Granek
JD Keene
JD Keene
JR Sanford
KE Lukong
L Wickham
M Blanchette
M Hiller
M Rabani
MF Berger
MJ Law
ML Bulyk
MT Miller
NC Meisner
P Benos
Quaid Morris
R Tacke
RH Byrd
RJ Buckanovich
S Griffiths-Jones
S Sinha
Saurabh Sinha
SR Eddy
SR Eddy
T Aviv
Timothy R. Hughes
TL Bailey
TM Bailey
X Chen
X Li
X Wang
Y Ding
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010

Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequences associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Thermodynamic State Ensemble Models of cis-Regulation

Author: Barak A. Cohen
BC Foat
D van Essen
E Segal
EP Consortium
Fran Lewitter
J Gertz
J Gertz
J Goutsias
J Goutsias
J Wyman
K Sneppen
L Bintu
L Bintu
M Ptashne
M Ronen
MA Shea
Marc S. Sherman
MF Berger
N Rosenfeld
NAM Monk
NE Buchler
R Métivier
RJ Klose
RP Zinzen
RX Luo
S Kuttykrishnan
S Mangan
S Mukherjee
SA Gorski
TM Gruber
U Alon
WD Fakhouri
Publication venue: Public Library of Science
Publication date: 01/01/2012

A major goal in computational biology is to develop models that accurately predict a gene's expression from its surrounding regulatory DNA. Here we present one class of such models, thermodynamic state ensemble models. We describe the biochemical derivation of the thermodynamic framework in simple terms, and lay out the mathematical components that comprise each model. These components include (1) the possible states of a promoter, where a state is defined as a particular arrangement of transcription factors bound to a DNA promoter, (2) the binding constants that describe the affinity of the protein–protein and protein–DNA interactions that occur in each state, and (3) whether each state is capable of transcribing. Using these components, we demonstrate how to compute a cis-regulatory function that encodes the probability of a promoter being active. Our intention is to provide enough detail so that readers with little background in thermodynamics can compose their own cis-regulatory functions. To facilitate this goal, we also describe a matrix form of the model that can be easily coded in any programming language. This formalism has great flexibility, which we show by illustrating how phenomena such as competition between transcription factors and cooperativity are readily incorporated into these models. Using this framework, we also demonstrate that Michaelis-like functions, another class of cis-regulatory models, are a subset of the thermodynamic framework with specific assumptions. By recasting Michaelis-like functions as thermodynamic functions, we emphasize the relationship between these models and delineate the specific circumstances representable by each approach. Application of thermodynamic state ensemble models is likely to be an important tool in unraveling the physical basis of combinatorial cis-regulation and in generating formalisms that accurately predict gene expression from DNA sequence
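
The three components listed in this abstract (promoter states, binding constants, and whether each state transcribes) can be turned into a small calculation. The sketch below assumes a hypothetical promoter with one activator and RNA polymerase; the function names, the cooperativity factor `omega`, and all numeric values are illustrative assumptions and are not taken from the paper.

```python
def promoter_activity(states):
    """Probability that the promoter is in a transcribing state.

    states: list of (statistical_weight, is_transcribing) pairs.
    The probability is the summed weight of the transcribing states
    divided by the summed weight of all states (the partition function).
    """
    partition = sum(weight for weight, _ in states)
    active = sum(weight for weight, transcribing in states if transcribing)
    return active / partition


def activator_states(a, p, k_a, k_p, omega):
    """Four states of a promoter with one activator site and one polymerase site.

    a, p: concentrations of activator and polymerase (arbitrary units).
    k_a, k_p: binding constants for activator and polymerase.
    omega: cooperativity between bound activator and bound polymerase.
    A state is assumed to transcribe whenever polymerase is bound.
    """
    return [
        (1.0, False),                        # empty promoter
        (k_a * a, False),                    # activator only
        (k_p * p, True),                     # polymerase only
        (k_a * a * k_p * p * omega, True),   # both bound
    ]


without_activator = promoter_activity(activator_states(0.0, 1.0, 1.0, 1.0, 10.0))
with_activator = promoter_activity(activator_states(1.0, 1.0, 1.0, 1.0, 10.0))
```

With these toy numbers the activity rises from 1/2 without activator to 11/13 with it, showing how cooperativity enters only through the weight of the doubly bound state.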

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

A Primer on Regression Methods for Decoding cis-Regulatory Logic

Author: A Boorsma
A Kirmizis
A Orian
A Sandelin
A Tanay
AD Smith
BC Foat
BC Foat
C Niehrs
CS Chin
D Das
D Das
D Das
D Das
Debopriya Das
DH Nguyen
DM Wolf
E Segal
EM Conlon
F Gao
Fran Lewitter
GD Stormo
GD Stormo
H Pham
HJ Bussemaker
HJ Bussemaker
I Dubchak
I Minz
J Nardone
Joe W. Gray
L Wang
LA Pennacchio
M Blanchette
M Carey
M Djordjevic
M Levine
M Tompa
Matteo Pellegrini
MB Eisen
ML Bulyk
MQ Zhang
O Elemento
OG Berg
PT Spellman
R Bonneau
RA O'Flanagan
RA Veitia
RX Yu
RZ Wu
S Cokus
S Hannenhalli
S Keles
S Mukherjee
T Hastie
TH Kim
V Matys
W Wang
W Wang
W Zhong
WW Wasserman
Y Fu
Y Pilpel
Publication venue: Public Library of Science
Publication date: 01/01/2009

The rapidly emerging field of systems biology is helping us to understand the molecular determinants of phenotype on a genomic scale [1]. Cis-regulatory elements are major sequence-based determinants of biological processes in cells and tissues [2]. For instance, during transcriptional regulation, transcription factors (TFs) bind to very specific regions on the promoter DNA [2,3] and recruit the basal transcriptional machinery, which ultimately initiates mRNA transcription (Figure 1A).

Learning cis-Regulatory Elements from Omics Data

A vast amount of work over the past decade has shown that omics data can be used to learn cis-regulatory logic on a genome-wide scale [4-6], in particular by integrating sequence data with mRNA expression profiles. The most popular approach has been to identify over-represented motifs in promoters of genes that are coexpressed [4,7,8]. Though widely used, such an approach can be limiting for a variety of reasons. First, the combinatorial nature of gene regulation is difficult to explicitly model in this framework. Moreover, in many applications of this approach, expression data from multiple conditions are necessary to obtain reliable predictions. This can potentially limit the use of this method to only large data sets [9]. Although these methods can be adapted to analyze mRNA expression data from a pair of biological conditions, such comparisons are often confounded by the fact that primary and secondary response genes are clustered together, whereas only the primary response genes are expected to contain the functional motifs [10]. A set of approaches based on regression has been developed to overcome the above limitations [11-32]. These approaches have their foundations in certain biophysical aspects of gene regulation [26,33-35]. That is, the models are motivated by the expected transcriptional response of genes due to the binding of TFs to their promoters.
While such methods have gained popularity in the computational domain, they remain largely obscure to the broader biology community. The purpose of this tutorial is to bridge this gap. We will focus on transcriptional regulation to introduce the concepts. However, these techniques may be applied to other regulatory processes. We will consider only eukaryotes in this tutorial.

Crossref

Directory of Open Access Journals

PubMed Central

UNT Digital Library

A Linear Model for Transcription Factor Binding Affinity Prediction in Protein Binding Microarrays

Author: A Beyer
A Sandelin
A Seth
A Tanay
AA Philippakis
B Foat
B Ren
CE Lawrence
CO Pabo
DM Rocke
DS Johnson
DS Latchman
E Segal
E Wingender
FG Falkner
G Stolovitzky
GD Stormo
H Lähdesmäki
HA Ingraham
Harri Lähdesmäki
J Mintseris
J Van Helden
JE Darnell
Kirsti Laurila
M Barkett
M Kasowski
M Nykter
Mark Isalan
Matti Annala
Matti Nykter
MF Berger
MF Berger
MJ Solomon
ML Bulyk
ML Bulyk
OG Berg
P Agius
PV Benos
R Tibshirani
S Gupta
S Mukherjee
TL Bailey
V Litvak
V Orlando
X Chen
X Liu
XS Liu
Publication venue: Public Library of Science
Publication date: 01/01/2011

Protein binding microarrays (PBM) are a high throughput technology used to characterize protein-DNA binding. The arrays measure a protein's affinity toward thousands of double-stranded DNA sequences at once, producing a comprehensive binding specificity catalog. We present a linear model for predicting the binding affinity of a protein toward DNA sequences based on PBM data. Our model represents the measured intensity of an individual probe as a sum of the binding affinity contributions of the probe's subsequences. These subsequences characterize a DNA binding motif and can be used to predict the intensity of protein binding against arbitrary DNA sequences. Our method was the best performer in the Dialogue for Reverse Engineering Assessments and Methods 5 (DREAM5) transcription factor/DNA motif recognition challenge. For the DREAM5 bonus challenge, we also developed an approach for the identification of transcription factors based on their PBM binding profiles. Our approach for TF identification achieved the best performance in the bonus challenge.
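
The forward direction of such a linear model, where a probe's intensity is a sum of contributions from its subsequences, can be sketched in a few lines. The fixed subsequence length `k`, the function names, and the weights are hypothetical; the abstract does not say which subsequences the authors use or how the contributions are fitted, so only prediction from already-known weights is shown.

```python
def kmers(seq, k):
    """All overlapping length-k subsequences of seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def predict_intensity(probe, weights, k, baseline=0.0):
    """Predicted probe intensity as a sum of per-subsequence contributions.

    weights: dict mapping a k-mer to its affinity contribution
             (hypothetical values; unknown k-mers contribute zero).
    baseline: constant offset, an assumption added for illustration.
    """
    return baseline + sum(weights.get(kmer, 0.0) for kmer in kmers(probe, k))


# Toy weights for two 3-mers; "ACGT" contains both of them once.
weights = {"ACG": 2.0, "CGT": 1.5}
intensity = predict_intensity("ACGT", weights, k=3)
```

In practice the weights would be estimated from measured probe intensities, for example by least squares, so that the same dictionary can then score arbitrary DNA sequences.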

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Aaltodoc Publication Archive

Repression of Mitochondrial Translation, Respiration and a Metabolic Cycle-Regulated Gene, SLF1, by the Yeast Pumilio-Family Protein Puf3p

Author: A Breitkreutz
A Kudlicki
A Russo
AC Goldstrohm
AP Gerber
AR Albig
BC Foat
BP Tu
C Stark
E Eliyahu
F Devaux
G Lelandais
Gerald S. Shadel
GP Cereghino
I Hagen
J Cotney
J Ihmels
J Ihmels
Janine Santos
K Zarnack
LA Grivell
LC Lai
LJ Garcia-Rodriguez
M Carlson
MA Lebedeva
Marc Chatenay-Lapointe
MS Rodeheffer
MS Rodeheffer
ND Bonawitz
ND Bonawitz
R Mehta
RE Kellems
SG Sobel
SI Lee
SL Forsburg
SW Ho
T Quenault
TE Shutt
W Olivas
Y Deng
Y Pan
Y Saint-Georges
Z Liu
Publication venue: Public Library of Science
Publication date: 31/05/2011

Synthesis and assembly of the mitochondrial oxidative phosphorylation (OXPHOS) system requires genes located both in the nuclear and mitochondrial genomes, but how gene expression is coordinated between these two compartments is not fully understood. One level of control is through regulated expression of mitochondrial ribosomal proteins and other factors required for mitochondrial translation and OXPHOS assembly, which are all products of nuclear genes that are subsequently imported into mitochondria. Interestingly, this cadre of genes in budding yeast has in common a 3′-UTR element that is bound by the Pumilio family protein, Puf3p, and is coordinately regulated under many conditions, including during the yeast metabolic cycle. Multiple functions have been assigned to Puf3p, including promoting mRNA degradation, localizing nucleus-encoded mitochondrial transcripts to the outer mitochondrial membrane, and facilitating mitochondria-cytoskeletal interactions and motility. Here we show that Puf3p has a general repressive effect on mitochondrial OXPHOS abundance, translation, and respiration that does not involve changes in overall mitochondrial biogenesis and is largely independent of TORC1-mitochondrial signaling. We also identified the cytoplasmic translation factor Slf1p as a yeast metabolic cycle-regulated gene that is repressed by Puf3p at the post-transcriptional level and promotes respiration and extension of yeast chronological life span when over-expressed. Altogether, these results should facilitate future studies on which of the many functions of Puf3p is most relevant for regulating mitochondrial gene expression and the role of nuclear-mitochondrial communication in aging and longevity.

Public Library of Science (PLOS)

Crossref

PubMed Central

BayesPI - a new model to study protein-DNA interactions: a case study of condition-specific protein binding parameters for Yeast transcription factors

Author: A Delaunay
A Tanay
A Yarragudi
AE Tsong
AR Borneman
B Alberts
BC Foat
BE Bernstein
C Moorman
CK Lee
CT Harbison
CY Chen
D Das
D Das
D Mackay
DC Raitt
DS Fields
E Aurell
E Wingender
F Gao
F Ozsolak
G Tuteja
GL Bond
HG Roider
HJ Bussemaker
HK Tsai
I Nabney
J Deckert
J Lee
J Wang
J Wang
J Wang
J Zeitlinger
JB Kinney
JM Bland
JM Cherry
Junbai Wang
K Murphy
KD MacIsaac
L Jen-Jacobson
L Narlikar
L Segal
M Djordjevic
MJ Buck
ML Bulyk
Morigen
MP Ryan
O Sertil
OG Berg
PV Benos
Q Zhou
RD Kornberg
RF Lascaris
RH Morse
S Ghaemmaghami
S Keles
SF Gull
TE Cheatham 3rd
TK Man
TN Mavrich
U Gerland
U Gerland
VB Zhurkin
W Gorner
W Lee
WK Olson
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: We have incorporated Bayesian model regularization with biophysical modeling of protein-DNA interactions, and of genome-wide nucleosome positioning to study protein-DNA interactions, using a high-throughput dataset. The newly developed method (BayesPI) includes the estimation of transcription factor (TF) binding energy matrices, the computation of binding affinity of a TF target site and the corresponding chemical potential. Results: The method was successfully tested on synthetic ChIP-chip datasets and real yeast ChIP-chip experiments. Subsequently, it was used to estimate condition-specific and species-specific protein-DNA interaction for several yeast TFs. Conclusion: The results revealed that the modification of the protein binding parameters and the variation of the individual nucleotide affinity in either recognition or flanking sequences occurred under different stresses and in different species. The findings suggest that such modifications may be adaptive and play roles in the formation of the environment-specific binding patterns of yeast TFs and in the divergence of TF binding sites across the related yeast species.

Helsebibliotekets Research Archive

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene-induced Signaling

Author: A Ceol
A Gaulton
A Ghazalpour
A Lan
AB Heimberger
Adam Labadorf
AE Kel
B Aranda
B Hanstein
B Langmead
B Mukherjee
B Schwanhäusser
BC Foat
BC Foat
BC Foat
C Kim
C Knox
C Liu
C Ritz
C Stark
C-L Tso
Candace R. Chouinard
CD Andl
CE Pelloski
CM Klinge
CM-E Sauvageot
CS Ross-Innes
CT Harbison
D Guo
D Hanahan
D Hanahan
D Yin
David C. Clarke
DB Ramnarain
Douglas A. Lauffenburger
DP Schunemann
DT Odom
E Cerami
E Eden
E Galanis
E Lee
E Lundberg
E Yeger-Lotem
ER Levin
Ernest Fraenkel
F Markowetz
F Yamoutpour
G Cuellar Partida
G Ling
GC Kabat
GD Bader
GK Smyth
H Dong
H Johnson
H Shao
H-W Lo
HI Robins
HS Huang
I Ljubić
I Thiele
I Ulitsky
IY Eyüpoglu
JM Gil
JR Hesselberth
JS Lewis-Wambi
JV Olsen
KD MacIsaac
KH Emami
KV Lu
L Björnström
L Choy
LJ Zhu
M Bansal
M Lepourcelet
MD Robinson
MJ Clark
MM Feldkamp
MS Carro
MW Pedersen
MW Pedersen
N de la Iglesia
P Flicek
P Hallock
P Pu
P-C Leow
PH Huang
PJ Sabo
Q Li
R Bonavia
R Chen
R Kalluri
R Nishikawa
R Pique-Regi
R Schiff
R Zeineldin
RGW Verhaak
RH Shoemaker
RM Hallett
RM Myers
S Bamford
S Imarisio
S Kerrien
S Razick
S Schinner
S-SC Huang
SA Prigent
Sara J. C. Gosline
Shao-shan Carol Huang
SP Panicker
SZ Usmani
T Nagashima
T Takano
TS Keshava Prasad
V Matys
V Milano
W Couldwell
W Lu
W Wei
W Wick
William Gordon
William Stafford Noble
X Liu
Y Benjamini
Y Narita
Y Ning
Y Wang
Y Zhang
Z Wu
Publication venue: Public Library of Science (PLoS)
Publication date: 01/03/2012

Cellular signal transduction generally involves cascades of post-translational protein modifications that rapidly catalyze changes in protein-DNA interactions and gene expression. High-throughput measurements are improving our ability to study each of these stages individually, but do not capture the connections between them. Here we present an approach for building a network of physical links among these data that can be used to prioritize targets for pharmacological intervention. Our method recovers the critical missing links between proteomic and transcriptional data by relating changes in chromatin accessibility to changes in expression and then uses these links to connect proteomic and transcriptome data. We applied our approach to integrate epigenomic, phosphoproteomic and transcriptome changes induced by the variant III mutation of the epidermal growth factor receptor (EGFRvIII) in a cell line model of glioblastoma multiforme (GBM). To test the relevance of the network, we used small molecules to target highly connected nodes implicated by the network model that were not detected by the experimental data in isolation and we found that a large fraction of these agents alter cell viability. Among these are two compounds, ICG-001, targeting CREB binding protein (CREBBP), and PKF118–310, targeting β-catenin (CTNNB1), which have not been tested previously for effectiveness against GBM. At the level of transcriptional regulation, we used chromatin immunoprecipitation sequencing (ChIP-Seq) to experimentally determine the genome-wide binding locations of p300, a transcriptional co-regulator highly connected in the network. Analysis of p300 target genes suggested its role in tumorigenesis. 
We propose that this general method, in which experimental measurements are used as constraints for building regulatory networks from the interactome while taking into account noise and missing data, should be applicable to a wide range of high-throughput datasets. National Science Foundation (U.S.) (DB1-0821391); National Institutes of Health (U.S.) (Grant U54-CA112967); National Institutes of Health (U.S.) (Grant R01-GM089903); National Institutes of Health (U.S.) (P30-ES002109)

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

FigShare