Search CORE

21 research outputs found

High Resolution Models of Transcription Factor-DNA Affinities Improve In Vitro and In Vivo Binding Predictions

Author: Aaron Arvey
C Kissinger
C Leslie
C Zhu
Christina Leslie
CT Harbison
D Fulton
DE Newburger
E Bolotin
E Fraenkel
G Badis
G Badis
G Pavesi
MF Berger
O Wallerman
P Kharchenko
Phaedra Agius
R Kuang
S Georgiev
Uwe Ohler
William Chang
William Stafford Noble
WS Noble
X Chen
X Chen
XS Liu
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Accurately modeling the DNA sequence preferences of transcription factors (TFs), and using these models to predict in vivo genomic binding sites for TFs, are key pieces in deciphering the regulatory code. These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices (PSSMs), which may match large numbers of sites and produce an unreliable list of target genes. Recently, protein binding microarray (PBM) experiments have emerged as a new source of high resolution data on in vitro TF binding specificities. PBM data has been analyzed either by estimating PSSMs or via rank statistics on probe intensities, so that individual sequence patterns are assigned enrichment scores (E-scores). This representation is informative but unwieldy because every TF is assigned a list of thousands of scored sequence patterns. Meanwhile, high-resolution in vivo TF occupancy data from ChIP-seq experiments is also increasingly available. We have developed a flexible discriminative framework for learning TF binding preferences from high resolution in vitro and in vivo data. We first trained support vector regression (SVR) models on PBM data to learn the mapping from probe sequences to binding intensities. We used a novel -mer based string kernel called the di-mismatch kernel to represent probe sequence similarities. The SVR models are more compact than E-scores, more expressive than PSSMs, and can be readily used to scan genomics regions to predict in vivo occupancy. Using a large data set of yeast and mouse TFs, we found that our SVR models can better predict probe intensity than the E-score method or PBM-derived PSSMs. Moreover, by using SVRs to score yeast, mouse, and human genomic regions, we were better able to predict genomic occupancy as measured by ChIP-chip and ChIP-seq experiments. Finally, we found that by training kernel-based models directly on ChIP-seq data, we greatly improved in vivo occupancy prediction, and by comparing a TF's in vitro and in vivo models, we could identify cofactors and disambiguate direct and indirect binding

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

lobChIP: from cells to sequencing ready ChIP libraries in a single day

Author: Claes Wadelius
DS Johnson
ENCODE Project Consortium
H Kilpinen
H Li
H Li
H O’Geen
Helena Nord
JD Nelson
Lisa Borghini
M Garber
M Motallebipour
Madhusudhan Bysani
MJ Solomon
O Wallerman
Ola Wallerman
P Machanick
S Aldridge
S Frietze
T Ye
WA Hoeijmakers
WC Gasper
X Peng
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

ChIP-seq in steatohepatitis and normal liver tissue identifies candidate disease mechanisms related to progression to cancer

Author: A Rada-Iglesias
A-RN Zekri
AP Boyle
AV Segrè
BS Shastry
Claes Wadelius
CW Wu
D Boison
DW Huang
E Eden
EK Speliotes
H Li
J Nsengimana
Jan Komorowski
JC Cohen
JM Lin
JS Kooner
K Wang
K Zatloukal
Kurt Zatloukal
L Lee
LD Ward
M Kasowski
M Motallebipour
M Yoneda
MA Gyamfi
MA Patil
Madhusudhan Bysani
MG Guenther
MR Lucey
O Delpuech
Ola Wallerman
P Cingolani
P Pajukanta
R Karlić
S Enroth
S Romeo
S Sookoian
SJR Meex
SS Baker
Susanne Bornelöv
SY Neo
TJP Hubbard
X Chen
XR Xu
Y Kurokawa
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Clustered ChIP-Seq-defined transcription factor binding sites and histone modifications map distinct classes of regulatory elements

Author: A Barski
A Kanhere
A Marson
A Pekowska
A Rada-Iglesias
A Visel
AP Boyle
B Li
BE Bernstein
BE Bernstein
CM Koch
CZ Zang
D Karolchik
DS Johnson
E Birney
E Lieberman-Aiden
Finn Drabløs
G Hon
G Hon
GA Wray
GE Zentner
H Xu
H Yu
J Ernst
J Kim
JE Phillips
JM Vaquerizas
KJ Gaulton
KJ Won
KJ Won
KL MacQuarrie
L Ooi
LA Pennacchio
M Blanchette
M Bulger
M Gupta
M Guttman
MA Nobrega
MB Rye
MC Tsai
MH Kagey
Morten Rye
MP Creyghton
ND Heintzman
ND Heintzman
O Wallerman
PJ Farnham
PJ Park
PV Kharchenko
PV Kharchenko
Pål Sætrom
Q Zhou
R Jothi
S Cuddapah
S Roy
T Kouzarides
T Li
T Ravasi
TH Kim
TK Kim
Tony Håndstad
TS Mikkelsen
V Gotea
W Niu
X Chen
Y Zhang
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Transcription factor binding to DNA requires both an appropriate binding element and suitably open chromatin, which together help to define regulatory elements within the genome. Current methods of identifying regulatory elements, such as promoters or enhancers, typically rely on sequence conservation, existing gene annotations or specific marks, such as histone modifications and p300 binding methods, each of which has its own biases. Results Herein we show that an approach based on clustering of transcription factor peaks from high-throughput sequencing coupled with chromatin immunoprecipitation (Chip-Seq) can be used to evaluate markers for regulatory elements. We used 67 data sets for 54 unique transcription factors distributed over two cell lines to create regulatory element clusters. By integrating the clusters from our approach with histone modifications and data for open chromatin, we identified general methylation of lysine 4 on histone H3 (H3K4me) as the most specific marker for transcription factor clusters. Clusters mapping to annotated genes showed distinct patterns in cluster composition related to gene expression and histone modifications. Clusters mapping to intergenic regions fall into two groups either directly involved in transcription, including miRNAs and long noncoding RNAs, or facilitating transcription by long-range interactions. The latter clusters were specifically enriched with H3K4me1, but less with acetylation of lysine 27 on histone 3 or p300 binding. Conclusion By integrating genomewide data of transcription factor binding and chromatin structure and using our data-driven approach, we pinpointed the chromatin marks that best explain transcription factor association with different regulatory elements. Our results also indicate that a modest selection of transcription factors may be sufficient to map most regulatory elements in the human genome.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

NORA - Norwegian Open Research Archives

MinION analysis and reference consortium: Phase 2 data release and analysis of R9.0 chemistry

Author: Birney E
Brown BL
Eccles DA
Ip CLC
Jain M
Jansen HJ
Leggett RM
Loose M
Malla S
Minion Analysis And Reference Consortium
O'Grady J
Olsen HE
Snutch TP
Tyson JR
Wallerman O
Zalunin V
Publication venue: F1000Research
Publication date: 01/01/2017
Field of study

Long-read sequencing is rapidly evolving and reshaping the suite of opportunities for genomic analysis. For the MinION in particular, as both the platform and chemistry develop, the user community requires reference data to set performance expectations and maximally exploit third-generation sequencing. We performed an analysis of MinION data derived from whole genome sequencing of Escherichiacoli K-12 using the R9.0 chemistry, comparing the results with the older R7.3 chemistry.We computed the error-rate estimates for insertions, deletions, and mismatches in MinION reads.Run-time characteristics of the flow cell and run scripts for R9.0 were similar to those observed for R7.3 chemistry, but with an 8-fold increase in bases per second (from 30 bps in R7.3 and SQK-MAP005 library preparation, to 250 bps in R9.0) processed by individual nanopores, and less drop-off in yield over time. The 2-dimensional ("2D") N50 read length was unchanged from the prior chemistry. Using the proportion of alignable reads as a measure of base-call accuracy, 99.9% of "pass" template reads from 1-dimensional ("1D") experiments were mappable and ~97% from 2D experiments. The median identity of reads was ~89% for 1D and ~94% for 2D experiments. The total error rate (miscall + insertion + deletion ) decreased for 2D "pass" reads from 9.1% in R7.3 to 7.5% in R9.0 and for template "pass" reads from 26.7% in R7.3 to 14.5% in R9.0.These Phase 2 MinION experiments serve as a baseline by providing estimates for read quality, throughput, and mappability. The datasets further enable the development of bioinformatic tools tailored to the new R9.0 chemistry and the design of novel biological applications for this technology.K: thousand, Kb: kilobase (one thousand base pairs), M: million, Mb: megabase (one million base pairs), Gb: gigabase (one billion base pairs)

Directory of Open Access Journals

Oxford University Research Archive

Transcription factor ZBED6 affects gene expression, proliferation, and cell death in pancreatic beta cells

Author: A. Ameur
Bernardo
Bozoky
Calderari
Inoue
L. Andersson
L. Jiang
Lu
Markljung
N. Welsh
O. Wallerman
Pirot
R. K. Gupta
Swales
U. Engstrom
Unger
Van Laere
Wu
X. Wang
Y. Qi
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date
Field of study

Crossref

Transect relascope sampling for assessing coarse woody debris: The case of a π/2

Author: Backman A.
Cochran W. G.
Coxeter H. S. M.
Lindgren O.
Lämås T.
Matérn B.
Penttinen A.
Schmid P.
Schreuder H. T.
Stähl G.
Ståhl G.
Von Segebaden G.
Wallerman J.
Warren W. G.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Binding sites for metabolic disease related transcription factors inferred at base pair resolution by chromatin immunoprecipitation and genomic microarrays

Author: Ameur A
Andrews R
Carter N
Clelland G
Dhami P
Dovey OM
Dunham I
Ellis PD
Enroth S
James K
Koch C
Komorowski J
Langford C
Ponten F
Rada-Iglesias A
Vetrie D
Wadelius C
Wallerman O
Wester K
Wilcox S
Wraight VL
Publication venue
Publication date: 01/01/2005
Field of study

Enlighten

Allele-specific transcription factor binding to common and rare variants associated with disease and gene expression

Author: A Ameur
A Keinan
A. D. Johnson
Alexander H Li
AT Funding
Claes Wadelius
Dominique J. Verlaan
EE Schadt
Elisa Närvä
Emelie Wallén Arzt
Fabian Grubert
Gang Pan
Helena Nord
HW Mages
Ingegerd Elvers
J Rozowsky
JJ Crowley
JZ Liu
K Musunuru
Kerstin Lindblad Toh
Lars Rönnblom
M Kasowski
M Mayrhofer
M Motallebipour
MA DePristo
Maija-Leena Eloranta
Marco Cavalli
MT Maurano
MT Maurano
O Corradin
O Wallerman
Ola Wallerman
Olof Berggren
P Kheradpour
Q Huang
Richard Cowper-Sal·lari
Sebastian M. Waszak
T Lappalainen
TE Reddy
Wenqing Fu
Y Liu
Y Okada
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Regulatory changes in pterin and carotenoid genes underlie balanced color polymorphisms in the wall lizard

Author: Afonso S.
Andersson L.
Andrade P.
Bellati A.
Bosakova Z.
Brejcha J.
Bunikis I.
Carneiro M.
Carretero M. A.
de Lanuza G. P. I.
Feiner N.
Font E.
Marsik P.
Pauperio F.
Pellitteri Rosa D.
Pereira P.
Pinho C.
Rubin C. -J.
Sabatino S. J.
Salvi D.
Soler L.
Uller T.
Wallerman O.
While G. M.
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2019
Field of study

Reptiles use pterin and carotenoid pigments to produce yellow, orange, and red colors. These conspicuous colors serve a diversity of signaling functions, but their molecular basis remains unresolved. Here, we show that the genomes of sympatric color morphs of the European common wall lizard (Podarcis muralis), which differ in orange and yellow pigmentation and in their ecology and behavior, are virtually undifferentiated. Genetic differences are restricted to two small regulatory regions near genes associated with pterin [sepiapterin reductase (SPR)] and carotenoid [beta-carotene oxygenase 2 (BCO2)] metabolism, demonstrating that a core gene in the housekeeping pathway of pterin biosynthesis has been coopted for bright coloration in reptiles and indicating that these loci exert pleiotropic effects on other aspects of physiology. Pigmentation differences are explained by extremely divergent alleles, and haplotype analysis revealed abundant transspecific allele sharing with other lacertids exhibiting color polymorphisms. The evolution of these conspicuous color ornaments is the result of ancient genetic variation and cross-species hybridization

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia