CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?

Author: AP Bird
C Jiang
D Takai
H Kawaji
IP Ioshikhes
L Han
Leng Han
M Gardiner-Garden
M Hackenberg
M Weber
P Carninci
Zhongming Zhao
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: CpG islands (CGIs), clusters of CpG dinucleotides in GC-rich regions, are often located in the 5′ end of genes and considered gene markers. Hackenberg et al. (2006) recently developed a new algorithm, CpGcluster, which uses a completely different mathematical approach from previous traditional algorithms. Their evaluation suggests that CpGcluster provides a much more efficient approach to detecting functional clusters or islands of CpGs. Results: We systematically compared CpGcluster with the traditional algorithm by Takai and Jones (2002). Our comparisons of (1) the number of islands versus the number of genes in a genome, (2) the distribution of islands in different genomic regions, (3) island length, (4) the distance between two neighboring islands, and (5) methylation status suggest that Takai and Jones' algorithm is overall more appropriate for identifying promoter-associated islands of CpGs in vertebrate genomes. Conclusion: The generation of genome sequence and DNA methylation data is expected to accelerate greatly. The information in this study is important for its extensive utility in gene feature analysis and epigenomics including gene prediction and methylation chip design in different genomes.
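
As a rough illustration of the kind of criteria a traditional CpG island algorithm applies, the sketch below scores a single candidate sequence by length, G+C fraction, and observed/expected CpG ratio. The default thresholds (at least 500 bp, G+C of at least 55%, observed/expected CpG of at least 0.65) are the values commonly cited for Takai and Jones (2002); they are not stated in the abstract above and are an assumption here. The function names are hypothetical, and the sketch does not reproduce the window scanning and merging steps of the published algorithm, nor the distance-based approach of CpGcluster.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CpG * N) / (#C * #G)."""
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    if n_c == 0 or n_g == 0:
        return 0.0
    n_cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    return n_cpg * len(seq) / (n_c * n_g)


def looks_like_cpg_island(seq: str,
                          min_len: int = 500,     # assumed threshold
                          min_gc: float = 0.55,   # assumed threshold
                          min_oe: float = 0.65) -> bool:  # assumed threshold
    """Check one candidate region against the three criteria."""
    return (len(seq) >= min_len
            and gc_fraction(seq) >= min_gc
            and cpg_obs_exp(seq) >= min_oe)
```

Lowering the defaults (for example `min_len=200`, `min_gc=0.5`, `min_oe=0.6`) gives a more permissive definition, which is one reason island counts differ so much between methods.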

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

VCU Scholars Compass

G+C content dominates intrinsic nucleosome occupancy

Author: A Barbic
A Groth
A Thastrom
A Valouev
AV Sivolob
B Efron
B Li
B Suter
BE Bernstein
CK Lee
CR Calladine
Desiree Tillo
E Segal
EA Sekinger
F Ozsolak
GC Yuan
H Cao
HE Peckham
HR Drew
I Brukner
I Ioshikhes
IP Ioshikhes
JC Dohm
JP Thiery
JV Ponomarenko
K Luger
M Gardiner-Garden
MY Tolstorukov
N Kaplan
PA Rice
R Tibshirani
S Aerts
S Schwartz
SC Satchwell
Timothy R Hughes
V Miele
W Lee
Y Field
YH Wang
Publication venue: BioMed Central
Publication date: 01/12/2009

Background: The relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and sequence properties that influence DNA structure, including base content. It was recently shown by Kaplan et al. that a probabilistic model using composition of all 5-mers within a nucleosome-sized tiling window accurately predicts intrinsic nucleosome occupancy across an entire genome in vitro. However, the model is complicated, and it is not clear which specific DNA sequence properties are most important for intrinsic nucleosome-forming preferences. Results: We find that a simple linear combination of only 14 simple DNA sequence attributes (G+C content, two transformations of dinucleotide composition, and the frequency of eleven 4-bp sequences) explains nucleosome occupancy in vitro and in vivo in a manner comparable to the Kaplan model. G+C content and frequency of AAAA are the most important features. G+C content is dominant, alone explaining ~50% of the variation in nucleosome occupancy in vitro. Conclusions: Our findings provide a dramatically simplified means to predict and understand intrinsic nucleosome occupancy. G+C content may dominate because it both reduces frequency of poly-A-like stretches and correlates with many other DNA structural characteristics. Since G+C content is enriched or depleted at many types of features in diverse eukaryotic genomes, our results suggest that variation in nucleotide composition may have a widespread and direct influence on chromatin structure.
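
The two features the abstract singles out, G+C content and AAAA frequency, are simple to compute per window. The sketch below is illustrative only: the 147 bp window (the usual length of nucleosome core DNA), the step size, and the function name are assumptions, since the abstract says only "nucleosome-sized". It computes the two features and does not implement the authors' 14-attribute linear model or its coefficients.

```python
def window_features(seq: str, window: int = 147, step: int = 10):
    """Return (start, gc_fraction, aaaa_count) for each tiling window.

    window=147 and step=10 are illustrative assumptions, not values
    taken from the paper.
    """
    seq = seq.upper()
    features = []
    for start in range(0, len(seq) - window + 1, step):
        w = seq[start:start + window]
        gc = (w.count("G") + w.count("C")) / window
        # Count overlapping occurrences of AAAA within the window.
        aaaa = sum(1 for i in range(window - 3) if w[i:i + 4] == "AAAA")
        features.append((start, gc, aaaa))
    return features
```

Each tuple could then serve as a row of predictors in an ordinary least-squares fit against measured occupancy, which is the general shape of the approach the abstract describes.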

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Features of mammalian microRNA promoters emerge from polymerase II chromatin immunoprecipitation data

Author: A Bird
A Marson
A Rodriguez
A Sandelin
AP Bird
Arindam Bhattacharjee
Ben Gordon
CD Schmid
Christopher K. Patil
D Karolchik
David L. Corcoran
DL Corcoran
DP Bartel
DS Prestridge
E Wingender
F Ozsolak
GD Stormo
GG Loots
GM Borchert
H Wakaguri
HJ Bussemaker
HK Saini
I Rigoutsos
IP Ioshikhes
J Taylor
J van Helden
K Woods
KD Taganov
Kusum V. Pandit
M Gardiner-Garden
M Megraw
MJ Buck
MP Brown
N Liu
Naftali Kaminski
NJ Martinez
O Chapelle
P Carninci
P Jin
Panayiotis V. Benos
R Gangal
R Shalgi
RM Kuhn
S Baskerville
S Fujita
S Mahony
SJ Cooper
T Abeel
T Thum
T Wang
TA Down
U Ohler
WJ Kent
X Zhao
X Zhou
Y Lee
Publication venue: Public Library of Science (PLoS)
Publication date: 01/04/2009

Background: MicroRNAs (miRNAs) are short, non-coding RNA regulators of protein coding genes. miRNAs play a very important role in diverse biological processes and various diseases. Many algorithms are able to predict miRNA genes and their targets, but their transcription regulation is still under investigation. It is generally believed that intragenic miRNAs (located in introns or exons of protein coding genes) are co-transcribed with their host genes and that most intergenic miRNAs are transcribed from their own RNA polymerase II (Pol II) promoters. However, the length of the primary transcripts and the promoter organization are currently unknown. Methodology: We performed Pol II chromatin immunoprecipitation (ChIP)-chip using a custom array surrounding regions of known miRNA genes. To identify the true core transcription start sites of the miRNA genes we developed a new tool (CPPP). We showed that miRNA genes can be transcribed from promoters located several kilobases away and that their promoters share the same general features as those of protein coding genes. Finally, we found evidence that as many as 26% of the intragenic miRNAs may be transcribed from their own unique promoters. Conclusion: miRNA promoters have similar features to those of protein coding genes, but miRNA transcript organization is more complex. © 2009 Corcoran et al.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt

Dissecting Nucleosome Free Regions by a Segmental Semi-Markov Model

Author: A Gunjan
A Krogh
AD Basehoar
BE Bernstein
CK Lee
DK Pokholok
E Segal
EA Sekinger
Enrico Scalas
F Ozsolak
F Xu
FC Holstege
Feng Xu
G-C Yuan
H Ji
I Albert
IP Ioshikhes
JH Wright
JL Parrou
K Sakaki
KD Fascher
Ker-Chau Li
L David
LR Rabiner
M Ostendorf
MA Newton
Michael Grunstein
MS Lee
R Durbin
RD Kornberg
RH Morse
S Cawley
SR Eddy
T Kim
W Lee
W Li
WE Johnson
Wei Sun
Wei Xie
X Mai
Publication venue: Public Library of Science
Publication date: 01/01/2009

Background: Nucleosome free regions (NFRs) play important roles in diverse biological processes including gene regulation. A genome-wide quantitative portrait of each individual NFR, with their starting and ending positions, lengths, and degrees of nucleosome depletion is critical for revealing the heterogeneity of gene regulation and chromatin organization. By averaging nucleosome occupancy levels, previous studies have identified the presence of NFRs in the promoter regions across many genes. However, evaluation of the quantitative characteristics of individual NFRs requires an NFR calling method. Methodology: In this study, we propose a statistical method to identify the patterns of NFRs from a genome-wide measurement of nucleosome occupancy. This method is based on an appropriately designed segmental semi-Markov model, which can capture each NFR pattern and output its quantitative characterizations. Our results show that the majority of the NFRs are located in intergenic regions or promoters with a length of about 400-600 bp and varying degrees of nucleosome depletion. Our quantitative NFR mapping allows for an investigation of the relative impacts of transcription machinery and DNA sequence in evicting histones from NFRs. We show that while both factors have significant overall effects, their specific contributions vary across different subtypes of NFRs. Conclusion: The emphasis of our approach on the variation rather than the consensus of nucleosome free regions sets the tone for enabling the exploration of many subtler dynamic aspects of chromatin biology.

Crossref

Directory of Open Access Journals

PubMed Central

Carolina Digital Repository

ScholarBank@NUS

The Set2/Rpd3S Pathway Suppresses Cryptic Transcription without Regard to Gene Length or Transcription Frequency

Author: AA Joshi
Andrew B. Nobel
Andrey A. Shabalin
B Li
B Rao
BD Strahl
Bhargavi Rao
Brian D. Strahl
C-K Lee
CD Kaplan
Colin R. Lickwar
DK Pokholok
DWK Andrews
E Segal
FCP Holstege
G-C Yuan
GJ Hogan
H Cedar
I Whitehouse
IP Ioshikhes
J Bai
JA Knezetic
Jason D. Lieb
JD Anderson
JD Lieb
KO Kizer
Laura Rusche
M Huarte
M-C Keogh
MJ Carrozza
ML Youdell
NJ Krogan
PB Mason
T Xiao
V Cheung
X Liu
Publication venue: Public Library of Science
Publication date: 01/01/2009

In cells lacking the histone methyltransferase Set2, initiation of RNA polymerase II transcription occurs inappropriately within the protein-coding regions of genes, rather than being restricted to the proximal promoter. It was previously reported that this “cryptic” transcription occurs preferentially in long genes, and in genes that are infrequently transcribed. Here, we mapped the transcripts produced in an S. cerevisiae strain lacking Set2, and applied rigorous statistical methods to identify sites of cryptic transcription at high resolution. We find that suppression of cryptic transcription occurs independently of gene length or transcriptional frequency. Our conclusions differ from those reported previously because we obtained a higher-resolution dataset, we accounted for the fact that gene length and transcriptional frequency are not independent variables, and we accounted for several ascertainment biases that make cryptic transcription easier to detect in long, infrequently transcribed genes. These new results and conclusions have implications for many commonly used genomic analysis approaches, and for the evolution of high-fidelity RNA polymerase II transcriptional initiation in eukaryotes.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Carolina Digital Repository

Nucleosome-coupled expression differences in closely-related species

Author: AL Olins
AM Tsankov
BE Bernstein
C Koch
Corey Nislow
CT Harbison
DE Schones
E Segal
EA Sekinger
F Ozsolak
G Badis
G Zhu
GC Yuan
GJ Hogan
H Li
I Tirosh
IP Ioshikhes
JD Hughes
KA Zawadzki
Kyle Tsui
L Bai
Maitreya J Dunham
Marinella Gebbia
N Kaplan
N Morohashi
O Elemento
O Troyanskaya
OC Martin
Olga G Troyanskaya
P Cliften
P Clifton
RD Kornberg
S Mahony
S Shivaswamy
S Washietl
SW Doniger
T Owen-Hughes
T Pramila
Victoria Yao
W Lee
X Liu
Y Guan
Y Zhang
Yuanfang Guan
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Genome-wide nucleosome occupancy is negatively related to the average level of transcription factor motif binding based on studies in yeast and several other model organisms. The degree to which nucleosome-motif interactions relate to phenotypic changes across species is, however, unknown. Results: We address this challenge by generating nucleosome positioning and cell cycle expression data for Saccharomyces bayanus and show that differences in nucleosome occupancy reflect cell cycle expression divergence between two yeast species, S. bayanus and S. cerevisiae. Specifically, genes with nucleosome-depleted MBP1 motifs upstream of their coding sequence show periodic expression during the cell cycle, whereas genes with nucleosome-shielded motifs do not. In addition, conserved cell cycle regulatory motifs across these two species are more nucleosome-depleted compared to those that are not conserved, suggesting that the degree of conservation of regulatory sites varies, and is reflected by nucleosome occupancy patterns. Finally, many changes in cell cycle gene expression patterns across species can be correlated to changes in nucleosome occupancy on motifs (rather than to the presence or absence of motifs). Conclusions: Our observations suggest that alteration of nucleosome occupancy is a previously uncharacterized feature related to the divergence of cell cycle expression between species.

University of Toronto Research Repository

Princeton University Open Access Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Transcriptional interaction-assisted identification of dynamic nucleosome positioning

Author: A Barski
A Flaus
A Tanay
AP Gasch
Caisheng He
CK Lee
CT Harbison
DE Schones
DK Pokholok
E Segel
EA Sekinger
F Ozsolak
FCP Holstege
G Yuan
GC Yuan
GJ Hogan
HE Pechham
HK Tsai
I Albert
I Simon
I Whitehouse
IB Dodd
IP Ioshikhes
J Mellor
J Widom
Jiang Wang
Jihua Feng
JL Derisi
K Luger
L Narlikar
M Ashburner
NM Luscombe
P Lu
PT Spellman
Qian Xiang
R Jansen
R Santoro
RD Kornberg
RT Kamakaka
S Chu
S Ghaemmaghami
S Henikoff
S Shivaswamy
SL Berger
T Kouzarides
TJ Richmond
W Lee
X Liu
Xianhua Dai
Yangyang Deng
Zhiming Dai
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Nucleosomes regulate DNA accessibility and therefore play a central role in transcription control. Computational methods have been developed to predict static nucleosome positions from DNA sequences, but nucleosomes are dynamic in vivo. Results: Motivated by our observation that transcriptional interaction is discriminative information for nucleosome occupancy, we developed a novel computational approach to identify dynamic nucleosome positions at promoters by combining transcriptional interaction and genomic sequence information. Our approach successfully identified experimentally determined nucleosome positioning dynamics available in three cellular conditions, and significantly improved the prediction accuracy which is based on sequence information alone. We then applied our approach to various cellular conditions and established a comprehensive landscape of dynamic nucleosome positioning in yeast. Conclusion: Analysis of this landscape revealed that the majority of nucleosome positions are maintained during most conditions. However, nucleosome occupancy at most promoters fluctuates with the corresponding gene expression level and is reduced specifically at the phase of peak expression. Further investigation into properties of nucleosome occupancy identified two gene groups associated with distinct modes of nucleosome modulation. Our results suggest that both the intrinsic sequence and regulatory proteins modulate nucleosomes in an altered manner.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Structural constraints revealed in consistent nucleosome positions in the genome of S. cerevisiae

Author: A Siepel
A Stein
A Thastrom
A Travers
A Valouev
AB Cohanim
AB Lantermann
Christoforos Nikolaou
CT Harbison
DS Goodsell
E Segal
F Battistini
G Arents
G Babbitt
GC Yuan
GP Vicent
H Tilgner
HE Peckham
HR Chung
HR Drew
HR Widlund
I Brukner
IP Ioshikhes
J Feng
J Mellor
JJ Hayes
M Caserta
MG Guenther
MG Munteanu
Miguel Beato
ML Eaton
N Bellora
N Kaplan
N Ramsay
P Milani
R Kiyama
R Kornberg
R Ogawa
RD Kornberg
RM Fraser
Roderic Guigó
RS Edayathumangalam
S Belikov
S Henikoff
S Shivaswamy
S Yin
SM Johnson
SM Reynolds
Sonja Althammer
T Bettecken
TJ Richmond
TN Mavrich
W Lee
W Mobius
Y Zhang
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Recent advances in the field of high-throughput genomics have rendered possible the performance of genome-scale studies to define the nucleosomal landscapes of eukaryote genomes. Such analyses are aimed towards providing a better understanding of the process of nucleosome positioning, for which several models have been suggested. Nevertheless, questions regarding the sequence constraints of nucleosomal DNA and how they may have been shaped through evolution remain open. In this paper, we analyze in detail different experimental nucleosome datasets with the aim of providing a hypothesis for the emergence of nucleosome-forming sequences. Results: We compared the complete sets of nucleosome positions for the budding yeast (Saccharomyces cerevisiae) as defined in the output of two independent experiments with the use of two different experimental techniques. We found that <10% of the experimentally defined nucleosome positions were consistently positioned in both datasets. This subset of well-positioned nucleosomes, when compared with the bulk, was shown to have particular properties at both sequence and structural levels. Consistently positioned nucleosomes were also shown to occur preferentially in pairs of dinucleosomes, and to be surprisingly less conserved compared with their adjacent nucleosome-free linkers. Conclusion: Our findings may be combined into a hypothesis for the emergence of a weak nucleosome-positioning code. According to this hypothesis, consistent nucleosomes may be partly guided by nearby nucleosome-free regions through statistical positioning. Once established, a set of well-positioned consistent nucleosomes may impose secondary constraints that further shape the structure of the underlying DNA. We were able to capture these constraints through the application of a recently introduced structural property that is related to the symmetry of DNA curvature. Furthermore, we found that both consistently positioned nucleosomes and their adjacent nucleosome-free regions show an increased tendency towards conservation of this structural feature.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UPF Digital Repository

Tight associations between transcription promoter type and epigenetic variation in histone positioning and modification

Author: A Barski
A Kratz
A Lebrun
A Roopra
AE Smith
AJ Bannister
Anton Kratz
BE Bernstein
BR Cairns
C Bock
C Jiang
C Jin
CA Davey
CR Vakoc
D Karolchik
DE Schones
E Segal
F Ozsolak
GC Yuan
H Kawaji
H Suzuki
I Albert
I Tirosh
IP Ioshikhes
J Ponjavic
J Wang
JE Butler
JJ Wyrick
K Luger
KJ Brayer
Masaru Tomita
MC Frith
MY Tolstorukov
Nozomu Yachie
P Bucher
P Carninci
PJ Park
R Karlic
R Lister
RA Coleman
RD Kornberg
Rintaro Saito
RS Illingworth
Ryu Ogawa
T Shiraki
Tadasu Nozaki
TN Mavrich
TY Roh
VR Ramirez-Carrozzi
Y Zhang
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Transcription promoters are fundamental genomic cis-elements controlling gene expression. They can be classified into two types by the degree of imprecision of their transcription start sites: peak promoters, which initiate transcription from a narrow genomic region; and broad promoters, which initiate transcription from a wide-ranging region. Eukaryotic transcription initiation is suggested to be associated with the genomic positions and modifications of nucleosomes. For instance, it has been recently shown that histone with H3K9 acetylation (H3K9ac) is more likely to be distributed around broad promoters rather than peak promoters; it can thus be inferred that there is an association between histone H3K9 and promoter architecture. Results: Here, we performed a systematic analysis of transcription promoters and gene expression, as well as of epigenetic histone behaviors, including genomic position, stability within the chromatin, and several modifications. We found that, in humans, broad promoters, but not peak promoters, generally had significant associations with nucleosome positioning and modification. Specifically, around broad promoters histones were highly distributed and aligned in an orderly fashion. This feature was more evident with histones that were methylated or acetylated; moreover, the nucleosome positions around the broad promoters were more stable than those around the peak ones. More strikingly, the overall expression levels of genes associated with broad promoters (but not peak promoters) with modified histones were significantly higher than the levels of genes associated with broad promoters with unmodified histones. Conclusion: These results shed light on how epigenetic regulatory networks of histone modifications are associated with promoter architecture.

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

PubMed Central

Transcription Initiation Patterns Indicate Divergent Strategies for Gene Regulation at the Chromatin Level

The application of deep sequencing to map 5′ capped transcripts has confirmed the existence of at least two distinct promoter classes in metazoans: “focused” promoters with transcription start sites (TSSs) that occur in a narrowly defined genomic span and “dispersed” promoters with TSSs that are spread over a larger window. Previous studies have explored the presence of genomic features, such as CpG islands and sequence motifs, in these promoter classes, but virtually no studies have directly investigated the relationship with chromatin features. Here, we show that promoter classes are significantly differentiated by nucleosome organization and chromatin structure. Dispersed promoters display higher associations with well-positioned nucleosomes downstream of the TSS and a more clearly defined nucleosome free region upstream, while focused promoters have a less organized nucleosome structure, yet higher presence of RNA polymerase II. These differences extend to histone variants (H2A.Z) and marks (H3K4 methylation), as well as insulator binding (such as CTCF), independent of the expression levels of affected genes. Notably, differences are conserved across mammals and flies, and they provide for a clearer separation of promoter architectures than the presence and absence of CpG islands or the occurrence of stalled RNA polymerase. Computational models support the stronger contribution of chromatin features to the definition of dispersed promoters compared to focused start sites. Our results show that promoter classes defined from 5′ capped transcripts not only reflect differences in the initiation process at the core promoter but also are indicative of divergent transcriptional programs established within gene-proximal nucleosome organization.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

MDC Repository